100+ datasets found
  1. d

    Data Matching Imputation System

    • catalog.data.gov
    • fisheries.noaa.gov
    • +1more
    Updated Oct 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact, Custodian) (2024). Data Matching Imputation System [Dataset]. https://catalog.data.gov/dataset/data-matching-imputation-system1
    Explore at:
    Dataset updated
    Oct 19, 2024
    Dataset provided by
    (Point of Contact, Custodian)
    Description

    The DMIS dataset is a flat file record of the matching of several data set collections. Primarily it consists of VTRs, dealer records, Observer data in conjunction with vessel permit information for the purpose of supporting North East Regional quota monitoring projects.

  2. n

    Data from: Comparison of photo-matching algorithms commonly used for...

    • data.niaid.nih.gov
    • search.dataone.org
    • +1more
    zip
    Updated Jun 9, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maximilian Matthé; Marco Sannolo; Kristopher Winiarski; Annemarieke Spitzen - van der Sluijs; Daniel Goedbloed; Sebastian Steinfartz; Ulrich Stachow (2018). Comparison of photo-matching algorithms commonly used for photographic capture-recapture studies [Dataset]. http://doi.org/10.5061/dryad.4f0bh
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 9, 2018
    Dataset provided by
    University of Massachusetts Amherst
    Leibniz Centre for Agricultural Landscape Research
    Technische Universität Braunschweig
    Universidade do Porto
    Reptile, Amphibian and Fish Conservation the Netherlands; Nijmegen The Netherlands
    Vodafone Chair Mobile Communication Systems; Technical University Dresden; Dresden Germany
    Authors
    Maximilian Matthé; Marco Sannolo; Kristopher Winiarski; Annemarieke Spitzen - van der Sluijs; Daniel Goedbloed; Sebastian Steinfartz; Ulrich Stachow
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Photographic capture–recapture is a valuable tool for obtaining demographic information on wildlife populations due to its noninvasive nature and cost-effectiveness. Recently, several computer-aided photo-matching algorithms have been developed to more efficiently match images of unique individuals in databases with thousands of images. However, the identification accuracy of these algorithms can severely bias estimates of vital rates and population size. Therefore, it is important to understand the performance and limitations of state-of-the-art photo-matching algorithms prior to implementation in capture–recapture studies involving possibly thousands of images. Here, we compared the performance of four photo-matching algorithms; Wild-ID, I3S Pattern+, APHIS, and AmphIdent using multiple amphibian databases of varying image quality. We measured the performance of each algorithm and evaluated the performance in relation to database size and the number of matching images in the database. We found that algorithm performance differed greatly by algorithm and image database, with recognition rates ranging from 100% to 22.6% when limiting the review to the 10 highest ranking images. We found that recognition rate degraded marginally with increased database size and could be improved considerably with a higher number of matching images in the database. In our study, the pixel-based algorithm of AmphIdent exhibited superior recognition rates compared to the other approaches. We recommend carefully evaluating algorithm performance prior to using it to match a complete database. By choosing a suitable matching algorithm, databases of sizes that are unfeasible to match “by eye” can be easily translated to accurate individual capture histories necessary for robust demographic estimates.

  3. o

    Data and Code for: Correlation Neglect in Student-to-School Matching

    • openicpsr.org
    delimited
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alex Rees-Jones; Ran Shorrer; Chloe Tergiman (2023). Data and Code for: Correlation Neglect in Student-to-School Matching [Dataset]. http://doi.org/10.3886/E192088V1
    Explore at:
    delimitedAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    American Economic Association
    Authors
    Alex Rees-Jones; Ran Shorrer; Chloe Tergiman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2019 - 2022
    Area covered
    United States
    Description

    Data and Code to accompany the paper "Correlation Neglect in Student-to-School Matching."Abstract: We present results from three experiments containing incentivized school-choice scenarios. In these scenarios, we vary whether schools' assessments of students are based on a common priority (inducing correlation in admissions decisions) or are based on independent assessments (eliminating correlation in admissions decisions). The quality of students' application strategies declines in the presence of correlated admissions: application strategies become substantially more aggressive and fail to include attractive ``safety'' options. We provide a battery of tests suggesting that this phenomenon is at least partially driven by correlation neglect, and we discuss implications for the design and deployment of student-to-school matching mechanisms.

  4. LoL Match History & Summoner Data – 78k Matches

    • kaggle.com
    zip
    Updated Oct 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nsmall (2025). LoL Match History & Summoner Data – 78k Matches [Dataset]. https://www.kaggle.com/datasets/nathansmallcalder/lol-match-history-and-summoner-data-80k-matches
    Explore at:
    zip(3770315 bytes)Available download formats
    Dataset updated
    Oct 20, 2025
    Authors
    nsmall
    Description

    League of Legends Relational Database for Match Prediction

    Context

    This dataset contains detailed match and player data from League of Legends, one of the most popular multiplayer online battle arena (MOBA) games in the world. It includes 35,000 matches and contains 78,000 summoner statistics, capturing a wide range of in-game statistics, such as champion selection, player performance metrics, match outcomes, and more.

    The dataset is structured to support a variety of analyses, including:

    • Predicting match outcomes based on team compositions and player stats
    • Evaluating player performance and progression over time
    • Exploring trends in champion popularity and win rates
    • Building machine learning models for esports analytics

    Whether you are interested in competitive gaming, data science, or predictive modeling, this dataset provides a rich source of structured data to explore the dynamics of League of Legends at scale.

    Data Schema and Dictionary

    Data was collected from Riot Games API using Python script(link) from Patch 25.19

    The datase consists of 7 csv files:

    • MatchStatsTbl - Match Stats given a summonerID and MatchID.Contains K/D/A, Items, Runes,Ward Score, Summoner Spells, Baron Kills, Dragon Kills, Lane, DmgTaken/Dealt, Total Gold, cs,Mastery Points and Win/Loss
    • TeamMatchStatsTbl - Containes Red/Blue Champions,Red/Blue BaronKills,Blue/Red Turret Kills, Red/Blue Kills, RiftHearaldKills and Win/loss
    • MatchTbl- Contains MatchID,Rank,Match Duration and MatchType.
    • RankTbl - Contains RankID and RankName
    • ChampionTbl- Contains ChampionID and ChampionName
    • ItemTbl - Contains ItemID and ItemName
    • SummonerTbl - Contains SummonerID and SummonerName
    • SummonerMatchTbl - Links MatchID,SummonerID and ChampionID

    Database Features

    • This dataset contains 35,422 League of Legends matches and 78,863 summoner statistics from those games.
    • Uses Data from over 2,381 summoners.
    • Consists of data only from Europe West(EUW)
    • Data is sampled from Unranked to Challenger tiers.

    Database Setup

    -MySQL Database using Linux -Database Schema Script can be found here. (Works with the gtihub project to collect your own data)

    Limitations

    The Riot API only provides the "BOTTOM" lane for bot-lane players. During Data collection, roles were inferred by combining chapions that often played support with CS metrics to distinguish ADC vs Support — especially for ambiguous picks like Senna or off-meta choices.

    Acknowledgements/Privacy

    Data is collected using the official Riot Games API. We thank Riot Games for providing the data and tools that make this project possible. This dataset is not endorsed or certified by Riot Games. No personal or identifiable player data (e.g., Summoner Names, Summoner IDs, or PUUIDs) are included. The SummonerTbl has been intentionally excluded from this public release.

    Github

    The Python scripts used for data collection, as well as various scripts I developed for API calls, database management, and initial data analytics, can be found on GitHub

  5. ONC Patient Matching Algorithm Challenge Data

    • linkagelibrary.icpsr.umich.edu
    Updated Sep 20, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of the National Coordinator for Health (2019). ONC Patient Matching Algorithm Challenge Data [Dataset]. http://doi.org/10.3886/E111962V1
    Explore at:
    Dataset updated
    Sep 20, 2019
    Dataset authored and provided by
    Office of the National Coordinator for Health
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The goal of the Patient Matching Algorithm Challenge is to bring about greater transparency and data on the performance of existing patient matching algorithms, spur the adoption of performance metrics for patient data matching algorithm vendors, and positively impact other aspects of patient matching such as deduplication and linking to clinical data. Participants will be provided a data set and will have their answers evaluated and scored against a master key. Up to 6 cash prizes will be awarded with a total purse of up to $75,000.00.https://www.patientmatchingchallenge.com/The test dataset used in the ONC Patient Matching Algorithm Challenge is available for download by students, researchers, or anyone else interested in additional analysis and patient matching algorithm development. More information about the Patient Matching Algorithm Challenge can be found: https://www.patientmatchingchallenge.com/.The dataset containing 1 million patients was split into eight files of alphabetical groupings by the the patient's last name, plus an additional file containing test patients with no last name recorded (Null). All files should be downloaded and merged for analysis.https://github.com/onc-healthit/patient-matching

  6. f

    Matching results of drug-disease-gene top 100 paths with database.

    • datasetcatalog.nlm.nih.gov
    Updated Jun 13, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Li, Daifeng; Song, Min; Bu, Yi; Madden, Andrew; Liang, Xiaomin; Ding, Ying (2019). Matching results of drug-disease-gene top 100 paths with database. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000118676
    Explore at:
    Dataset updated
    Jun 13, 2019
    Authors
    Li, Daifeng; Song, Min; Bu, Yi; Madden, Andrew; Liang, Xiaomin; Ding, Ying
    Description

    “Number of triplets” is the number of triplets in specific relation involved in top 100 paths. “Predictions” is the number of relations neither in data sets nor in chosen databases. “Proven predictions” is the number of relations not in data sets but matched with chosen databases.

  7. Counter Strike 2 Match Data/Status for Betting

    • kaggle.com
    Updated Jan 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Victor Picinin (2024). Counter Strike 2 Match Data/Status for Betting [Dataset]. https://www.kaggle.com/datasets/victorpicinin/counter-strike-2-hltv-match-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 21, 2024
    Dataset provided by
    Kaggle
    Authors
    Victor Picinin
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Counter Strike Match Data

    This dataset covers competitive gaming matches from Nov '23 to 12/01/2024. Player statuses are collected 90 days prior to each match, specifically from the map where the match occurred. It includes team rankings and team map winrates.

    Key Features:

    Temporal Context:

    Snapshot of gaming from Nov '23 to 12/01/2024, with player statuses reflecting the 90 days before each match.

    Map-Specific Player Performance:

    Player stats linked to the map of each match.

    Team Map Winrate: Insight into team proficiency on specific maps.

    Potential Use Cases:

    Performance Evolution Analysis:

    Track how player statuses were over the 90 days before the match.

    Competitive Landscape Assessment:

    Evaluate team dominance, dynamics, and trends.

    Betting Analysis:

    Provide valuable insights for betting enthusiasts by leveraging historical performance data for informed decisions.

    Disclaimer: Efforts have been made for accuracy, but gaming variability may impact findings. Cross-reference with additional data for thorough analysis.

  8. u

    MIVIA ARG Dataset

    • mivia.unisa.it
    • zenodo.org
    text/vf-format
    Updated Jan 1, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MIVIA Lab (2013). MIVIA ARG Dataset [Dataset]. http://doi.org/10.1016/S0167-8655(02)00253-2
    Explore at:
    text/vf-formatAvailable download formats
    Dataset updated
    Jan 1, 2013
    Dataset authored and provided by
    MIVIA Lab
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The ARG Database is a huge collection of labeled and unlabeled graphs realized by the MIVIA Group. The aim of this collection is to provide the graph research community with a standard test ground for the benchmarking of graph matching algorithms.

  9. Data from: Web Data Commons Training and Test Sets for Large-Scale Product...

    • linkagelibrary.icpsr.umich.edu
    • da-ra.de
    Updated Nov 26, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ralph Peeters; Anna Primpeli; Christian Bizer (2020). Web Data Commons Training and Test Sets for Large-Scale Product Matching - Version 2.0 [Dataset]. http://doi.org/10.3886/E127481V1
    Explore at:
    Dataset updated
    Nov 26, 2020
    Dataset provided by
    University of Mannheim (Germany)
    Authors
    Ralph Peeters; Anna Primpeli; Christian Bizer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Many e-shops have started to mark-up product data within their HTML pages using the schema.org vocabulary. The Web Data Commons project regularly extracts such data from the Common Crawl, a large public web crawl. The Web Data Commons Training and Test Sets for Large-Scale Product Matching contain product offers from different e-shops in the form of binary product pairs (with corresponding label “match” or “no match”) for four product categories, computers, cameras, watches and shoes. In order to support the evaluation of machine learning-based matching methods, the data is split into training, validation and test sets. For each product category, we provide training sets in four different sizes (2.000-70.000 pairs). Furthermore there are sets of ids for each training set for a possible validation split (stratified random draw) available. The test set for each product category consists of 1.100 product pairs. The labels of the test sets were manually checked while those of the training sets were derived using shared product identifiers from the Web weak supervision. The data stems from the WDC Product Data Corpus for Large-Scale Product Matching - Version 2.0 which consists of 26 million product offers originating from 79 thousand websites. For more information and download links for the corpus itself, please follow the links below.

  10. Dataset for matching estimation, based on Social Diagnosis, Statistics...

    • figshare.com
    txt
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Łukasz Skrok (2023). Dataset for matching estimation, based on Social Diagnosis, Statistics Poland and Orliki databases. [Dataset]. http://doi.org/10.6084/m9.figshare.10109261.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Łukasz Skrok
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Poland
    Description

    To construct the datasaet, firstly, the Social Diagnosis (SD) data was used – a dataset based on results of a representative, longitudinal study conducted by an interdisciplinary board of academics in 2000-2015 [[i]]. Furthermore, the SD dataset has been combined with two datasets on sports infrastructure: The Statistics Poland (SP), conducted every four years, the latest, from 2014, was used for our analysis [[ii]] and the “Orliki” database, containing information on facilities constructed in 2008–2012 within a publicly-funded programme [[iii]]. The data has been combined by calculation indicators of availability: number of sports facilities of particular types per capita, at the NUTS (Classification of Territorial Units for Statistics) 3 level.

    1. Czapiński J, Panek T. Social diagnosis. 2015. Available from: http://www.diagnoza.com/.

    2. Statistics Poland. Surveys on sports facilities - results of KFT-OB/a (municipalities), KFT-OB/b (external administrators), KFT-1 (sport clubs) surveys. Warsaw: Statistics Poland. 2015.

    3. Biernat E, Piątkowska M, Zembura P, Gołdys A. Problem zarządzania orlikami z perspektywy animatorów z gmin wiejskich i miejskich [Challenges of management of Orlik pitches in the perspective of the animators from municipal and rural communities]. Przeds Zarz. 2017;18(8):429–44. Polish.

  11. Data from: Fast Bayesian Record Linkage With Record-Specific Disagreement...

    • tandf.figshare.com
    txt
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thomas Stringham (2023). Fast Bayesian Record Linkage With Record-Specific Disagreement Parameters [Dataset]. http://doi.org/10.6084/m9.figshare.14687696.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    Thomas Stringham
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Researchers are often interested in linking individuals between two datasets that lack a common unique identifier. Matching procedures often struggle to match records with common names, birthplaces, or other field values. Computational feasibility is also a challenge, particularly when linking large datasets. We develop a Bayesian method for automated probabilistic record linkage and show it recovers more than 50% more true matches, holding accuracy constant, than comparable methods in a matching of military recruitment data to the 1900 U.S. Census for which expert-labeled matches are available. Our approach, which builds on a recent state-of-the-art Bayesian method, refines the modeling of comparison data, allowing disagreement probability parameters conditional on nonmatch status to be record-specific in the smaller of the two datasets. This flexibility significantly improves matching when many records share common field values. We show that our method is computationally feasible in practice, despite the added complexity, with an R/C++ implementation that achieves a significant improvement in speed over comparable recent methods. We also suggest a lightweight method for treatment of very common names and show how to estimate true positive rate and positive predictive value when true match status is unavailable.

  12. Pupil parent matched data (PPMD): how we use and share data

    • gov.uk
    • s3.amazonaws.com
    Updated Feb 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Education (2024). Pupil parent matched data (PPMD): how we use and share data [Dataset]. https://www.gov.uk/government/publications/pupil-parent-matched-data-ppmd-how-we-use-and-share-data
    Explore at:
    Dataset updated
    Feb 20, 2024
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Education
    Description

    We are analysing family circumstances and education by matching parent and pupil data. The data compares household income and educational outcomes of pupils in England.

    Read more information about how we share student and workforce data.

    To ensure this privacy notice is up to date, we will review this information annually.

  13. n

    Glycan Mass Spectral Database (GMDB)

    • neuinfo.org
    • dknet.org
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Glycan Mass Spectral Database (GMDB) [Dataset]. http://identifiers.org/RRID:SCR_014667
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    A multistage tandem mass spectral database using a variety of structurally defined glycans. It provides tools for glycomics research that enable users to identify glycans by spectral matching. The database stores MS2, MS3, and MS4 spectra of N-and O-linked glycans, and glycolipid glycans as well as the partial structures of these glycans.

  14. H

    Replication Data for: Matching with Text Data: An Experimental Evaluation of...

    • dataverse.harvard.edu
    Updated Dec 24, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reagan Mozer (2019). Replication Data for: Matching with Text Data: An Experimental Evaluation of Methods for Matching Documents and of Measuring Match Quality [Dataset]. http://doi.org/10.7910/DVN/K8IL3V
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 24, 2019
    Dataset provided by
    Harvard Dataverse
    Authors
    Reagan Mozer
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This repository contains the materials needed to replicate the results presented in Mozer et al. (2019), "Matching with Text Data: An Experimental Evaluation of Methods for Matching Documents and of Measuring Match Quality", forthcoming in Political Analysis.

  15. H

    Replication Data for: Looking for twins: how to build better counterfactuals...

    • dataverse.harvard.edu
    Updated Feb 3, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stefano Costalli; Fedra Negri (2021). Replication Data for: Looking for twins: how to build better counterfactuals with matching [Dataset]. http://doi.org/10.7910/DVN/CYZFCC
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 3, 2021
    Dataset provided by
    Harvard Dataverse
    Authors
    Stefano Costalli; Fedra Negri
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    A primary challenge for researchers that make use of observational data is selection bias (i.e., the units of analysis exhibit systematic differences and dis-homogeneities due to non-random selection into treatment). This article encourages researchers in acknowledging this problem and discusses how and - more importantly - under which assumptions they may resort to statistical matching techniques to reduce the imbalance in the empirical distribution of pre-treatment observable variables between the treatment and control groups. With the aim of providing a practical guidance, the article engages with the evaluation of the effectiveness of peacekeeping missions in the case of the Bosnian civil war, a research topic in which selection bias is a structural feature of the observational data researchers have to use, and shows how to apply the Coarsened Exact Matching (CEM), the most widely used matching algorithm in the fields of Political Science and International Relations.

  16. d

    Data release for Using social-context matching to improve value-transfer...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Oct 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Data release for Using social-context matching to improve value-transfer performance for cultural ecosystem service models [Dataset]. https://catalog.data.gov/dataset/data-release-for-using-social-context-matching-to-improve-value-transfer-performance-for-c
    Explore at:
    Dataset updated
    Oct 30, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    Recreational and aesthetic enjoyment of public lands is increasing across a wide range of activities, highlighting the need to assess and adapt management to accommodate these uses. Despite a growing number of studies on mapping cultural ecosystem services, most are local- scale assessments that rely on costly and time-consuming primary data collection. As a result, the availability of spatial information on non-market values associated with cultural ecosystem services (social values) remains limited. Spatial function transfer, if it could be justified for social-value models, would expedite the development of social-value information and promote its more regular inclusion in ecosystem service assessments. We used survey data from six national forests in Colorado and Wyoming to explore the potential for transferring cultural ecosystem service models between forests and specifically to test the hypothesis that transfer performance increases with social-context similarity between transferring and receiving areas. Results confirm this relationship but fall just short of being able to predict with certainty when transferred models will meet the minimum performance criterion needed for defensible use by managers. Social values are highly variable and can be difficult to predict, but our results suggest that with the right combination of indicators that spatial function transfer can become a defensible means of generating social-value information when primary data collection is not feasible.

  17. h

    celeb-face-matching-data

    • huggingface.co
    Updated Oct 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammad Zaid (2025). celeb-face-matching-data [Dataset]. https://huggingface.co/datasets/mzhappyface/celeb-face-matching-data
    Explore at:
    Dataset updated
    Oct 14, 2025
    Authors
    Mohammad Zaid
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    mzhappyface/celeb-face-matching-data dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. h

    task-matching-data

    • huggingface.co
    Updated Apr 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NOOR FATHIMA (2025). task-matching-data [Dataset]. https://huggingface.co/datasets/Noor-ai/task-matching-data
    Explore at:
    Dataset updated
    Apr 4, 2025
    Authors
    NOOR FATHIMA
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Noor-ai/task-matching-data dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. k

    Data from: The U.S. Syndicated Loan Market: Matching Data

    • kansascityfed.org
    pdf
    Updated May 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). The U.S. Syndicated Loan Market: Matching Data [Dataset]. https://www.kansascityfed.org/research/research-working-papers/us-syndicated-loan-market-matching-data-2018/
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 13, 2024
    Description

    A simple, replicable methodology can help researchers link corporate loan datasets.

  20. t

    Data from: Transformer-based Map Matching Model with Limited Ground-Truth...

    • service.tib.eu
    • resodate.org
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Transformer-based Map Matching Model with Limited Ground-Truth Data using Transfer-Learning Approach [Dataset]. https://service.tib.eu/ldmservice/dataset/transformer-based-map-matching-model-with-limited-ground-truth-data-using-transfer-learning-approach
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    This study proposes a framework for developing a novel deep learning-based map-matching model in the limited ground-truth data environment.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(Point of Contact, Custodian) (2024). Data Matching Imputation System [Dataset]. https://catalog.data.gov/dataset/data-matching-imputation-system1

Data Matching Imputation System

Explore at:
4 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Oct 19, 2024
Dataset provided by
(Point of Contact, Custodian)
Description

The DMIS dataset is a flat file record of the matching of several data set collections. Primarily it consists of VTRs, dealer records, Observer data in conjunction with vessel permit information for the purpose of supporting North East Regional quota monitoring projects.

Search
Clear search
Close search
Google apps
Main menu