100+ datasets found
  1. o

    Data and Code for: Correlation Neglect in Student-to-School Matching

    • openicpsr.org
    delimited
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alex Rees-Jones; Ran Shorrer; Chloe Tergiman (2023). Data and Code for: Correlation Neglect in Student-to-School Matching [Dataset]. http://doi.org/10.3886/E192088V1
    Explore at:
    delimitedAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    American Economic Association
    Authors
    Alex Rees-Jones; Ran Shorrer; Chloe Tergiman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2019 - 2022
    Area covered
    United States
    Description

    Data and Code to accompany the paper "Correlation Neglect in Student-to-School Matching."Abstract: We present results from three experiments containing incentivized school-choice scenarios. In these scenarios, we vary whether schools' assessments of students are based on a common priority (inducing correlation in admissions decisions) or are based on independent assessments (eliminating correlation in admissions decisions). The quality of students' application strategies declines in the presence of correlated admissions: application strategies become substantially more aggressive and fail to include attractive ``safety'' options. We provide a battery of tests suggesting that this phenomenon is at least partially driven by correlation neglect, and we discuss implications for the design and deployment of student-to-school matching mechanisms.

  2. d

    Data Matching Imputation System

    • catalog.data.gov
    • datadiscoverystudio.org
    • +2more
    Updated Oct 19, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact, Custodian) (2024). Data Matching Imputation System [Dataset]. https://catalog.data.gov/dataset/data-matching-imputation-system1
    Explore at:
    Dataset updated
    Oct 19, 2024
    Dataset provided by
    (Point of Contact, Custodian)
    Description

    The DMIS dataset is a flat file record of the matching of several data set collections. Primarily it consists of VTRs, dealer records, Observer data in conjunction with vessel permit information for the purpose of supporting North East Regional quota monitoring projects.

  3. Data from: Comparison of photo-matching algorithms commonly used for...

    • zenodo.org
    • data.niaid.nih.gov
    • +2more
    application/gzip
    Updated May 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maximilian Matthé; Marco Sannolo; Kristopher Winiarski; Annemarieke Spitzen - van der Sluijs; Daniel Goedbloed; Sebastian Steinfartz; Ulrich Stachow; Maximilian Matthé; Marco Sannolo; Kristopher Winiarski; Annemarieke Spitzen - van der Sluijs; Daniel Goedbloed; Sebastian Steinfartz; Ulrich Stachow (2022). Data from: Comparison of photo-matching algorithms commonly used for photographic capture-recapture studies [Dataset]. http://doi.org/10.5061/dryad.4f0bh
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    May 28, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Maximilian Matthé; Marco Sannolo; Kristopher Winiarski; Annemarieke Spitzen - van der Sluijs; Daniel Goedbloed; Sebastian Steinfartz; Ulrich Stachow; Maximilian Matthé; Marco Sannolo; Kristopher Winiarski; Annemarieke Spitzen - van der Sluijs; Daniel Goedbloed; Sebastian Steinfartz; Ulrich Stachow
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Photographic capture–recapture is a valuable tool for obtaining demographic information on wildlife populations due to its noninvasive nature and cost-effectiveness. Recently, several computer-aided photo-matching algorithms have been developed to more efficiently match images of unique individuals in databases with thousands of images. However, the identification accuracy of these algorithms can severely bias estimates of vital rates and population size. Therefore, it is important to understand the performance and limitations of state-of-the-art photo-matching algorithms prior to implementation in capture–recapture studies involving possibly thousands of images. Here, we compared the performance of four photo-matching algorithms; Wild-ID, I3S Pattern+, APHIS, and AmphIdent using multiple amphibian databases of varying image quality. We measured the performance of each algorithm and evaluated the performance in relation to database size and the number of matching images in the database. We found that algorithm performance differed greatly by algorithm and image database, with recognition rates ranging from 100% to 22.6% when limiting the review to the 10 highest ranking images. We found that recognition rate degraded marginally with increased database size and could be improved considerably with a higher number of matching images in the database. In our study, the pixel-based algorithm of AmphIdent exhibited superior recognition rates compared to the other approaches. We recommend carefully evaluating algorithm performance prior to using it to match a complete database. By choosing a suitable matching algorithm, databases of sizes that are unfeasible to match "by eye" can be easily translated to accurate individual capture histories necessary for robust demographic estimates.

  4. H

    Data for: "Linking Datasets on Organizations Using Half a Billion...

    • dataverse.harvard.edu
    Updated Jan 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Connor Jerzak (2025). Data for: "Linking Datasets on Organizations Using Half a Billion Open-Collaborated Records" [Dataset]. http://doi.org/10.7910/DVN/EHRQQL
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 13, 2025
    Dataset provided by
    Harvard Dataverse
    Authors
    Connor Jerzak
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Abstract: Scholars studying organizations often work with multiple datasets lacking shared unique identifiers or covariates. In such situations, researchers usually use approximate string (``fuzzy'') matching methods to combine datasets. String matching, although useful, faces fundamental challenges. Even when two strings appear similar to humans, fuzzy matching often does not work because it fails to adapt to the informativeness of the character combinations. In response, a number of machine-learning methods have been developed to refine string matching. Yet, the effectiveness of these methods is limited by the size and diversity of training data. This paper introduces data from a prominent employment networking site (LinkedIn) as a massive training corpus to address these limitations. We show how, by leveraging information from LinkedIn regarding organizational name-to-name links, we can improve upon existing matching benchmarks, incorporating the trillions of name pair examples from LinkedIn into various methods to improve performance by explicitly maximizing match probabilities inferred from the LinkedIn corpus. We also show how relationships between organization names can be modeled using a network representation of the LinkedIn data. In illustrative merging tasks involving lobbying firms, we document improvements when using the LinkedIn corpus in matching calibration and make all data and methods open source. Keywords: Record linkage; Interest groups; Text as data; Unstructured data

  5. ListOfDBPediaUnmappedClass

    • figshare.com
    txt
    Updated May 1, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rafiqul Haque (2016). ListOfDBPediaUnmappedClass [Dataset]. http://doi.org/10.6084/m9.figshare.3206587.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 1, 2016
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Rafiqul Haque
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This file contains a list of unmapped classes of DBPedia ontologies.

  6. w

    Web Data Commons - The WDC Data Training Dataset and Gold Standard for...

    • webdatacommons.org
    json
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian Bizer; Anna Primpeli; Ralph Peeters, Web Data Commons - The WDC Data Training Dataset and Gold Standard for Large-Scale Product Matching [Dataset]. http://www.webdatacommons.org/largescaleproductcorpus/
    Explore at:
    jsonAvailable download formats
    Authors
    Christian Bizer; Anna Primpeli; Ralph Peeters
    Description

    The training dataset consisting of 20 million pairs of product offers referring to the same products. The offers were extracted from 43 thousand e-shops which provide schema.org annotations including some form of product ID such as a GTIN or MPN. We also created a gold standard by manually verifying 2000 pairs of offers belonging to four different product categories.

  7. Datasets from Approximate equality of character strings and its application...

    • zenodo.org
    • data.niaid.nih.gov
    bin, txt, zip
    Updated Apr 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan Dobiášovský; Jan Dobiášovský (2025). Datasets from Approximate equality of character strings and its application to record linkage in metadata of scientific publications thesis [Dataset]. http://doi.org/10.5281/zenodo.3785363
    Explore at:
    zip, bin, txtAvailable download formats
    Dataset updated
    Apr 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jan Dobiášovský; Jan Dobiášovský
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The datasets were produced in my thesis project. The thesis (in Czech language) explores the application of approximate string matching in scientific publication record linkage process. An introduction to record matching along with five commonly used metrics for string distance (Levenshtein, Jaro, Jaro-Winkler, Cosine distances and Jaccard coefficient) are provided. These metrics are applied on publication metadata from V3S current research information system of the Czech Technical University in Prague. Based on the findings, optimal thresholds in the F1, F2 and F3-measures are determined for each metric.

    Thesis citation:
    DOBIÁŠOVSKÝ, Jan. Approximate equality of character strings and its application to record linkage in metadata of scientific publications [online]. Praha, 2020 [cit. 2020-05-04]. Masters thesis. Charles University. Faculty of Arts. Institute of Information Studies and Librarianship.

  8. i

    Map Matching Dataset

    • ieee-dataport.org
    Updated Oct 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Youliang chen (2023). Map Matching Dataset [Dataset]. https://ieee-dataport.org/documents/map-matching-dataset
    Explore at:
    Dataset updated
    Oct 10, 2023
    Authors
    Youliang chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    China

  9. R

    Data from: Image Matching System Dataset

    • universe.roboflow.com
    zip
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hyojin (2023). Image Matching System Dataset [Dataset]. https://universe.roboflow.com/hyojin-jwabh/image-matching-system
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset authored and provided by
    Hyojin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Recaptcha
    Description

    Image Matching System

    ## Overview
    
    Image Matching System is a dataset for classification tasks - it contains Recaptcha annotations for 8,828 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  10. P

    Data from: ICM Dataset

    • paperswithcode.com
    Updated Jul 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng (2022). ICM Dataset [Dataset]. https://paperswithcode.com/dataset/icm
    Explore at:
    Dataset updated
    Jul 23, 2022
    Authors
    Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng
    Description

    ICM is curated for the image-text matching task. Each image has a corresponding caption text, which describes the image in detail. We first use CTR to select the most relevant pairs. Then, human annotators manually perform a 2nd round manual correction, obtaining 400,000 image-text pairs, including 200,000 positive cases and 200,000 negative cases. We keep the ratio of positive and negative pairs consistent in each of the train/val/test sets.

  11. H

    Replication Data for: Matching Methods for Causal Inference with Time-Series...

    • dataverse.harvard.edu
    Updated Oct 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kosuke Imai; In Song Kim; Erik Wang (2021). Replication Data for: Matching Methods for Causal Inference with Time-Series Cross-Section Data [Dataset]. http://doi.org/10.7910/DVN/ZTDHVE
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 13, 2021
    Dataset provided by
    Harvard Dataverse
    Authors
    Kosuke Imai; In Song Kim; Erik Wang
    License

    https://dataverse.harvard.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.7910/DVN/ZTDHVEhttps://dataverse.harvard.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.7910/DVN/ZTDHVE

    Description

    Matching methods improve the validity of causal inference by reducing model dependence and offering intuitive diagnostics. While they have become a part of the standard tool kit across disciplines, matching methods are rarely used when analyzing time-series cross-sectional data. We fill this methodological gap. In the proposed approach, we first match each treated observation with control observations from other units in the same time period that have an identical treatment history up to the pre-specified number of lags. We use standard matching and weighting methods to further refine this matched set so that the treated and matched control observations have similar covariate values. Assessing the quality of matches is done by examining covariate balance. Finally, we estimate both short-term and long-term average treatment effects using the difference-in-differences estimator, accounting for a time trend. We illustrate the proposed methodology through simulation and empirical studies. An open-source software package is available for implementing the proposed methods.

  12. w

    Dataset of books called Query matching in a BitTorrent-based P2P database...

    • workwithdata.com
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of books called Query matching in a BitTorrent-based P2P database system [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=Query+matching+in+a+BitTorrent-based+P2P+database+system
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about books. It has 1 row and is filtered where the book is Query matching in a BitTorrent-based P2P database system. It features 7 columns including author, publication date, language, and book publisher.

  13. u

    MIVIA ARG Dataset

    • mivia.unisa.it
    • zenodo.org
    text/vf-format
    Updated Jan 1, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MIVIA Lab (2013). MIVIA ARG Dataset [Dataset]. http://doi.org/10.1016/S0167-8655(02)00253-2
    Explore at:
    text/vf-formatAvailable download formats
    Dataset updated
    Jan 1, 2013
    Dataset authored and provided by
    MIVIA Lab
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The ARG Database is a huge collection of labeled and unlabeled graphs realized by the MIVIA Group. The aim of this collection is to provide the graph research community with a standard test ground for the benchmarking of graph matching algorithms.

  14. H

    Replication Data for: Why Propensity Scores Should Not Be Used for Matching

    • dataverse.harvard.edu
    Updated Jan 28, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Richard Nielsen; Gary King (2019). Replication Data for: Why Propensity Scores Should Not Be Used for Matching [Dataset]. http://doi.org/10.7910/DVN/A9LZNV
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 28, 2019
    Dataset provided by
    Harvard Dataverse
    Authors
    Richard Nielsen; Gary King
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Abstract: We show that propensity score matching (PSM), an enormously popular method of preprocessing data for causal inference, often accomplishes the opposite of its intended goal — thus increasing imbalance, inefficiency, model dependence, and bias. The weakness of PSM comes from its attempts to approximate a completely randomized experiment, rather than, as with other matching methods, a more efficient fully blocked randomized experiment. PSM is thus uniquely blind to the often large portion of imbalance that can be eliminated by approximating full blocking with other matching methods. Moreover, in data balanced enough to approximate complete randomization, either to begin with or after pruning some observations, PSM approximates random matching which, we show, increases imbalance even relative to the original data. Although these results suggest researchers replace PSM with one of the other available matching methods, propensity scores have other productive uses.

  15. Data for: Two-Sided Matching for mentor-mentee allocations - Algorithms and...

    • zenodo.org
    • explore.openaire.eu
    bin, csv
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian Haas; Margeret Hall; Christian Haas; Margeret Hall (2020). Data for: Two-Sided Matching for mentor-mentee allocations - Algorithms and manipulation strategies [Dataset]. http://doi.org/10.5281/zenodo.2555099
    Explore at:
    csv, binAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Christian Haas; Margeret Hall; Christian Haas; Margeret Hall
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These are the data files for the PLOS ONE journal article "Two-Sided Matching for mentor-mentee allocations - Algorithms and manipulation strategies".

    Three files are provided:

    - Data.xlsx: An overview of the original preferences of mentors and mentee, a data dictionary, and two summary tables used to create figures in the manuscript

    - MatchingTables.csv: The outcome matching tables for each simulated scenario and repetition

    - Preferences.csv: The (un)manipulated preferences that were used as input to calculate the solution for each simulated scenario and repetition.

  16. Data from: Stochastic Matching DataSet

    • kaggle.com
    Updated Jul 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    knightwayne (2023). Stochastic Matching DataSet [Dataset]. https://www.kaggle.com/datasets/knightwayne/stochastic-matching-dataset/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 2, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    knightwayne
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
    License information was derived automatically

    Description

    Dataset

    This dataset was created by knightwayne

    Released under CC BY-SA 3.0

    Contents

  17. Data from: Matching Treatment and Offender: North Carolina, 1980-1982

    • catalog.data.gov
    • gimi9.com
    • +1more
    Updated Mar 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Justice (2025). Matching Treatment and Offender: North Carolina, 1980-1982 [Dataset]. https://catalog.data.gov/dataset/matching-treatment-and-offender-north-carolina-1980-1982-bdbd9
    Explore at:
    Dataset updated
    Mar 12, 2025
    Dataset provided by
    National Institute of Justicehttp://nij.ojp.gov/
    Area covered
    North Carolina
    Description

    These data were gathered in order to evaluate the implications of rational choice theory for offender rehabilitation. The hypothesis of the research was that income-enhancing prison rehabilitation programs are most effective for the economically motivated offender. The offender was characterized by demographic and socio-economic characteristics, criminal history and behavior, and work activities during incarceration. Information was also collected on type of release and post-release recidivistic and labor market measures. Recividism was measured by arrests, convictions, and reincarcerations, length of time until first arrest after release, and seriousness of offense leading to reincarceration.

  18. e

    Web Data Commons Training and Test Sets for Large-Scale Product Matching -...

    • b2find.eudat.eu
    Updated Nov 27, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). Web Data Commons Training and Test Sets for Large-Scale Product Matching - Version 2.0 Product Matching Task derived from the WDC Product Data Corpus - Version 2.0 - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/720b440c-eda0-5182-af9f-f868ed999bd7
    Explore at:
    Dataset updated
    Nov 27, 2020
    Description

    Many e-shops have started to mark-up product data within their HTML pages using the schema.org vocabulary. The Web Data Commons project regularly extracts such data from the Common Crawl, a large public web crawl. The Web Data Commons Training and Test Sets for Large-Scale Product Matching contain product offers from different e-shops in the form of binary product pairs (with corresponding label “match” or “no match”) for four product categories, computers, cameras, watches and shoes. In order to support the evaluation of machine learning-based matching methods, the data is split into training, validation and test sets. For each product category, we provide training sets in four different sizes (2.000-70.000 pairs). Furthermore there are sets of ids for each training set for a possible validation split (stratified random draw) available. The test set for each product category consists of 1.100 product pairs. The labels of the test sets were manually checked while those of the training sets were derived using shared product identifiers from the Web weak supervision. The data stems from the WDC Product Data Corpus for Large-Scale Product Matching - Version 2.0 which consists of 26 million product offers originating from 79 thousand websites. For more information and download links for the corpus itself, please follow the links below.

  19. r

    Data from: imatch for matching in Stata

    • researchdata.edu.au
    ado, doc, txt
    Updated Jan 1, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Associate Professor Zhiqiang Wang; Associate Professor Zhiqiang Wang (2017). imatch for matching in Stata [Dataset]. http://doi.org/10.14264/UQL.2017.982
    Explore at:
    doc(60416), txt(3648), ado(3224)Available download formats
    Dataset updated
    Jan 1, 2017
    Dataset provided by
    The University of Queensland
    Authors
    Associate Professor Zhiqiang Wang; Associate Professor Zhiqiang Wang
    License

    https://guides.library.uq.edu.au/deposit-your-data/license-reuse-data-agreementhttps://guides.library.uq.edu.au/deposit-your-data/license-reuse-data-agreement

    Description

    The imatch program was written for Sata users to match different groups according to multiple variables. Program file: imatch.ado Help file: imatch.hlp

  20. h

    task-matching-data

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NOOR FATHIMA, task-matching-data [Dataset]. https://huggingface.co/datasets/Noor-ai/task-matching-data
    Explore at:
    Authors
    NOOR FATHIMA
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Noor-ai/task-matching-data dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Alex Rees-Jones; Ran Shorrer; Chloe Tergiman (2023). Data and Code for: Correlation Neglect in Student-to-School Matching [Dataset]. http://doi.org/10.3886/E192088V1

Data and Code for: Correlation Neglect in Student-to-School Matching

Explore at:
delimitedAvailable download formats
Dataset updated
Jun 6, 2023
Dataset provided by
American Economic Association
Authors
Alex Rees-Jones; Ran Shorrer; Chloe Tergiman
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Time period covered
2019 - 2022
Area covered
United States
Description

Data and Code to accompany the paper "Correlation Neglect in Student-to-School Matching."Abstract: We present results from three experiments containing incentivized school-choice scenarios. In these scenarios, we vary whether schools' assessments of students are based on a common priority (inducing correlation in admissions decisions) or are based on independent assessments (eliminating correlation in admissions decisions). The quality of students' application strategies declines in the presence of correlated admissions: application strategies become substantially more aggressive and fail to include attractive ``safety'' options. We provide a battery of tests suggesting that this phenomenon is at least partially driven by correlation neglect, and we discuss implications for the design and deployment of student-to-school matching mechanisms.

Search
Clear search
Close search
Google apps
Main menu