100+ datasets found
  1. f

    Datasets detail.

    • plos.figshare.com
    xls
    Updated May 15, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Babar; Basit Qureshi; Anis Koubaa (2024). Datasets detail. [Dataset]. http://doi.org/10.1371/journal.pone.0302539.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 15, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Muhammad Babar; Basit Qureshi; Anis Koubaa
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In recent years, Federated Learning (FL) has gained traction as a privacy-centric approach in medical imaging. This study explores the challenges posed by data heterogeneity on FL algorithms, using the COVIDx CXR-3 dataset as a case study. We contrast the performance of the Federated Averaging (FedAvg) algorithm on non-identically and independently distributed (non-IID) data against identically and independently distributed (IID) data. Our findings reveal a notable performance decline with increased data heterogeneity, emphasizing the need for innovative strategies to enhance FL in diverse environments. This research contributes to the practical implementation of FL, extending beyond theoretical concepts and addressing the nuances in medical imaging applications. This research uncovers the inherent challenges in FL due to data diversity. It sets the stage for future advancements in FL strategies to effectively manage data heterogeneity, especially in sensitive fields like healthcare.

  2. f

    Overview of Federated Learning (FL) and data heterogeneity.

    • figshare.com
    • plos.figshare.com
    xls
    Updated May 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Babar; Basit Qureshi; Anis Koubaa (2024). Overview of Federated Learning (FL) and data heterogeneity. [Dataset]. http://doi.org/10.1371/journal.pone.0302539.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 15, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Muhammad Babar; Basit Qureshi; Anis Koubaa
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Overview of Federated Learning (FL) and data heterogeneity.

  3. J

    Heterogeneity and dynamics in network models (replication data)

    • journaldata.zbw.eu
    pdf
    Updated Aug 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Enzo D'Innocenzo; Andre Lucas; Anne Opschoor; Xingmin Zhang; Enzo D'Innocenzo; Andre Lucas; Anne Opschoor; Xingmin Zhang (2023). Heterogeneity and dynamics in network models (replication data) [Dataset]. http://doi.org/10.15456/jae.2023222.0603813668
    Explore at:
    pdf(51620)Available download formats
    Dataset updated
    Aug 10, 2023
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Enzo D'Innocenzo; Andre Lucas; Anne Opschoor; Xingmin Zhang; Enzo D'Innocenzo; Andre Lucas; Anne Opschoor; Xingmin Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Description of dataset (from Datastream, Bloomberg and BIS) corresponding to the paper "Heterogeneity and Dynamics in Network Models" by Enzo D'Innocenzo, Andre Lucas, Anne Opschoor, Xingmin Zhang (corresponding author)

  4. b

    Data from: Heterogeneity of t-tubules in pig hearts - Datasets - data.bris

    • data.bris.ac.uk
    Updated Jan 29, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2017). Data from: Heterogeneity of t-tubules in pig hearts - Datasets - data.bris [Dataset]. https://data.bris.ac.uk/data/dataset/4f977a8498d6b753a54e243b1aebc0bc
    Explore at:
    Dataset updated
    Jan 29, 2017
    Description

    Folder 1 - linescans: Folder contains linescans in TIF format of the four atrial cells referred to in Figure 1 of the manuscript. Folder 2 - di-8-ANEPPS: A folder containing two further folders ('atrial' and 'ventricular'), which contain, respectively, TIF images of the 10 atrial and 4 ventricular di-8-ANEPPS-stained cells referred to in Figure 2. Sham v control TTD: A spreadsheet containing t-tubule densities (TTD) of atrial and ventricular cells from Sham and Control animals to establish that there was no difference the t-tubule network in either atrial or ventricular cells between these two groups of animals. Control atrial 1: Original images of sections from atrial tissue from control animals used for analysis presented in Figures 4 - 6. Folder 1 of 5. Control atrial 2: Original images of sections of atrial tissue from control animals used for analysis shown in Figures 4 - 6. Folder 2 of 5. Sham atrial: Original images of atrial sections from Sham animals used for analysis presented in Figures 4 - 6. Folder 3 of 5. Control ventricular: Original images of ventricular sections from control animals used for analysis presented in Figures 4 - 6. Folder 4 of 5. Sham ventricular: Original images of sections from ventricular cells from Sham animals used for analysis presented in Figures 4 - 6. Folder 5 of 5.

  5. D

    Data and scripts from: Local dynamical heterogeneity in simple glass formers...

    • research.repository.duke.edu
    Updated Mar 18, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hu, Yi; Folena, Giampaolo; Zamponi, Francesco; Charbonneau, Patrick; Biroli, Giulio (2022). Data and scripts from: Local dynamical heterogeneity in simple glass formers [Dataset]. http://doi.org/10.7924/r4542tw29
    Explore at:
    Dataset updated
    Mar 18, 2022
    Dataset provided by
    Duke Research Data Repository
    Authors
    Hu, Yi; Folena, Giampaolo; Zamponi, Francesco; Charbonneau, Patrick; Biroli, Giulio
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Dataset funded by
    Simons Foundation
    Description

    We study the local dynamical fluctuations in glass-forming models of particles embedded in d-dimensional space, in the mean-field limit of d→∞. Our analytical calculation reveals that single-particle observables, such as squared particle displacements, display divergent fluctuations around the dynamical (or mode-coupling) transition, due to the emergence of nontrivial correlations between displacements along different directions. This effect notably gives rise to a divergent non-Gaussian parameter, α_2. The d→∞ local dynamics therefore becomes quite rich upon approaching the glass transition. The finite-d remnant of this phenomenon further provides a long sought-after, first-principle explanation for the growth of α_2 around the glass transition that is not based on multi-particle correlations. ... [Read More]

  6. D

    Data from: Crop and landscape heterogeneity increase biodiversity in...

    • researchdata.ntu.edu.sg
    • search.dataone.org
    • +2more
    tsv, txt +1
    Updated Mar 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DR-NTU (Data) (2024). Data from: Crop and landscape heterogeneity increase biodiversity in agricultural landscapes: A global review and meta-analysis [Dataset]. http://doi.org/10.21979/N9/63PIP0
    Explore at:
    type/x-r-syntax(17275), type/x-r-syntax(14355), type/x-r-syntax(17285), type/x-r-syntax(17269), type/x-r-syntax(33904), type/x-r-syntax(17591), type/x-r-syntax(17271), type/x-r-syntax(17249), tsv(1190930), type/x-r-syntax(17671), type/x-r-syntax(17030), type/x-r-syntax(17333), type/x-r-syntax(17373), type/x-r-syntax(17565), type/x-r-syntax(17499), type/x-r-syntax(17361), tsv(53706), tsv(557438), type/x-r-syntax(17605), type/x-r-syntax(31759), type/x-r-syntax(17634), tsv(1190602), type/x-r-syntax(14341), type/x-r-syntax(14653), type/x-r-syntax(17346), type/x-r-syntax(33181), type/x-r-syntax(17036), type/x-r-syntax(17309), type/x-r-syntax(17524), tsv(1453813), type/x-r-syntax(17496), type/x-r-syntax(33686), type/x-r-syntax(17112), type/x-r-syntax(14075), type/x-r-syntax(17234), type/x-r-syntax(17072), type/x-r-syntax(6015), txt(15081), type/x-r-syntax(17581), type/x-r-syntax(17610), type/x-r-syntax(17302), type/x-r-syntax(14442), tsv(512201), type/x-r-syntax(17485), type/x-r-syntax(17074), type/x-r-syntax(31779), type/x-r-syntax(17349), type/x-r-syntax(17484), type/x-r-syntax(17350), type/x-r-syntax(17094), tsv(1260790), tsv(1507600), type/x-r-syntax(17317), type/x-r-syntax(12621), type/x-r-syntax(17052)Available download formats
    Dataset updated
    Mar 12, 2024
    Dataset provided by
    DR-NTU (Data)
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Dataset funded by
    Nanyang Technological University
    Centre for Ecology and Hydrology
    Ministry of Education (MOE)
    Description

    All the data files are publicly available in the Dryad Digital Repository, at https://doi.org/10.5061/dryad.dbrv15f7j (Priyadarshana et al. 2024). The source codes for the statistics are publicly available in the Zenodo Digital Repository, at https://doi.org/10.5281/zenodo.10799017. These data files and source codes are also accessible via the GitHub Digital Repository, at https://github.com/Tharaka18/spatial.heterogeneity.meta.

  7. Z

    Data from: Tissue heterogeneity is prevalent in gene expression studies

    • data.niaid.nih.gov
    Updated Jun 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jitao David Zhang (2021). Tissue heterogeneity is prevalent in gene expression studies [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4298774
    Explore at:
    Dataset updated
    Jun 27, 2021
    Dataset provided by
    Gregor Sturm
    Jitao David Zhang
    Markus List
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This archive contains results associated with the publication

    Tissue heterogeneity is prevalent in gene expression studies. Gregor Sturm, Markus List and Jitao David Zhang.

    expr.tissuemark.affy.roche.symbols.gmt: The tissue signatures from the BioQC publication used in this study

    gtex_v6_gini_solid.gmt: The cross-platform cross-species validated tissue signatures produced in this study

    heterogeneity_results.tsv.gz: Signature scores and heterogeneity calls for each tested signature

    heterogeneity_fractions.tsv: Fraction of heterogeneous and severely heterogeneous samples per tissue

  8. J

    Structural estimation of behavioral heterogeneity (replication data)

    • jda-test.zbw.eu
    csv, r, txt
    Updated Jul 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhentao Shi; Huanhuan Zheng; Zhentao Shi; Huanhuan Zheng (2024). Structural estimation of behavioral heterogeneity (replication data) [Dataset]. https://jda-test.zbw.eu/dataset/structural-estimation-of-behavioral-heterogeneity
    Explore at:
    r(2836), r(3451), r(763), r(4156), txt(1887), r(1471), csv(154447), r(503)Available download formats
    Dataset updated
    Jul 22, 2024
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Zhentao Shi; Huanhuan Zheng; Zhentao Shi; Huanhuan Zheng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We develop a behavioral asset pricing model in which agents trade in a market with information friction. Profit-maximizing agents switch between trading strategies in response to dynamic market conditions. Owing to noisy private information about the fundamental value, the agents form different evaluations about heterogeneous strategies. We exploit a thin set-a small sub-population-to point identify this nonlinear model, and estimate the structural parameters using extended method of moments. Based on the estimated parameters, the model produces return time series that emulate the moments of the real data. These results are robust across different sample periods and estimation methods.

  9. Data and code - Disentangeling dispersion from mean reveals true...

    • zenodo.org
    zip
    Updated Nov 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cameron Pellett; Cameron Pellett; Ruben Valbuena; Ruben Valbuena (2024). Data and code - Disentangeling dispersion from mean reveals true heterogeneity-diversity relationships [Dataset]. http://doi.org/10.5281/zenodo.14179015
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 18, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Cameron Pellett; Cameron Pellett; Ruben Valbuena; Ruben Valbuena
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data and code for reproducing figures and results for the manuscript entitled "Disentangeling dispersion from mean reveals true heterogeneity-diversity relationships". Publication forthcoming. See references for data sources.

    Code tested with Julia version 1.11.1.

    How to cite this repository

    If using code or data from this repository, please cite the original publication (forthcoming) and respective data source (see references and README.txt in respective data folder).

    Update 2024-07-09

    Minor changes to figure sizes and use of paired-sample t-tests when assessing empirical observations of heterogeneity measures.

    Update 2024-08-04

    Step by step instructions included in README

    Manifest.toml file included with julia and package version requirements.

    Update 2024-11-18

    Update following peer review feedback:

    Analysis of an additional dataset from MacArthurs' seminal paper on foliage height diversity.

    Hypothesis test of negligible trend for delta

    Modified extended data figures

  10. J

    Heterogeneity in risk aversion and risk sharing regressions (replication...

    • journaldata.zbw.eu
    • jda-test.zbw.eu
    txt, zip
    Updated Dec 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pierfederico Asdrubali; Simone Tedeschi; Luigi Ventura; Pierfederico Asdrubali; Simone Tedeschi; Luigi Ventura (2022). Heterogeneity in risk aversion and risk sharing regressions (replication data) [Dataset]. http://doi.org/10.15456/jae.2022327.0709336776
    Explore at:
    txt(2843), zip(80812186)Available download formats
    Dataset updated
    Dec 7, 2022
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Pierfederico Asdrubali; Simone Tedeschi; Luigi Ventura; Pierfederico Asdrubali; Simone Tedeschi; Luigi Ventura
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Heterogeneity in risk attitudes, if not properly accounted for, may induce a bias on the income coefficient of standard consumption insurance regressions. We show that, extending the theoretical analysis and empirical findings in Schulhofer-Wohl (Journal of Political Economy, 2011, 119, 925-958), the sign of the bias is ambiguous, and depends on cycle-related variables and on the covariances of both aggregate and idiosyncratic risk with individual risk aversion.

  11. J

    HETEROGENEITY, EXCESS ZEROS, AND THE STRUCTURE OF COUNT DATA MODELS...

    • journaldata.zbw.eu
    .dat, txt
    Updated Dec 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John Mullahy; John Mullahy (2022). HETEROGENEITY, EXCESS ZEROS, AND THE STRUCTURE OF COUNT DATA MODELS (replication data) [Dataset]. http://doi.org/10.15456/jae.2022313.1256459247
    Explore at:
    txt(2620), .dat(742170)Available download formats
    Dataset updated
    Dec 8, 2022
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    John Mullahy; John Mullahy
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This paper demonstrates that the unobserved heterogeneity commonly assumed to be the source of overdispersion in count data models has predictable implications for the probability structure of such mixture models. In particular, the common observation of excess zeros is a strict implication of unobserved heterogeneity. This result has important implications for using count model estimates for predicting certain interesting parameters. Test statistics to detect such heterogeneity-related departures from the null model are proposed and applied in a health-care utilization example, suggesting that a null Poisson model should be rejected in favour of a mixed alternative.

  12. Z

    Fast Intratumor Heterogeneity Inference from Single-Cell Sequencing Data...

    • data.niaid.nih.gov
    Updated Jul 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Salem Malikic (2022). Fast Intratumor Heterogeneity Inference from Single-Cell Sequencing Data (simulated data - Extended Data Figures) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6829081
    Explore at:
    Dataset updated
    Jul 14, 2022
    Dataset provided by
    Can Kizilkale
    Salem Malikic
    Farid Rashidi Mehrabadi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This data repository contains simulated data used for benchmarking HUNTRESS against the existing alternative tools. Results of the benchmarking are shown in Extended Data Figures 1-10 of the paper "Fast Intratumor Heterogeneity Inference from Single-Cell Sequencing Data" (to appear in Nature Computational Science).

  13. o

    Replication data for: Using Causal Forests to Predict Treatment...

    • openicpsr.org
    Updated May 1, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan M.V. Davis; Sara B. Heller (2017). Replication data for: Using Causal Forests to Predict Treatment Heterogeneity: An Application to Summer Jobs [Dataset]. http://doi.org/10.3886/E113487V1
    Explore at:
    Dataset updated
    May 1, 2017
    Dataset provided by
    American Economic Association
    Authors
    Jonathan M.V. Davis; Sara B. Heller
    Area covered
    Cook County, Illinois
    Description

    To estimate treatment heterogeneity in two randomized controlled trials of a youth summer jobs program, we implement Wager and Athey's (2015) causal forest algorithm. We provide a step-by-step explanation targeted at applied researchers of how the algorithm predicts treatment effects based on observables. We then explore how useful the predicted heterogeneity is in practice by testing whether youth with larger predicted treatment effects actually respond more in a hold-out sample. Our application highlights some limitations of the causal forest, but it also suggests that the method can identify treatment heterogeneity for some outcomes that more standard interaction approaches would have missed.

  14. J

    Habits and heterogeneity in demands: a panel data analysis (replication...

    • journaldata.zbw.eu
    txt
    Updated Dec 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Browning; M. Dolores Collado; Martin Browning; M. Dolores Collado (2022). Habits and heterogeneity in demands: a panel data analysis (replication data) [Dataset]. http://doi.org/10.15456/jae.2022319.0714196543
    Explore at:
    txt(3601224), txt(2673)Available download formats
    Dataset updated
    Dec 8, 2022
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Martin Browning; M. Dolores Collado; Martin Browning; M. Dolores Collado
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We examine demand behaviour for intertemporal dependencies, using Spanish panel data. We present evidence that there is both state dependence and correlated heterogeneity in demand behaviour. Our specific findings are that food outside the home, alcohol and tobacco are habit forming, whereas clothing and small durables exhibit durability. We conclude that demand analyses using cross-section data that ignore these effects may be seriously biased. On the other hand, the degree of intertemporal dependence is not sufficiently strong to make composite consumption significantly habit forming, as has been suggested in some recent analyses.

  15. d

    Data from: Spatial heterogeneity in resources alters selective dynamics in...

    • datadryad.org
    • zenodo.org
    zip
    Updated Apr 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ian Dworkin; Audrey E Wilson; Ali Siddiqui (2021). Spatial heterogeneity in resources alters selective dynamics in Drosophila melanogaster [Dataset]. http://doi.org/10.5061/dryad.m37pvmd24
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 30, 2021
    Dataset provided by
    Dryad
    Authors
    Ian Dworkin; Audrey E Wilson; Ali Siddiqui
    Time period covered
    2021
    Description

    Information is provided in the readme file.

    We also have the data and scripts available on github (https://github.com/idworkin/Wilson2021_Evolution_Data)

    The few columns with missing data either are empty, or have NA.

  16. o

    Replication data for: Consistency and Heterogeneity of Individual Behavior...

    • openicpsr.org
    Updated Dec 1, 2007
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Syngjoo Choi; Raymond Fisman; Douglas Gale; Shachar Kariv (2007). Replication data for: Consistency and Heterogeneity of Individual Behavior under Uncertainty [Dataset]. http://doi.org/10.3886/E116296V1
    Explore at:
    Dataset updated
    Dec 1, 2007
    Dataset provided by
    American Economic Association
    Authors
    Syngjoo Choi; Raymond Fisman; Douglas Gale; Shachar Kariv
    Description

    By using graphical representations of simple portfolio choice problems, we generate a very rich dataset to study behavior under uncertainty at the level of the individual subject. We test the data for consistency with the maximization hypothesis, and we estimate preferences using a two-parameter utility function based on Faruk Gul (1991). This specification provides a good interpretation of the data at the individual level and can account for the highly heterogeneous behaviors observed in the laboratory. The parameter estimates jointly describe attitudes toward risk and allow us to characterize the distribution of risk preferences in the population. (JEL D11, D14, D81, G11)

  17. P

    Replication Data for: Image-based Treatment Effect Heterogeneity Dataset

    • paperswithcode.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Connor T. Jerzak; Fredrik Johansson; Adel Daoud, Replication Data for: Image-based Treatment Effect Heterogeneity Dataset [Dataset]. https://paperswithcode.com/dataset/replication-data-for-image-based-treatment
    Explore at:
    Authors
    Connor T. Jerzak; Fredrik Johansson; Adel Daoud
    Description

    Dataset Overview This dataset contains individual-level data from a randomized controlled trial (RCT) conducted in northern Uganda, along with associated satellite imagery. It is designed to investigate how treatment effects may vary across different geographical and contextual settings by leveraging both tabular and image-based variables.

    Motivation and Content

    Researchers often wish to explore treatment effect heterogeneity, especially in studies focused on global poverty. Traditional variables—such as age and ethnicity—are typically collected near the time of data gathering and may overlook broader environmental, historical, or neighborhood-specific factors. Incorporating satellite images into causal inference analyses provides a valuable window into such contextual factors. This dataset exemplifies how researchers can combine tabular data (e.g., demographic variables, outcomes, treatment indicators) with geospatially keyed satellite imagery to model and interpret how treatment effects change across different locations.

    Potential Use Cases

    Causal Inference Research: Apply image-based methods to detect and explain geographic or contextual heterogeneity in RCT outcomes. Policy Evaluation: Aid policymakers in identifying areas or populations most likely to benefit from poverty-alleviation interventions. Methodological Innovations: Serve as a testbed for new models that integrate high-dimensional or unstructured data (images) with standard tabular data in the causal inference setting.

    Source Connor T. Jerzak, Fredrik Johansson, Adel Daoud. Image-based Treatment Effect Heterogeneity. Proceedings of the Second Conference on Causal Learning and Reasoning (CLeaR), Proceedings of Machine Learning Research (PMLR), 213: 531-552, 2023.

  18. d

    Data and code from: Coordinated distributed experiments in ecology do not...

    • search.dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated Mar 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julia Bebout; Jeremy Fox (2024). Data and code from: Coordinated distributed experiments in ecology do not consistently reduce heterogeneity in effect size [Dataset]. http://doi.org/10.5061/dryad.cz8w9gj8w
    Explore at:
    Dataset updated
    Mar 6, 2024
    Dataset provided by
    Dryad Digital Repository
    Authors
    Julia Bebout; Jeremy Fox
    Time period covered
    Jan 1, 2023
    Description

    Ecological meta-analyses usually exhibit high relative heterogeneity of effect size: most among-study variation in effect size represents true variation in mean effect size, rather than sampling error. This heterogeneity arises from both methodological and ecological sources. Methodological heterogeneity is a nuisance that complicates the interpretation of data syntheses. One way to reduce methodological heterogeneity is via coordinated distributed experiments, in which investigators conduct the same experiment at different sites, using the same methods. We tested whether coordinated distributed experiments in ecology exhibit a) low heterogeneity in effect size, and b) lower heterogeneity than meta-analyses, using data on 17 effects from eight coordinated distributed experiments, and 406 meta-analyses. Consistent with our expectations, among-site heterogeneity typically comprised <50% of the variance in effect size in distributed experiments. In contrast, heterogeneity within and amo..., , , # Coordinated distributed experiments in ecology do not consistently reduce heterogeneity in effect size

    Included here is a data file for a distributed experiment, and code which analyses the heterogeneity of many coordinated distributed experiments and meta-analyses. The R code file reproduces the results of this study, called meta-analyses vs distd expts - R code for sharing v 2.R.

    ## Description of the data and file structure

    Data File:

    rousk et al 2013 table 3 data - INCREASE.csv: data from the INCREASE distributed experiment by Rousk et al. (2013)

    All other data used in code is automatically sourced from URLs, but relevant variables are still described below.

    Other variables in datasets were not used in our analysis, and so are not explained in this README file. Cells with missing data have "NA" values.

    Variables used in code:

    Costello & Fox variables:Â

    meta.analysis.id: Unique ID number for each meta-analysis

    eff.size: Effect size

    var. eff.size: Variance in e...

  19. H

    Replication Data for: "Heterogeneity of Rules in Bayesian Reasoning: A...

    • dataverse.harvard.edu
    Updated May 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan Kristian Woike; Ralph Hertwig; Gerd Gigerenzer (2023). Replication Data for: "Heterogeneity of Rules in Bayesian Reasoning: A Toolbox Analysis" (Study 4a-c) [Dataset]. http://doi.org/10.7910/DVN/FYMODJ
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 11, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Jan Kristian Woike; Ralph Hertwig; Gerd Gigerenzer
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Replication data for Study 4a-c: All estimates by study participants. Dataset show participantID (unique to this study), taskID (see article supplement for list of tasks), base rate (b), hit rate (h), and false alarm rate (f), correct solution and participant estimate.

  20. H

    Replication Data for: Justice-Level Heterogeneity in Certiorari Voting: U.S....

    • dataverse.harvard.edu
    Updated Oct 14, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Dataverse (2021). Replication Data for: Justice-Level Heterogeneity in Certiorari Voting: U.S. Supreme Court October Terms 1939, 1968, and 1982. [Dataset]. http://doi.org/10.7910/DVN/QPPL9H
    Explore at:
    pdf(427267), bin(7405), application/x-stata-ado(23747), pdf(66768), tsv(3786905), application/x-stata-ado(12871), application/x-stata-syntax(7652), bin(11090)Available download formats
    Dataset updated
    Oct 14, 2021
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    United States
    Description

    Although the literature on U.S. Supreme Court agenda-setting is sizable, justice-vote-level multivariate analyses of certiorari are almost exclusively limited to samples of discussed cases from 1986--1993. Moreover, these studies have done very little to explore justice-level heterogeneity on certiorari. Here, we address these lacunae by analyzing the predictors of individual justices' cert votes on all paid cases from the 1939, 1968, and 1982 terms. We find substantial justice-level heterogeneity in the weight that justices place on the standard set of forces shaping the cert vote. We also show that some of this heterogeneity is associated with justices' experience and ideological extremism, largely in theoretically predicted ways. In closing, we sound a note of caution on drawing conclusions about effects of justice attributes, when the number of justices is relatively small.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Muhammad Babar; Basit Qureshi; Anis Koubaa (2024). Datasets detail. [Dataset]. http://doi.org/10.1371/journal.pone.0302539.t002

Datasets detail.

Related Article
Explore at:
xlsAvailable download formats
Dataset updated
May 15, 2024
Dataset provided by
PLOS ONE
Authors
Muhammad Babar; Basit Qureshi; Anis Koubaa
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

In recent years, Federated Learning (FL) has gained traction as a privacy-centric approach in medical imaging. This study explores the challenges posed by data heterogeneity on FL algorithms, using the COVIDx CXR-3 dataset as a case study. We contrast the performance of the Federated Averaging (FedAvg) algorithm on non-identically and independently distributed (non-IID) data against identically and independently distributed (IID) data. Our findings reveal a notable performance decline with increased data heterogeneity, emphasizing the need for innovative strategies to enhance FL in diverse environments. This research contributes to the practical implementation of FL, extending beyond theoretical concepts and addressing the nuances in medical imaging applications. This research uncovers the inherent challenges in FL due to data diversity. It sets the stage for future advancements in FL strategies to effectively manage data heterogeneity, especially in sensitive fields like healthcare.

Search
Clear search
Close search
Google apps
Main menu