100+ datasets found
  1. Simulation Data Set

    • catalog.data.gov
    • s.cnmilf.com
    Updated Nov 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2020). Simulation Data Set [Dataset]. https://catalog.data.gov/dataset/simulation-data-set
    Explore at:
    Dataset updated
    Nov 12, 2020
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: File format: R workspace file; “Simulated_Dataset.RData”. Metadata (including data dictionary) • y: Vector of binary responses (1: adverse outcome, 0: control) • x: Matrix of covariates; one row for each simulated individual • z: Matrix of standardized pollution exposures • n: Number of simulated individuals • m: Number of exposure time periods (e.g., weeks of pregnancy) • p: Number of columns in the covariate design matrix • alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate) Code Abstract We provide R statistical software code (“CWVS_LMC.txt”) to fit the linear model of coregionalization (LMC) version of the Critical Window Variable Selection (CWVS) method developed in the manuscript. We also provide R code (“Results_Summary.txt”) to summarize/plot the estimated critical windows and posterior marginal inclusion probabilities. Description “CWVS_LMC.txt”: This code is delivered to the user in the form of a .txt file that contains R statistical software code. Once the “Simulated_Dataset.RData” workspace has been loaded into R, the text in the file can be used to identify/estimate critical windows of susceptibility and posterior marginal inclusion probabilities. “Results_Summary.txt”: This code is also delivered to the user in the form of a .txt file that contains R statistical software code. Once the “CWVS_LMC.txt” code is applied to the simulated dataset and the program has completed, this code can be used to summarize and plot the identified/estimated critical windows and posterior marginal inclusion probabilities (similar to the plots shown in the manuscript). Optional Information (complete as necessary) Required R packages: • For running “CWVS_LMC.txt”: • msm: Sampling from the truncated normal distribution • mnormt: Sampling from the multivariate normal distribution • BayesLogit: Sampling from the Polya-Gamma distribution • For running “Results_Summary.txt”: • plotrix: Plotting the posterior means and credible intervals Instructions for Use Reproducibility (Mandatory) What can be reproduced: The data and code can be used to identify/estimate critical windows from one of the actual simulated datasets generated under setting E4 from the presented simulation study. How to use the information: • Load the “Simulated_Dataset.RData” workspace • Run the code contained in “CWVS_LMC.txt” • Once the “CWVS_LMC.txt” code is complete, run “Results_Summary.txt”. Format: Below is the replication procedure for the attached data set for the portion of the analyses using a simulated data set: Data The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publically available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics, and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).

  2. B

    Sociotechnical Simulation Study Source Data

    • borealisdata.ca
    Updated Oct 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Emma Stanley (2025). Sociotechnical Simulation Study Source Data [Dataset]. http://doi.org/10.5683/SP3/N0PHAT
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 8, 2025
    Dataset provided by
    Borealis
    Authors
    Emma Stanley
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    This repository contains the data used to generate figures and tables in the unpublished paper "Connecting algorithmic fairness and fair outcomes in a sociotechnical simulation case study of AI-assisted healthcare". In this work, we present a simulation-based approach to explore how statistical definitions of algorithmic fairness translate to fairness in long-term outcomes, using AI-assisted breast cancer screening as a case example. We evaluate four fairness criteria and their impact on mortality rates and socioeconomic disparities, while also considering how radiologists’ reliance on AI and patients’ access to healthcare affect outcomes. Our results highlight how algorithmic fairness does not directly translate into fair and equitable outcomes, underscoring the importance of integrating sociotechnical perspectives in order to gain a holistic understanding of fairness in AI.  

  3. Results of initial values simulation study

    • figshare.com
    bin
    Updated Jan 20, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ulrich Tran (2016). Results of initial values simulation study [Dataset]. http://doi.org/10.6084/m9.figshare.1555671.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 20, 2016
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Ulrich Tran
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Results of the "Effects of initial values and convergence criterion in the two-parameter logistic model when estimating the latent distribution in BILOG-MG 3" simulation study, published in PLOS ONE.

  4. r

    Evaluation of statistical methods used in the analysis of interrupted time...

    • researchdata.edu.au
    • bridges.monash.edu
    Updated Jun 25, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Simon Turner; Andrew Forbes; Amalia Karahalios; Monica Taljaard; Joanne McKenzie (2021). Evaluation of statistical methods used in the analysis of interrupted time series studies: a simulation study - Code and Data [Dataset]. http://doi.org/10.26180/13284329
    Explore at:
    Dataset updated
    Jun 25, 2021
    Dataset provided by
    Monash University
    Authors
    Simon Turner; Andrew Forbes; Amalia Karahalios; Monica Taljaard; Joanne McKenzie
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The zip file includes the computer simulation code and datasets required for reproducing the study: Evaluation of statistical methods used in the analysis of interrupted time series studies: a simulation study.

    Simon L Turner, Andrew B Forbes, Amalia Karahalios, Monica Taljaard, Joanne E McKenzie.

  5. n

    Data and code for: Generation and applications of simulated datasets to...

    • data.niaid.nih.gov
    • datadryad.org
    • +1more
    zip
    Updated Mar 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew Silk; Olivier Gimenez (2023). Data and code for: Generation and applications of simulated datasets to integrate social network and demographic analyses [Dataset]. http://doi.org/10.5061/dryad.m0cfxpp7s
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 10, 2023
    Dataset provided by
    Centre d'Écologie Fonctionnelle et Évolutive
    Authors
    Matthew Silk; Olivier Gimenez
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Social networks are tied to population dynamics; interactions are driven by population density and demographic structure, while social relationships can be key determinants of survival and reproductive success. However, difficulties integrating models used in demography and network analysis have limited research at this interface. We introduce the R package genNetDem for simulating integrated network-demographic datasets. It can be used to create longitudinal social networks and/or capture-recapture datasets with known properties. It incorporates the ability to generate populations and their social networks, generate grouping events using these networks, simulate social network effects on individual survival, and flexibly sample these longitudinal datasets of social associations. By generating co-capture data with known statistical relationships it provides functionality for methodological research. We demonstrate its use with case studies testing how imputation and sampling design influence the success of adding network traits to conventional Cormack-Jolly-Seber (CJS) models. We show that incorporating social network effects in CJS models generates qualitatively accurate results, but with downward-biased parameter estimates when network position influences survival. Biases are greater when fewer interactions are sampled or fewer individuals are observed in each interaction. While our results indicate the potential of incorporating social effects within demographic models, they show that imputing missing network measures alone is insufficient to accurately estimate social effects on survival, pointing to the importance of incorporating network imputation approaches. genNetDem provides a flexible tool to aid these methodological advancements and help researchers test other sampling considerations in social network studies. Methods The dataset and code stored here is for Case Studies 1 and 2 in the paper. Datsets were generated using simulations in R. Here we provide 1) the R code used for the simulations; 2) the simulation outputs (as .RDS files); and 3) the R code to analyse simulation outputs and generate the tables and figures in the paper.

  6. m

    Evaluation of statistical methods used to meta-analyse results from...

    • bridges.monash.edu
    • datasetcatalog.nlm.nih.gov
    • +1more
    zip
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elizabeth Korevaar; Simon Turner; Andrew Forbes; AMALIA KARAHALIOS; Monica Taljaard; Joanne McKenzie (2023). Evaluation of statistical methods used to meta-analyse results from interrupted time series studies: a simulation study - Code and Data [Dataset]. http://doi.org/10.26180/20999185.v2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Monash University
    Authors
    Elizabeth Korevaar; Simon Turner; Andrew Forbes; AMALIA KARAHALIOS; Monica Taljaard; Joanne McKenzie
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The datasets containing simulation performance results during the current study, in addition to the code to replicate the simulation study in its entirety, are available here. See the README file for a description the Stata do-files, R-script files, tips to run the code, and the performance result dataset dictionaries.

  7. Results of simulation study

    • figshare.com
    xlsx
    Updated Jan 18, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ulrich Tran; Rudolf Debelak (2016). Results of simulation study [Dataset]. http://doi.org/10.6084/m9.figshare.2064885.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jan 18, 2016
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Ulrich Tran; Rudolf Debelak
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Results of simulation study on smoothing algorithms on the performance of parallel analysis in factor analysis of ordered categorical items

  8. d

    Data from: Data for simulation experiments comparing nonstationary...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Nov 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Data for simulation experiments comparing nonstationary design-flood adjustments based on observed annual peak flows in the conterminous United States [Dataset]. https://catalog.data.gov/dataset/data-for-simulation-experiments-comparing-nonstationary-design-flood-adjustments-based-on-
    Explore at:
    Dataset updated
    Nov 20, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Contiguous United States, United States
    Description

    This dataset contains files used in this Monte Carlo simulation study comparing the performance of five statistical models for adjusting design floods for current conditions at sites with known trends. These files include (i) the observed annual peak-flow series in the conterminous US used to inform ranges of known moments and trends used in the simulation experiment, (ii) the 3,000 combinations of Monte Carlo experiment parameters (including sample moments, trends, distribution types, and record lengths), (iii) the 5,000 100-year time series of random uniform variates used as annual non-exceedance probabilities in the generation of synthetic annual peak-flow series, (iv) the simulated and true (known) quantiles associated with the 10% and 1% annual exceedance probabilities conditioned on the last years of the synthetic annual peak-flow series generated through the experiment. This dataset also contains a model archive with the R statistical software code used to execute the study along with a document describing the contents of the archive and providing instructions for reproducing results.

  9. r

    Outlier detection in clinical registries - simulation study data and Stata...

    • researchdata.edu.au
    • bridges.monash.edu
    Updated Dec 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Susannah Ahern; Jessy Hansen; Arul Earnest; Ahmad Reza Pourghaderi (2023). Outlier detection in clinical registries - simulation study data and Stata code [Dataset]. http://doi.org/10.26180/24471664.V2
    Explore at:
    Dataset updated
    Dec 12, 2023
    Dataset provided by
    Monash University
    Authors
    Susannah Ahern; Jessy Hansen; Arul Earnest; Ahmad Reza Pourghaderi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Contains the simulated data and Stata code used to produce the results for the manuscript titled "Evaluating methods of outlier detection when benchmarking clinical registry data – a simulation study", accepted for publication in the Health Services and Outcomes Research Methodology Journal.

    data_files.zip (code to generate all files in "do_files\simstudy1_preparation.do"):
    raw_data - the .dta files produced from running the user-written hiersim command (https://doi.org/10.26180/24480889.v1)
    summary_data - the .dta files produced from summarising of the results across each unique simulated scenario and method combination (performance measure average and 95% Monte Carlo confidence intervals)
    parameter_check - the .dta files produced from summarising the simulated data parameters across each unique simulated scenario (performance measure average and 95% Monte Carlo confidence intervals)

    do_files.zip:
    simstudy1_preparation.do - the code to run the simulations (using the hiersim command, available at https://doi.org/10.26180/24480889.v1) and create summary datasets (performance measures and parameter checks)
    simstudy1_manuscript.do - the code to produce the figures included in the main manuscript
    simstudy1_supplementary.do - the code to produce the table and figures included in the manuscript supplementary material

  10. T

    ML-CFA Monte Carlo Simulation Back Up Files

    • dataverse.tdl.org
    bin +2
    Updated Nov 21, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    R. Noah Padgett; Grant Morgan; R. Noah Padgett; Grant Morgan (2019). ML-CFA Monte Carlo Simulation Back Up Files [Dataset]. http://doi.org/10.18738/T8/RBUFZG
    Explore at:
    text/x-fixed-field(8537), text/x-fixed-field(6090), text/x-fixed-field(34500), text/x-fixed-field(828000), txt(1597), text/x-fixed-field(14392), bin(2104), txt(1624), bin(40369)Available download formats
    Dataset updated
    Nov 21, 2019
    Dataset provided by
    Texas Data Repository
    Authors
    R. Noah Padgett; Grant Morgan; R. Noah Padgett; Grant Morgan
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset contains the entirety of the files generated during the large Monte Carlo simulation study on fit statistics in multilevel factor analysis.

  11. Z

    Data from: Test Collection Reliability: A Study of Bias and Robustness to...

    • data-staging.niaid.nih.gov
    • data.niaid.nih.gov
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Urbano, Julián (2020). Test Collection Reliability: A Study of Bias and Robustness to Statistical Assumptions via Stochastic Simulation [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_32606
    Explore at:
    Dataset updated
    Jan 24, 2020
    Authors
    Urbano, Julián
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This archive contains the simulated collections, their diagnosis data, and the estimates of accuracy. For the full code and description, please refer to https://github.com/julian-urbano/irj2015-reliability

  12. Call Center Simulated Data

    • kaggle.com
    zip
    Updated Mar 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pablo Sebastián Campos Ortiz (2023). Call Center Simulated Data [Dataset]. https://www.kaggle.com/datasets/scss17/call-center-simulated-data
    Explore at:
    zip(3098 bytes)Available download formats
    Dataset updated
    Mar 28, 2023
    Authors
    Pablo Sebastián Campos Ortiz
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The aim of this data set is to be used along with my notebook Linear Regression Notes which provides a guideline for applying correlation analysis and linear regression models from a statistical approach.

    A fictional call center is interested in knowing the relationship between the number of personnel and some variables that measure their performance such as average answer time, average calls per hour, and average time per call. Data were simulated to represent 200 shifts.

  13. d

    Data from: The use of percentage change from baseline as an outcome in a...

    • catalog.data.gov
    • data.virginia.gov
    Updated Sep 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (2025). The use of percentage change from baseline as an outcome in a controlled trial is statistically inefficient: a simulation study [Dataset]. https://catalog.data.gov/dataset/the-use-of-percentage-change-from-baseline-as-an-outcome-in-a-controlled-trial-is-statisti
    Explore at:
    Dataset updated
    Sep 7, 2025
    Dataset provided by
    National Institutes of Health
    Description

    Background Many randomized trials involve measuring a continuous outcome - such as pain, body weight or blood pressure - at baseline and after treatment. In this paper, I compare four possibilities for how such trials can be analyzed: post-treatment; change between baseline and post-treatment; percentage change between baseline and post-treatment and analysis of covariance (ANCOVA) with baseline score as a covariate. The statistical power of each method was determined for a hypothetical randomized trial under a range of correlations between baseline and post-treatment scores. Results ANCOVA has the highest statistical power. Change from baseline has acceptable power when correlation between baseline and post-treatment scores is high;when correlation is low, analyzing only post-treatment scores has reasonable power. Percentage change from baseline has the lowest statistical power and was highly sensitive to changes in variance. Theoretical considerations suggest that percentage change from baseline will also fail to protect from bias in the case of baseline imbalance and will lead to an excess of trials with non-normally distributed outcome data. Conclusions Percentage change from baseline should not be used in statistical analysis. Trialists wishing to report this statistic should use another method, such as ANCOVA, and convert the results to a percentage change by using mean baseline scores.

  14. m

    Script for: Zhou et al. (2024). A Simulation Study of the Performance of...

    • data.mendeley.com
    Updated Aug 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhengyang Zhou (2024). Script for: Zhou et al. (2024). A Simulation Study of the Performance of Statistical Models for Count Outcomes with Excessive Zeros [Dataset]. http://doi.org/10.17632/r5bztdd766.2
    Explore at:
    Dataset updated
    Aug 20, 2024
    Authors
    Zhengyang Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains the R code for the data generation and analysis for the paper:

    Zhou, Z., Li, D., Huh, D., Xie, M., & Mun, E. Y. (2024). A Simulation Study of the Performance of Statistical Models for Count Outcomes with Excessive Zeros. Statistics in Medicine. https://doi.org/10.1002/sim.10198

    Abstract

    Background: Outcome measures that are count variables with excessive zeros are common in health behaviors research. Examples include the number of standard drinks consumed or alcohol-related problems experienced over time. There is a lack of empirical data about the relative performance of prevailing statistical models for assessing the efficacy of interventions when outcomes are zero-inflated, particularly compared with recently developed marginalized count regression approaches for such data. Methods: The current simulation study examined five commonly used approaches for analyzing count outcomes, including two linear models (with outcomes on raw and log-transformed scales, respectively) and three prevailing count distribution-based models (i.e., Poisson, negative binomial, and zero-inflated Poisson (ZIP) models). We also considered the marginalized zero-inflated Poisson (MZIP) model, a novel alternative that estimates the overall effects on the population mean while adjusting for zero-inflation. Motivated by alcohol misuse prevention trials, extensive simulations were conducted to evaluate and compare the statistical power and Type I error rate of candidate statistical models and approaches across data conditions that varied in sample size (N = 100 to 500), zero rate (0.2 to 0.8), and intervention effect sizes conditions. Results: Under zero-inflation, the Poisson model failed to control the Type I error rate, resulting in higher than expected false positive results. When the intervention effects on the zero (vs. non-zero) and count parts were in the same direction, the MZIP model had the highest statistical power, followed by the linear model with outcomes on the raw scale, negative binomial model, and ZIP model. The performance of linear model with a log-transformed outcome variable was unsatisfactory. When only one of the effects on the zero (vs. non-zero) part and the count part existed, the ZIP model had the highest statistical power. Conclusions: The MZIP model demonstrated better statistical properties in detecting true intervention effects and controlling false positive results for zero-inflated count outcomes. This MZIP model may serve as an appealing analytical approach to evaluating overall intervention effects in studies with count outcomes marked by excessive zeros.

  15. R

    Replication data for : Rise and fall of a multicomponent droplet in a...

    • entrepot.recherche.data.gouv.fr
    pdf, zip
    Updated Jan 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Herve HENRY; Mirantsoa Aimé RASOLOFOMANANA; Mirantsoa Aimé RASOLOFOMANANA; Romain LE TELLIER; Romain LE TELLIER; Herve HENRY (2025). Replication data for : Rise and fall of a multicomponent droplet in a surrounding fluid: Simulation study of a bumpy path [Dataset]. http://doi.org/10.57745/PVPRXJ
    Explore at:
    pdf(87493), zip(4580936992)Available download formats
    Dataset updated
    Jan 20, 2025
    Dataset provided by
    Recherche Data Gouv
    Authors
    Herve HENRY; Mirantsoa Aimé RASOLOFOMANANA; Mirantsoa Aimé RASOLOFOMANANA; Romain LE TELLIER; Romain LE TELLIER; Herve HENRY
    License

    https://spdx.org/licenses/etalab-2.0.htmlhttps://spdx.org/licenses/etalab-2.0.html

    Description

    Données de simulations numériques permettant de reproduire les figures présentées dans l'article: Rise and fall of a multicomponent droplet in a surrounding fluid: Simulation study of a bumpy path Numerical simulation data needed to reproduce the figures presented in the article: «Rise and fall of a multicomponent droplet in a surrounding fluid: Simulation study of a bumpy path»

  16. Dataset for the " Simulation Study of mmWave 5G-enabled Medical Extended...

    • catalog.data.gov
    • gimi9.com
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2025). Dataset for the " Simulation Study of mmWave 5G-enabled Medical Extended Reality (MXR)" article's figure [Dataset]. https://catalog.data.gov/dataset/dataset-for-the-simulation-study-of-mmwave-5g-enabled-medical-extended-reality-mxr-article
    Explore at:
    Dataset updated
    Mar 14, 2025
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    Dataset for the " Simulation Study of mmWave 5G-enabled Medical Extended Reality (MXR)" article's figure

  17. C

    Data of simulation study for preliminary detection of problematic items in...

    • dataverse.csuc.cat
    tsv, txt
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pere J. Ferrando; Pere J. Ferrando; Urbano Lorenzo-Seva; Urbano Lorenzo-Seva; M. Teresa Bargalló-Escrivà; M. Teresa Bargalló-Escrivà (2023). Data of simulation study for preliminary detection of problematic items in item factor analysis [Dataset]. http://doi.org/10.34810/data759
    Explore at:
    tsv(8007997), tsv(8007999), txt(6496)Available download formats
    Dataset updated
    Oct 31, 2023
    Dataset provided by
    CORA.Repositori de Dades de Recerca
    Authors
    Pere J. Ferrando; Pere J. Ferrando; Urbano Lorenzo-Seva; Urbano Lorenzo-Seva; M. Teresa Bargalló-Escrivà; M. Teresa Bargalló-Escrivà
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Dataset funded by
    https://ror.org/003x0zc53
    Description

    It was carried out a simulation study that took into account the item properties of extremeness (difficulty, location) and consistency. The background idea is that a scale should be defined by a minimum of five items. In addition, averaged bias and sampling error of the five items were also inspected. Files included in the dataset: Data LINEAL: The items are analysed based on linear factor analysis; Data GRADED: The items are analysed based on no-linear factor analysis

  18. s

    A simulation study exploring weighted likelihood models to recover unbiased...

    • eprints.soton.ac.uk
    Updated Jun 23, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Leasure, Douglas; Dooley, Claire; Tatem, Andrew (2021). A simulation study exploring weighted likelihood models to recover unbiased population estimates from weighted survey data [Dataset]. http://doi.org/10.5258/SOTON/WP00706
    Explore at:
    Dataset updated
    Jun 23, 2021
    Dataset provided by
    University of Southampton
    Authors
    Leasure, Douglas; Dooley, Claire; Tatem, Andrew
    Description

    This report describes a simulation study exploring weighted-precision and weighted-likelihood models (R, Stan and JAGS software) to recover unbiased estimates of population sizes from weighted survey data.

  19. f

    Supplement 1. R and C code used for the analysis of the great tits data set...

    • datasetcatalog.nlm.nih.gov
    • wiley.figshare.com
    Updated Aug 10, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kidd, Lindall R.; Matechou, Eleni; Garroway, Colin J.; Cheng, San Chye (2016). Supplement 1. R and C code used for the analysis of the great tits data set and for the simulation study presented in Appendix B. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001519991
    Explore at:
    Dataset updated
    Aug 10, 2016
    Authors
    Kidd, Lindall R.; Matechou, Eleni; Garroway, Colin J.; Cheng, San Chye
    Description

    File List AllFunctions.R (MD5: d3120c7372ab36b802dd9f0c01138f1a) Functions needed to fit the models presented in the paper. AllFits.R (MD5: f7caa2c394d6c710195c1bca762d1851) R code for fitting the models to the data set of great tits. Simulations.R (MD5: d3a8a230b93a10bc74420f6694b6e8b1) R code for performing the simulation study presented in Appendix B. PMdata.csv (MD5: 6fa6abb6c8240ba8eb0100c8031182ae) The data set of already marked female great tits. PUdata.csv (MD5: 19f72c2afdc3736b87c8956dd9c94a3e) The data set of previously unmarked female great tits. Effort.csv (MD5: 95b7da2637fd9b08b9818832e23cb7f5) Data on sampling effort. WithBreeding.dll MD5: ffe47405123db8b78289af44f4e050d3) The log-likelihood in compiled C code. C.zip (MD5: 9452e29327a86ce7bc2ab968bdd720dc) The original C code containing the log-likelihood. Description All of the functions needed to fit the models presented in the paper are in AllFunctions.R. The log-likelihood function is evaluated in C via R and is contained in the WithBreeding.dll file. The original C file is in the folder C.zip. The analysis of the great tits data set (PMdata.csv is the data set of already marked birds and PUdata.csv is the data set of previously unmarked birds) presented in the paper was performed using the functions in AllFits.R while the simulation study presented in Appendix B was performed using Simulations.R. Information on capture and resight effort (number of sites visited etc) is in effort.csv.

  20. Dataset for: Simulation and data-generation for random-effects network...

    • wiley.figshare.com
    txt
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Svenja Elisabeth Seide; Katrin Jensen; Meinhard Kieser (2023). Dataset for: Simulation and data-generation for random-effects network meta-analysis of binary outcome [Dataset]. http://doi.org/10.6084/m9.figshare.8001863.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Wileyhttps://www.wiley.com/
    Authors
    Svenja Elisabeth Seide; Katrin Jensen; Meinhard Kieser
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The performance of statistical methods is frequently evaluated by means of simulation studies. In case of network meta-analysis of binary data, however, available data- generating models are restricted to either inclusion of two-armed trials or the fixed-effect model. Based on data-generation in the pairwise case, we propose a framework for the simulation of random-effect network meta-analyses including multi-arm trials with binary outcome. The only of the common data-generating models which is directly applicable to a random-effects network setting uses strongly restrictive assumptions. To overcome these limitations, we modify this approach and derive a related simulation procedure using odds ratios as effect measure. The performance of this procedure is evaluated with synthetic data and in an empirical example.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
U.S. EPA Office of Research and Development (ORD) (2020). Simulation Data Set [Dataset]. https://catalog.data.gov/dataset/simulation-data-set
Organization logo

Simulation Data Set

Explore at:
Dataset updated
Nov 12, 2020
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description

These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: File format: R workspace file; “Simulated_Dataset.RData”. Metadata (including data dictionary) • y: Vector of binary responses (1: adverse outcome, 0: control) • x: Matrix of covariates; one row for each simulated individual • z: Matrix of standardized pollution exposures • n: Number of simulated individuals • m: Number of exposure time periods (e.g., weeks of pregnancy) • p: Number of columns in the covariate design matrix • alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate) Code Abstract We provide R statistical software code (“CWVS_LMC.txt”) to fit the linear model of coregionalization (LMC) version of the Critical Window Variable Selection (CWVS) method developed in the manuscript. We also provide R code (“Results_Summary.txt”) to summarize/plot the estimated critical windows and posterior marginal inclusion probabilities. Description “CWVS_LMC.txt”: This code is delivered to the user in the form of a .txt file that contains R statistical software code. Once the “Simulated_Dataset.RData” workspace has been loaded into R, the text in the file can be used to identify/estimate critical windows of susceptibility and posterior marginal inclusion probabilities. “Results_Summary.txt”: This code is also delivered to the user in the form of a .txt file that contains R statistical software code. Once the “CWVS_LMC.txt” code is applied to the simulated dataset and the program has completed, this code can be used to summarize and plot the identified/estimated critical windows and posterior marginal inclusion probabilities (similar to the plots shown in the manuscript). Optional Information (complete as necessary) Required R packages: • For running “CWVS_LMC.txt”: • msm: Sampling from the truncated normal distribution • mnormt: Sampling from the multivariate normal distribution • BayesLogit: Sampling from the Polya-Gamma distribution • For running “Results_Summary.txt”: • plotrix: Plotting the posterior means and credible intervals Instructions for Use Reproducibility (Mandatory) What can be reproduced: The data and code can be used to identify/estimate critical windows from one of the actual simulated datasets generated under setting E4 from the presented simulation study. How to use the information: • Load the “Simulated_Dataset.RData” workspace • Run the code contained in “CWVS_LMC.txt” • Once the “CWVS_LMC.txt” code is complete, run “Results_Summary.txt”. Format: Below is the replication procedure for the attached data set for the portion of the analyses using a simulated data set: Data The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publically available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics, and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).

Search
Clear search
Close search
Google apps
Main menu