These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures in each week by subtracting the median exposure for that week and dividing by the interquartile range (IQR), as in the actual application to the true NC birth records data. The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way; the medians and IQRs are not given. This further protects the identifiability of the spatial locations used in the analysis.
This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual-level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed.
It can be accessed through the following means: File format: R workspace file; “Simulated_Dataset.RData”.
Metadata (including data dictionary)
• y: Vector of binary responses (1: adverse outcome, 0: control)
• x: Matrix of covariates; one row for each simulated individual
• z: Matrix of standardized pollution exposures
• n: Number of simulated individuals
• m: Number of exposure time periods (e.g., weeks of pregnancy)
• p: Number of columns in the covariate design matrix
• alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate)
Code Abstract
We provide R statistical software code (“CWVS_LMC.txt”) to fit the linear model of coregionalization (LMC) version of the Critical Window Variable Selection (CWVS) method developed in the manuscript. We also provide R code (“Results_Summary.txt”) to summarize/plot the estimated critical windows and posterior marginal inclusion probabilities.
Description
“CWVS_LMC.txt”: This code is delivered to the user as a .txt file containing R statistical software code. Once the “Simulated_Dataset.RData” workspace has been loaded into R, the code can be used to identify/estimate critical windows of susceptibility and posterior marginal inclusion probabilities.
“Results_Summary.txt”: This code is also delivered as a .txt file containing R statistical software code. Once the “CWVS_LMC.txt” code has been applied to the simulated dataset and the program has completed, this code can be used to summarize and plot the identified/estimated critical windows and posterior marginal inclusion probabilities (similar to the plots shown in the manuscript).
Required R packages:
• For running “CWVS_LMC.txt”:
• msm: sampling from the truncated normal distribution
• mnormt: sampling from the multivariate normal distribution
• BayesLogit: sampling from the Polya-Gamma distribution
• For running “Results_Summary.txt”:
• plotrix: plotting the posterior means and credible intervals
Instructions for Use / Reproducibility
What can be reproduced: the data and code can be used to identify/estimate critical windows from one of the simulated datasets generated under setting E4 of the presented simulation study.
How to use the information:
• Load the “Simulated_Dataset.RData” workspace.
• Run the code contained in “CWVS_LMC.txt”.
• Once the “CWVS_LMC.txt” code is complete, run “Results_Summary.txt”.
Data
The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women.
Availability
Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publicly available. However, we make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This also allows the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics and requires an appropriate data use agreement.
This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics, Oxford University Press, Oxford, UK, 1-30 (2019).
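The weekly standardization described above (subtract the weekly median, divide by the weekly IQR) can be sketched as follows. This is an illustrative Python version, not the authors' R code, and quartile conventions differ slightly between R's defaults and Python's statistics module:

```python
from statistics import median, quantiles

def standardize_weekly(exposures):
    """Standardize a list of per-week exposure lists: for each week,
    subtract the weekly median and divide by the weekly IQR."""
    out = []
    for week in exposures:  # one list of exposures per week
        med = median(week)
        q1, _, q3 = quantiles(week, n=4)  # quartiles (exclusive method)
        iqr = q3 - q1
        out.append([(x - med) / iqr for x in week])
    return out

# Toy example: one week of exposures across five individuals
weeks = [[1.0, 2.0, 3.0, 4.0, 5.0]]
print(standardize_weekly(weeks)[0])
```

Because only the standardized values are distributed (the weekly medians and IQRs are withheld), the transformation cannot be inverted to recover raw exposures.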
License: Attribution-NonCommercial 4.0 (CC BY-NC 4.0), https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This repository contains the data used to generate figures and tables in the unpublished paper "Connecting algorithmic fairness and fair outcomes in a sociotechnical simulation case study of AI-assisted healthcare". In this work, we present a simulation-based approach to explore how statistical definitions of algorithmic fairness translate to fairness in long-term outcomes, using AI-assisted breast cancer screening as a case example. We evaluate four fairness criteria and their impact on mortality rates and socioeconomic disparities, while also considering how radiologists’ reliance on AI and patients’ access to healthcare affect outcomes. Our results highlight how algorithmic fairness does not directly translate into fair and equitable outcomes, underscoring the importance of integrating sociotechnical perspectives in order to gain a holistic understanding of fairness in AI.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
Results of the "Effects of initial values and convergence criterion in the two-parameter logistic model when estimating the latent distribution in BILOG-MG 3" simulation study, published in PLOS ONE.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
The zip file includes the computer simulation code and datasets required for reproducing the study: Evaluation of statistical methods used in the analysis of interrupted time series studies: a simulation study.
License: CC0 1.0, https://spdx.org/licenses/CC0-1.0.html
Social networks are tied to population dynamics; interactions are driven by population density and demographic structure, while social relationships can be key determinants of survival and reproductive success. However, difficulties integrating models used in demography and network analysis have limited research at this interface. We introduce the R package genNetDem for simulating integrated network-demographic datasets. It can be used to create longitudinal social networks and/or capture-recapture datasets with known properties. It incorporates the ability to generate populations and their social networks, generate grouping events using these networks, simulate social network effects on individual survival, and flexibly sample these longitudinal datasets of social associations. By generating co-capture data with known statistical relationships it provides functionality for methodological research. We demonstrate its use with case studies testing how imputation and sampling design influence the success of adding network traits to conventional Cormack-Jolly-Seber (CJS) models. We show that incorporating social network effects in CJS models generates qualitatively accurate results, but with downward-biased parameter estimates when network position influences survival. Biases are greater when fewer interactions are sampled or fewer individuals are observed in each interaction. While our results indicate the potential of incorporating social effects within demographic models, they show that imputing missing network measures alone is insufficient to accurately estimate social effects on survival, pointing to the importance of incorporating network imputation approaches. genNetDem provides a flexible tool to aid these methodological advancements and help researchers test other sampling considerations in social network studies.
Methods: The dataset and code stored here are for Case Studies 1 and 2 in the paper. Datasets were generated using simulations in R. Here we provide 1) the R code used for the simulations; 2) the simulation outputs (as .RDS files); and 3) the R code to analyse simulation outputs and generate the tables and figures in the paper.
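The core idea of a social network effect on survival can be made concrete with a minimal sketch: an individual's survival probability is a logistic function of a network trait such as degree. This is a hypothetical Python illustration (function name and coefficients are invented for exposition, not the genNetDem API):

```python
import math

def survival_prob(degree, intercept=1.0, beta=0.3):
    """Logistic survival probability increasing with network degree
    (hypothetical coefficients, not genNetDem defaults)."""
    return 1.0 / (1.0 + math.exp(-(intercept + beta * degree)))

# Better-connected individuals survive with higher probability
print(survival_prob(0), survival_prob(5))
```

When the sampled network under-measures degree, the fitted beta is attenuated, which is the downward bias the case studies document.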
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
The datasets containing simulation performance results from the current study, in addition to the code to replicate the simulation study in its entirety, are available here. See the README file for a description of the Stata do-files, R-script files, tips to run the code, and the performance result dataset dictionaries.
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
Results of a simulation study on the effect of smoothing algorithms on the performance of parallel analysis in factor analysis of ordered categorical items.
This dataset contains files used in this Monte Carlo simulation study comparing the performance of five statistical models for adjusting design floods for current conditions at sites with known trends. These files include (i) the observed annual peak-flow series in the conterminous US used to inform ranges of known moments and trends used in the simulation experiment, (ii) the 3,000 combinations of Monte Carlo experiment parameters (including sample moments, trends, distribution types, and record lengths), (iii) the 5,000 100-year time series of random uniform variates used as annual non-exceedance probabilities in the generation of synthetic annual peak-flow series, (iv) the simulated and true (known) quantiles associated with the 10% and 1% annual exceedance probabilities conditioned on the last years of the synthetic annual peak-flow series generated through the experiment. This dataset also contains a model archive with the R statistical software code used to execute the study along with a document describing the contents of the archive and providing instructions for reproducing results.
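Items (iii) and (iv) rely on inverse-transform sampling: a uniform variate, treated as an annual non-exceedance probability, is passed through the inverse CDF of the chosen peak-flow distribution. A minimal Python sketch, assuming a log-normal distribution with invented parameters (the study's actual distribution types and moments are in the archive):

```python
import math
from statistics import NormalDist

def lognormal_peak(u, mu_log=8.0, sigma_log=0.5):
    """Map a uniform non-exceedance probability u in (0, 1) to a
    synthetic annual peak flow under an assumed log-normal distribution."""
    z = NormalDist(mu=mu_log, sigma=sigma_log).inv_cdf(u)
    return math.exp(z)

# u = 0.5 recovers the median peak flow, exp(mu_log)
print(lognormal_peak(0.5))
```

Repeating this for each of the 100 uniform variates in a series yields one synthetic annual peak-flow record with known (true) quantiles for comparison against the fitted ones.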
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
Contains the simulated data and Stata code used to produce the results for the manuscript titled "Evaluating methods of outlier detection when benchmarking clinical registry data – a simulation study", accepted for publication in the Health Services and Outcomes Research Methodology Journal.
data_files.zip (code to generate all files in "do_files\simstudy1_preparation.do"):
raw_data - the .dta files produced from running the user-written hiersim command (https://doi.org/10.26180/24480889.v1)
summary_data - the .dta files produced from summarising the results across each unique simulated scenario and method combination (performance measure average and 95% Monte Carlo confidence intervals)
parameter_check - the .dta files produced from summarising the simulated data parameters across each unique simulated scenario (performance measure average and 95% Monte Carlo confidence intervals)
do_files.zip:
simstudy1_preparation.do - the code to run the simulations (using the hiersim command, available at https://doi.org/10.26180/24480889.v1) and create summary datasets (performance measures and parameter checks)
simstudy1_manuscript.do - the code to produce the figures included in the main manuscript
simstudy1_supplementary.do - the code to produce the table and figures included in the manuscript supplementary material
License: CC0 1.0 Universal Public Domain Dedication, https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains the entirety of the files generated during the large Monte Carlo simulation study on fit statistics in multilevel factor analysis.
License: Attribution-ShareAlike 4.0 (CC BY-SA 4.0), https://creativecommons.org/licenses/by-sa/4.0/
This archive contains the simulated collections, their diagnosis data, and the estimates of accuracy. For the full code and description, please refer to https://github.com/julian-urbano/irj2015-reliability
License: CC0 1.0, https://creativecommons.org/publicdomain/zero/1.0/
This data set is intended to be used along with my notebook Linear Regression Notes, which provides a guideline for applying correlation analysis and linear regression models from a statistical approach.
A fictional call center is interested in knowing the relationship between the number of personnel and some variables that measure their performance such as average answer time, average calls per hour, and average time per call. Data were simulated to represent 200 shifts.
Background: Many randomized trials involve measuring a continuous outcome, such as pain, body weight or blood pressure, at baseline and after treatment. In this paper, I compare four possibilities for how such trials can be analyzed: post-treatment; change between baseline and post-treatment; percentage change between baseline and post-treatment; and analysis of covariance (ANCOVA) with baseline score as a covariate. The statistical power of each method was determined for a hypothetical randomized trial under a range of correlations between baseline and post-treatment scores.
Results: ANCOVA has the highest statistical power. Change from baseline has acceptable power when correlation between baseline and post-treatment scores is high; when correlation is low, analyzing only post-treatment scores has reasonable power. Percentage change from baseline has the lowest statistical power and was highly sensitive to changes in variance. Theoretical considerations suggest that percentage change from baseline will also fail to protect from bias in the case of baseline imbalance and will lead to an excess of trials with non-normally distributed outcome data.
Conclusions: Percentage change from baseline should not be used in statistical analysis. Trialists wishing to report this statistic should use another method, such as ANCOVA, and convert the results to a percentage change by using mean baseline scores.
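The power ranking reported above follows from the error variance each analysis leaves behind. Under the standard assumption of equal baseline and post-treatment variance sigma^2 with correlation rho, post-treatment-only analysis leaves sigma^2, change from baseline leaves 2(1 - rho) sigma^2, and ANCOVA leaves (1 - rho^2) sigma^2, which is never larger than either. A quick numeric check (a sketch, not the paper's simulation code):

```python
def error_variance(rho, sigma2=1.0):
    """Residual error variance left by each analysis of a baseline/post
    design with equal variances and baseline/post correlation rho."""
    return {
        "post_only": sigma2,
        "change_score": 2 * (1 - rho) * sigma2,
        "ancova": (1 - rho ** 2) * sigma2,
    }

for rho in (0.2, 0.8):
    print(rho, error_variance(rho))
```

The crossover at rho = 0.5 reproduces the qualitative finding: change scores beat post-only analysis when correlation is high, while ANCOVA dominates throughout.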
License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
This repository contains the R code for the data generation and analysis for the paper:
Zhou, Z., Li, D., Huh, D., Xie, M., & Mun, E. Y. (2024). A Simulation Study of the Performance of Statistical Models for Count Outcomes with Excessive Zeros. Statistics in Medicine. https://doi.org/10.1002/sim.10198
Abstract
Background: Outcome measures that are count variables with excessive zeros are common in health behaviors research. Examples include the number of standard drinks consumed or alcohol-related problems experienced over time. There is a lack of empirical data about the relative performance of prevailing statistical models for assessing the efficacy of interventions when outcomes are zero-inflated, particularly compared with recently developed marginalized count regression approaches for such data. Methods: The current simulation study examined five commonly used approaches for analyzing count outcomes, including two linear models (with outcomes on raw and log-transformed scales, respectively) and three prevailing count distribution-based models (i.e., Poisson, negative binomial, and zero-inflated Poisson (ZIP) models). We also considered the marginalized zero-inflated Poisson (MZIP) model, a novel alternative that estimates the overall effects on the population mean while adjusting for zero-inflation. Motivated by alcohol misuse prevention trials, extensive simulations were conducted to evaluate and compare the statistical power and Type I error rate of candidate statistical models and approaches across data conditions that varied in sample size (N = 100 to 500), zero rate (0.2 to 0.8), and intervention effect size. Results: Under zero-inflation, the Poisson model failed to control the Type I error rate, resulting in more false positive results than expected. When the intervention effects on the zero (vs. non-zero) and count parts were in the same direction, the MZIP model had the highest statistical power, followed by the linear model with the outcome on the raw scale, the negative binomial model, and the ZIP model. The performance of the linear model with a log-transformed outcome variable was unsatisfactory. When only one of the effects on the zero (vs. non-zero) part and the count part existed, the ZIP model had the highest statistical power.
Conclusions: The MZIP model demonstrated better statistical properties in detecting true intervention effects and controlling false positive results for zero-inflated count outcomes. This MZIP model may serve as an appealing analytical approach to evaluating overall intervention effects in studies with count outcomes marked by excessive zeros.
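The zero-inflation at issue can be made concrete. Under a ZIP model with structural-zero probability pi and Poisson rate lambda, P(Y = 0) = pi + (1 - pi) * exp(-lambda), and the marginal (population) mean that MZIP models directly is E[Y] = (1 - pi) * lambda. A short Python sketch of these two quantities (the pi and lambda values are illustrative):

```python
import math

def zip_zero_prob(pi, lam):
    """P(Y = 0) under a zero-inflated Poisson: structural zeros
    plus chance zeros from the Poisson component."""
    return pi + (1 - pi) * math.exp(-lam)

def zip_mean(pi, lam):
    """Marginal mean E[Y] = (1 - pi) * lambda."""
    return (1 - pi) * lam

# e.g. 40% structural zeros and a Poisson rate of 2 events
print(zip_zero_prob(0.4, 2.0), zip_mean(0.4, 2.0))
```

Because the zero probability exceeds the plain-Poisson value exp(-lambda) whenever pi > 0, a Poisson fit understates the variance, which is consistent with its inflated Type I error rate above.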
License: etalab-2.0, https://spdx.org/licenses/etalab-2.0.html
Numerical simulation data needed to reproduce the figures presented in the article "Rise and fall of a multicomponent droplet in a surrounding fluid: Simulation study of a bumpy path".
Dataset for the figures in the article "Simulation Study of mmWave 5G-enabled Medical Extended Reality (MXR)".
License: Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0), https://creativecommons.org/licenses/by-nc-nd/4.0/
A simulation study was carried out that took into account the item properties of extremeness (difficulty, location) and consistency. The background idea is that a scale should be defined by a minimum of five items. In addition, the averaged bias and sampling error of the five items were also inspected. Files included in the dataset: Data LINEAL: the items are analysed based on linear factor analysis; Data GRADED: the items are analysed based on non-linear factor analysis.
This report describes a simulation study exploring weighted-precision and weighted-likelihood models (R, Stan and JAGS software) to recover unbiased estimates of population sizes from weighted survey data.
File List
• AllFunctions.R (MD5: d3120c7372ab36b802dd9f0c01138f1a): Functions needed to fit the models presented in the paper.
• AllFits.R (MD5: f7caa2c394d6c710195c1bca762d1851): R code for fitting the models to the data set of great tits.
• Simulations.R (MD5: d3a8a230b93a10bc74420f6694b6e8b1): R code for performing the simulation study presented in Appendix B.
• PMdata.csv (MD5: 6fa6abb6c8240ba8eb0100c8031182ae): The data set of already marked female great tits.
• PUdata.csv (MD5: 19f72c2afdc3736b87c8956dd9c94a3e): The data set of previously unmarked female great tits.
• Effort.csv (MD5: 95b7da2637fd9b08b9818832e23cb7f5): Data on sampling effort.
• WithBreeding.dll (MD5: ffe47405123db8b78289af44f4e050d3): The log-likelihood in compiled C code.
• C.zip (MD5: 9452e29327a86ce7bc2ab968bdd720dc): The original C code containing the log-likelihood.
Description
All of the functions needed to fit the models presented in the paper are in AllFunctions.R. The log-likelihood function is evaluated in C via R and is contained in the WithBreeding.dll file. The original C file is in the folder C.zip. The analysis of the great tits data set (PMdata.csv is the data set of already marked birds and PUdata.csv is the data set of previously unmarked birds) presented in the paper was performed using the functions in AllFits.R, while the simulation study presented in Appendix B was performed using Simulations.R. Information on capture and resight effort (number of sites visited, etc.) is in Effort.csv.
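The listed MD5 checksums can be verified after download; a minimal sketch in Python (run it in the directory holding the files; only two of the eight entries are shown):

```python
import hashlib

def md5_of(path):
    """Compute the MD5 hex digest of a file, reading in chunks."""
    h = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(8192), b""):
            h.update(chunk)
    return h.hexdigest()

expected = {
    "AllFunctions.R": "d3120c7372ab36b802dd9f0c01138f1a",
    "AllFits.R": "f7caa2c394d6c710195c1bca762d1851",
}
# After downloading, compare, e.g.:
#   md5_of("AllFunctions.R") == expected["AllFunctions.R"]
```

A mismatch indicates a corrupted or altered download, so the files should be re-fetched before running any of the analysis code.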
License: CC0 1.0 Universal Public Domain Dedication, https://creativecommons.org/publicdomain/zero/1.0/
The performance of statistical methods is frequently evaluated by means of simulation studies. In the case of network meta-analysis of binary data, however, available data-generating models are restricted to either the inclusion of two-armed trials only or the fixed-effect model. Based on data generation in the pairwise case, we propose a framework for the simulation of random-effects network meta-analyses including multi-arm trials with binary outcomes. The only one of the common data-generating models that is directly applicable to a random-effects network setting relies on strongly restrictive assumptions. To overcome these limitations, we modify this approach and derive a related simulation procedure using odds ratios as the effect measure. The performance of this procedure is evaluated with synthetic data and in an empirical example.
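The basic odds-ratio generation step can be sketched: given a baseline event probability and a log odds ratio for an arm, the arm-specific event probability comes from shifting on the logit scale. A minimal Python sketch (the procedure in the paper additionally handles random effects and multi-arm correlation, which are omitted here):

```python
import math

def expit(x):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-x))

def logit(p):
    return math.log(p / (1.0 - p))

def event_prob(p_baseline, log_or):
    """Event probability in an arm whose odds ratio versus
    the baseline arm is exp(log_or)."""
    return expit(logit(p_baseline) + log_or)

# An odds ratio of 2 applied to a baseline risk of 0.5 gives odds 2,
# i.e. an event probability of 2/3
print(event_prob(0.5, math.log(2.0)))
```

Binary outcomes for each simulated trial arm are then drawn as Bernoulli variables with these probabilities.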