100+ datasets found

Health Survey for England, 2003-2005: Multilevel Modelling Teaching Dataset
beta.ukdataservice.ac.uk
datacatalogue.cessda.eu
Updated 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cathie Marsh Centre For Census University Of Manchester (2011). Health Survey for England, 2003-2005: Multilevel Modelling Teaching Dataset [Dataset]. http://doi.org/10.5255/ukda-sn-6765-1
Explore at:
Unique identifier
https://doi.org/10.5255/ukda-sn-6765-1
Dataset updated
2011
Dataset provided by
DataCitehttps://www.datacite.org/
National Centre for Social Research
Authors
Cathie Marsh Centre For Census University Of Manchester
Description
The Health Survey for England, 2003-2005: Multilevel Modelling Teaching Dataset has been prepared as a resource for those interested in learning multilevel modelling techniques. It was first presented as part of a workshop entitled 'Introducing multilevel models and applying them to the Health Survey for England using MLwiN'. The HSE teaching dataset is available in both Stata and MLwIN formats and is accompanied by a practical guide that includes the multilevel modelling practical exercises. A separate document provides information on the teaching dataset and materials.

The main dataset is an edited version of the Health Survey for England (HSE) data from 2003, 2004 and 2005 (the full HSEs are at the UK Data Archive under SNs 5098, 5439 and 5675). Details of the recoding of HSE variables for the teaching dataset and how the aggregate data were produced can be found in the documentation.

WARNING – Users should note that this dataset is intended as a learning resource and should not be used for research purposes. In particular the dataset uses adult measures of Body Mass Index (BMI) for children and so the results from the data should not be reported in research contexts.
n
Multilevel modeling of time-series cross-sectional data reveals the dynamic...
data.niaid.nih.gov
datadryad.org
zip
Updated Mar 6, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kodai Kusano (2020). Multilevel modeling of time-series cross-sectional data reveals the dynamic interaction between ecological threats and democratic development [Dataset]. http://doi.org/10.5061/dryad.547d7wm3x
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.547d7wm3x
Dataset updated
Mar 6, 2020
Dataset provided by
University of Nevada, Reno
Authors
Kodai Kusano
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
What is the relationship between environment and democracy? The framework of cultural evolution suggests that societal development is an adaptation to ecological threats. Pertinent theories assume that democracy emerges as societies adapt to ecological factors such as higher economic wealth, lower pathogen threats, less demanding climates, and fewer natural disasters. However, previous research confused within-country processes with between-country processes and erroneously interpreted between-country findings as if they generalize to within-country mechanisms. In this article, we analyze a time-series cross-sectional dataset to study the dynamic relationship between environment and democracy (1949-2016), accounting for previous misconceptions in levels of analysis. By separating within-country processes from between-country processes, we find that the relationship between environment and democracy not only differs by countries but also depends on the level of analysis. Economic wealth predicts increasing levels of democracy in between-country comparisons, but within-country comparisons show that democracy declines as countries become wealthier over time. This relationship is only prevalent among historically wealthy countries but not among historically poor countries, whose wealth also increased over time. By contrast, pathogen prevalence predicts lower levels of democracy in both between-country and within-country comparisons. Our longitudinal analyses identifying temporal precedence reveal that not only reductions in pathogen prevalence drive future democracy, but also democracy reduces future pathogen prevalence and increases future wealth. These nuanced results contrast with previous analyses using narrow, cross-sectional data. As a whole, our findings illuminate the dynamic process by which environment and democracy shape each other.

Methods Our Time-Series Cross-Sectional data combine various online databases. Country names were first identified and matched using R-package “countrycode” (Arel-Bundock, Enevoldsen, & Yetman, 2018) before all datasets were merged. Occasionally, we modified unidentified country names to be consistent across datasets. We then transformed “wide” data into “long” data and merged them using R’s Tidyverse framework (Wickham, 2014). Our analysis begins with the year 1949, which was occasioned by the fact that one of the key time-variant level-1 variables, pathogen prevalence was only available from 1949 on. See our Supplemental Material for all data, Stata syntax, R-markdown for visualization, supplemental analyses and detailed results (available at https://osf.io/drt8j/).
f
Supplementary Material for: Analyzing Change: A Primer on Multilevel Models...
karger.figshare.com
txt
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Holden J.E.; Kelley K.; Agarwal R. (2023). Supplementary Material for: Analyzing Change: A Primer on Multilevel Models with Applications to Nephrology [Dataset]. http://doi.org/10.6084/m9.figshare.4785667.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.4785667.v1
Dataset updated
May 31, 2023
Dataset provided by
Karger Publishers
Authors
Holden J.E.; Kelley K.; Agarwal R.
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The analysis of change is central to the study of kidney research. In the past 25 years, newer and more sophisticated methods for the analysis of change have been developed; however, as of yet these newer methods are underutilized in the field of kidney research. Repeated measures ANOVA is the traditional model that is easy to understand and simpler to interpret, but it may not be valid in complex real-world situations. Problems with the assumption of sphericity, unit of analysis, lack of consideration for different types of change, and missing data, in the repeated measures ANOVA context are often encountered. Multilevel modeling, a newer and more sophisticated method for the analysis of change, overcomes these limitations and provides a better framework for understanding the true nature of change. The present article provides a primer on the use of multilevel modeling to study change. An example from a clinical study is detailed and the method for implementation in SAS is provided.
H
Replication data for: How many countries for multilevel modeling? A...
dataverse.harvard.edu
bin +3
Updated May 26, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harvard Dataverse (2015). Replication data for: How many countries for multilevel modeling? A comparison of Frequentist and Bayesian approaches. [Dataset]. http://doi.org/10.7910/DVN/WDA163
Explore at:
tsv(1285455), text/x-stata-syntax; charset=us-ascii(1117), text/plain; charset=us-ascii(375), text/x-stata-syntax; charset=us-ascii(179), text/plain; charset=us-ascii(2251), bin(67379), text/plain; charset=us-ascii(797)Available download formats
Unique identifier
https://doi.org/10.7910/DVN/WDA163
Dataset updated
May 26, 2015
Dataset provided by
Harvard Dataverse
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Monaco, Monte Carlo Studies / European Union
Description
Researchers in comparative research increasingly use multilevel models to test effects of country level factors on individual behavior and preferences. However, the justification of widely employed estimation strategies is asymptotic and applications in comparative politics routinely involve only a small number of countries. Thus researchers and reviewers often wonder if these models are applicable at all. In other words, how many countries do we need for multilevel modeling? I present results from a large scale Monte Carlo experiment comparing the performance of multilevel models when few countries are available. I find that maximum likelihood estimates and confidence intervals can be severely biased, especially in models including cross-level interactions. In contrast, the Bayesian approach proves to be far more robust, and yields considerably more conservative tests.
i
Dataset for “Multilevel Parallel Coevolution for Green Scheduling”
ieee-dataport.org
Updated Nov 1, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jinglian Wang (2019). Dataset for “Multilevel Parallel Coevolution for Green Scheduling” [Dataset]. https://ieee-dataport.org/documents/dataset-multilevel-parallel-coevolution-green-scheduling
Explore at:
Dataset updated
Nov 1, 2019
Authors
Jinglian Wang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Normal 0

7.8 磅 0 2

false false false

EN-US ZH-CN X-NONE
z
Missing data in the analysis of multilevel and dependent data (Example data...
zenodo.org
bin
Updated Jul 20, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Simon Grund; Simon Grund; Oliver Lüdtke; Oliver Lüdtke; Alexander Robitzsch; Alexander Robitzsch (2023). Missing data in the analysis of multilevel and dependent data (Example data sets) [Dataset]. http://doi.org/10.5281/zenodo.7773614
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7773614
Dataset updated
Jul 20, 2023
Dataset provided by
Springer
Authors
Simon Grund; Simon Grund; Oliver Lüdtke; Oliver Lüdtke; Alexander Robitzsch; Alexander Robitzsch
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Example data sets for the book chapter titled "Missing Data in the Analysis of Multilevel and Dependent Data" submitted for publication in the second edition of "Dependent Data in Social Science Research" (Stemmler et al., 2015). This repository includes the data sets used in both example analyses (Examples 1 and 2) in two file formats (binary ".rda" for use in R; plain-text ".dat").

The data sets contain simulated data from 23,376 (Example 1) and 23,072 (Example 2) individuals from 2,000 groups on four variables:

ID = group identifier (1-2000)
x = numeric (Level 1)
y = numeric (Level 1)
w = binary (Level 2)

In all data sets, missing values are coded as "NA".
h
multilevel-legal-reasoning
huggingface.co
Updated May 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Computational Intelligence and Operations Laboratory (CIOL) (2025). multilevel-legal-reasoning [Dataset]. https://huggingface.co/datasets/ciol-research/multilevel-legal-reasoning
Explore at:
Dataset updated
May 2, 2025
Dataset authored and provided by
Computational Intelligence and Operations Laboratory (CIOL)
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Legal Reasoning Dataset with Multilevel Human and Model-Annotated Explanations

Prepared by Mst Rafia Islam, Umong Sain, Azmine Toushik Wasi Prepared as a part of Reasoning Datasets Competition by Bespoke Labs, Hugging Face, and Together.ai.

🧭 Purpose and Scope

The Legal Reasoning Dataset aims to support the evaluation and training of legal reasoning systems, particularly in multilingual or jurisdiction-agnostic contexts. It focuses on international acts and treaties… See the full description on the dataset page: https://huggingface.co/datasets/ciol-research/multilevel-legal-reasoning.
d
Multi-level marketing - list of names to be corrected
data.gov.tw
csv
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fair Trade Commission, EY, Multi-level marketing - list of names to be corrected [Dataset]. https://data.gov.tw/en/datasets/6128
Explore at:
csvAvailable download formats
Dataset authored and provided by
Fair Trade Commission, EY
License
https://data.gov.tw/licensehttps://data.gov.tw/license
Description
The multi-level marketing business has not yet prepared the required reporting materials, and according to Article 6, Paragraph 1 of the Multi-level Marketing Management Measures, it is considered as not having been reported.
H
Replication Data for: Understanding, choosing, and unifying multilevel and...
dataverse.harvard.edu
Updated Sep 29, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CHAD HAZLETT (2022). Replication Data for: Understanding, choosing, and unifying multilevel and fixed effect approaches [Dataset]. http://doi.org/10.7910/DVN/VZDPSQ
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/VZDPSQ
Dataset updated
Sep 29, 2022
Dataset provided by
Harvard Dataverse
Authors
CHAD HAZLETT
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Replication materials
Data from: Multilevel Modeling of Training Needs in Artificial Intelligence
zenodo.org
bin, xls
Updated Mar 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Veronica Distefano; Veronica Distefano; Sandra De iaco; Sandra De iaco; Sabrina Maggio; Sabrina Maggio (2025). Multilevel Modeling of Training Needs in Artificial Intelligence [Dataset]. http://doi.org/10.5281/zenodo.13890780
Explore at:
bin, xlsAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.13890780
Dataset updated
Mar 7, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Veronica Distefano; Veronica Distefano; Sandra De iaco; Sandra De iaco; Sabrina Maggio; Sabrina Maggio
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Nowadays, Artificial Intelligence (AI) is playing a rapidly increasing role in several fields of research and in almost all sectors of real life. However, few studies have assessed the effects of AI applications on training needs. This paper proposes an innovative multilevel modeling in order to investigate Awareness, Attitude and Trust towards AI and their reflections on learning needs. In particular, it is shown how a machine learning variable selection algorithm can support the definition of the optimal subset of all relevant covariates with respect to the outcome variable and improve the multilevel model performance for estimating the probability of educational needs. Thus, starting from a complex web survey to European citizens distributed in eight countries, the estimation of a multilevel binary model, defined on the basis of covariates selected through the Boruta random forest algorithm, is proposed. A discussion on the gender differences of the related estimated multilevel logit models is presented. A sensitivity analysis is also included in order to assess the prediction accuracy of the proposed multilevel logit modeling.

This repository contains data generated for the manuscript: " A two-stage procedure for optimal modeling of the probability of training needs in artificial intelligence". It comprehends: (1) the dataset Data_Boruta_Random_Forest used to estimate the variables importance. (2) the dataset Data_Multilevel to perform the comparison among different multilevel binary models proposed in the paper.
U
Multilevel Monitoring System (MLMS) datasets for wells in the U.S....
data.usgs.gov
catalog.data.gov
Updated Nov 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jason Fisher; Brian Twining (2024). Multilevel Monitoring System (MLMS) datasets for wells in the U.S. Geological Survey - Idaho National Laboratory groundwater monitoring network [Dataset]. http://doi.org/10.5066/P144NWLJ
Explore at:
Unique identifier
https://doi.org/10.5066/P144NWLJ
Dataset updated
Nov 18, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Authors
Jason Fisher; Brian Twining
License
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Time period covered
Sep 27, 2005 - Aug 7, 2023
Area covered
Idaho
Description
A collection of analysis-ready Multilevel Monitoring System (MLMS) datasets for wells in the U.S. Geological Survey (USGS) aquifer-monitoring network, Idaho National Laboratory (INL), Idaho. Administered by the USGS INL Project Office in cooperation with the U.S. Department of Energy.
d
Replication Data for: An introduction to multilevel regression and...
search.dataone.org
dataverse.harvard.edu
Updated Nov 22, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hanretty, Chris (2023). Replication Data for: An introduction to multilevel regression and post-stratification for estimating constituency opinion [Dataset]. http://doi.org/10.7910/DVN/IPPPNU
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/IPPPNU
Dataset updated
Nov 22, 2023
Dataset provided by
Harvard Dataverse
Authors
Hanretty, Chris
Description
This article provides an overview of multilevel regression and post-stratification (MRP). It reviews the stages in estimating opinion for small areas, identifies circumstances in which MRP can go wrong, or go right, and provides a worked example for the UK using publicly available data sources and a previously published post-stratification frame. This archive contains two R source code files and one post-stratification matrix in CSV format.
f
Data from: Analysis of Isocratic-Chromatographic-Retention Data using...
acs.figshare.com
txt
Updated Jun 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Łukasz Kubik; Roman Kaliszan; Paweł Wiczling (2023). Analysis of Isocratic-Chromatographic-Retention Data using Bayesian Multilevel Modeling [Dataset]. http://doi.org/10.1021/acs.analchem.8b04033.s002
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.1021/acs.analchem.8b04033.s002
Dataset updated
Jun 2, 2023
Dataset provided by
ACS Publications
Authors
Łukasz Kubik; Roman Kaliszan; Paweł Wiczling
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
The objective of this work was to develop a multilevel (hierarchical) model based on isocratic-reversed-phase-high-performance-chromatographic data collected in methanol and acetonitrile for 58 chemical compounds. Such a multilevel model is a regression model of the analyte-specific chromatographic measurements, in which all the regression parameters are given a probability model. It is a fundamentally different approach from the most common approach, where parameters are separately estimated for each analyte (without sharing information across analytes and different organic modifiers). The statistical analysis was done with Stan software implementing the Bayesian-statistics inference with Markov-chain Monte Carlo sampling. During the model-building process, a series of multilevel models of different complexity were obtained, such as (1) a model with no pooling (separate models were fitted for each analyte), (2) a model with partial pooling (a common distribution was used for analyte-specific parameters), and (3) a model with partial pooling as well as a regression model relating analyte-specific parameters and analyte-specific properties (QSRR equations). All the models were compared with each other using 10-fold cross-validation. The benefits of multilevel models in inference and predictions were shown. In particular the obtained models allowed us to (i) better understand the data and (ii) solve many routine analytical problems, such as obtaining well-calibrated predictions of retention factors for an analyte in acetonitrile-containing mobile phases given zero, one, or several measurements in methanol-containing mobile phases and vice versa.
f
Results of multilevel mediation analysis.
plos.figshare.com
xls
Updated Jan 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shujun Liu; Luke Sloan; Tarek Al Baghal; Matthew Williams; Paulo Serôdio; Curtis Jessop (2024). Results of multilevel mediation analysis. [Dataset]. http://doi.org/10.1371/journal.pone.0297036.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0297036.t002
Dataset updated
Jan 25, 2024
Dataset provided by
PLOS ONE
Authors
Shujun Liu; Luke Sloan; Tarek Al Baghal; Matthew Williams; Paulo Serôdio; Curtis Jessop
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Previous studies mainly focused on individual-level factors that influence the adoption and usage of mobile technology and social networking sites, with little emphasis paid to the influences of household situations. Using multilevel modelling approach, this study merges household- (n1 = 1,455) and individual-level (n2 = 2,570) data in the U.K. context to investigate (a) whether a household economic capital (HEC) can affect its members’ Twitter adoption, (b) whether the influences are mediated by the member’s activity variety and self-reported efficacy with mobile technology, and (c) whether the members’ traits, including educational level, gross income and residential area, moderate the relationship between HEC and Twitter adoption. Significant direct and indirect associations were discovered between HEC and its members’ Twitter adoption. The educational level and gross income of household members moderated the influence of HEC on individuals’ Twitter adoption.
Chapter 3-Binomial and Ordinal Multilevel Analysis (Dataset)
search.datacite.org
Updated 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kazuma Mizukoshi (2020). Chapter 3-Binomial and Ordinal Multilevel Analysis (Dataset) [Dataset]. http://doi.org/10.7910/dvn/rtqxdf
Explore at:
Unique identifier
https://doi.org/10.7910/dvn/rtqxdf
Dataset updated
2020
Dataset provided by
DataCitehttps://www.datacite.org/
Harvard Dataverse
Authors
Kazuma Mizukoshi
Description
This is the dataset to recreate Figure3.4 in Chapter 3 of my PhD thesis.
Parallel Hierarchical Adaptive Multilevel Project (PHAML)
datasets.ai
catalog.data.gov
0
Updated Aug 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institute of Standards and Technology (2024). Parallel Hierarchical Adaptive Multilevel Project (PHAML) [Dataset]. https://datasets.ai/datasets/parallel-hierarchical-adaptive-multilevel-project-phaml-06003
Explore at:
0Available download formats
Dataset updated
Aug 27, 2024
Dataset authored and provided by
National Institute of Standards and Technologyhttp://www.nist.gov/
Description
Software for the solution of elliptic partial differential equations using finite elements with adaptive mesh refinement and multigrid techniques.
d
Chapter 3-Binomial and Ordinal Multilevel Analysis (R-Script)
search.dataone.org
dataverse.harvard.edu
Updated Nov 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mizukoshi, Kazuma (2023). Chapter 3-Binomial and Ordinal Multilevel Analysis (R-Script) [Dataset]. http://doi.org/10.7910/DVN/FTAIS7
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/FTAIS7
Dataset updated
Nov 23, 2023
Dataset provided by
Harvard Dataverse
Authors
Mizukoshi, Kazuma
Description
This is the R script to recreate Figure 3.4 in Chapter 3 of my PhD thesis.
Multilevel Influences on HIV and Substance Use in a YMSM Cohort (RADAR),...
icpsr.umich.edu
ascii, delimited, r +3
Updated Jun 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mustanski, Brian (2025). Multilevel Influences on HIV and Substance Use in a YMSM Cohort (RADAR), Chicago Metropolitan Area, 2015-2020 [Dataset]. http://doi.org/10.3886/ICPSR37603.v6
Explore at:
stata, r, sas, ascii, delimited, spssAvailable download formats
Unique identifier
https://doi.org/10.3886/ICPSR37603.v6
Dataset updated
Jun 23, 2025
Dataset provided by
Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
Authors
Mustanski, Brian
License
https://www.icpsr.umich.edu/web/ICPSR/studies/37603/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/37603/terms
Time period covered
Feb 1, 2015 - Dec 31, 2020
Area covered
Chicago Metropolitan Area, United States, Chicago, Illinois
Description
The National Institute on Drug Abuse (NIDA) funded RADAR in 2014 to collect multilevel, longitudinal data and biospecimens from an ethnically and racially diverse cohort of young, sexual and gender minorities (SGM; e.g., men who have sex with men (MSM), transgender women, gender non-conforming individuals) who were assigned male at birth (AMAB) (current core cohort n=1,113). The primary objective of this study is to apply a multilevel perspective to a syndemic of health issues associated with human immunodeficiency virus (HIV) in this population. The multilevel design focuses on individual, dyadic (i.e., sexual and romantic relationships), network (i.e., social, drug, and sexual connections) and biologic factors that may be associated with HIV. The cohort contains both HIV-negative and HIV-positive individuals, which allows for the development of a repository of biospecimens and HIV sequence data from both pre-infection and post-infection visits that will help facilitate future projects evaluating substance use, HIV risk, and pathogenesis. A multiple cohort, accelerated longitudinal design was utilized by initially enrolling two existing SGM cohorts and then expanded through the use of convenience and snowball sampling methods. Enrollment criteria varied slightly based on the recruitment method, but overall inclusion criteria required participants to be AMAB, between 16 and 29 years of age, report having had sex with a man in the prior year or identify as a SGM, live in the Chicago metropolitan area, and be an English speaker. Study recruitment opened in February 2015. Participants are followed through the developmental period of late adolescence to early adulthood, which is a critical period of initiation and acceleration of sexual behavior and substance use. Study visits occur every six months.
o
Accompanying simulated data for "Go multivariate: a Monte Carlo study of a...
explore.openaire.eu
data.niaid.nih.gov
Updated Mar 25, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sebastian Mildiner Moraga; Emmeke Aarts (2022). Accompanying simulated data for "Go multivariate: a Monte Carlo study of a multilevel hidden Markov model with categorical data of varying complexity" [Dataset]. http://doi.org/10.5281/zenodo.6384006
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.6384006
Dataset updated
Mar 25, 2022
Authors
Sebastian Mildiner Moraga; Emmeke Aarts
Description
The multilevel hidden Markov model (MHMM) is a promising vehicle to investigate latent dynamics over time in social and behavioral processes. By including continuous individual random effects, the model accommodates variability between individuals, providing individual-specific trajectories and facilitating the study of individual differences. However, the performance of the MHMM has not been sufficiently explored. Currently, there are no practical guidelines on the sample size needed to obtain reliable estimates related to categorical data characteristics We performed an extensive simulation to assess the effect of the number of dependent variables (1-4), the number of individuals (5-90), and the number of observations per individual (100-1600) on the estimation performance of group-level parameters and between-individual variability on a Bayesian MHMM with categorical data of various levels of complexity. We found that using multivariate data generally alleviates the sample size needed and improves the stability of the results. Regarding the estimation of group-level parameters, the number of individuals and observations largely compensate for each other. Meanwhile, only the former drives the estimation of between-individual variability. We conclude with guidelines on the sample size necessary based on the complexity of the data and the study objectives of the practitioners. This repository contains data generated for the manuscript: "Go multivariate: a Monte Carlo study of a multilevel hidden Markov model with categorical data of varying complexity". It comprehends: (1) model outputs (maximum a posteriori estimates) for each repetition (n=100) of each scenario (n=324) of the main simulation, (2) complete model outputs (including estimates for 4000 MCMC iterations) for two chains of each repetition (n=3) of each scenario (n=324). Please note that the empirical data used in the manuscript is not available as part of this repository. A subsample of the data used in the empirical example are openly available as an example data set in the R package mHMMbayes on CRAN. The full data set is available on request from the authors.
Multilevel Event History Analysis Training Datasets, 2003-2005
beta.ukdataservice.ac.uk
Updated 2005
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
F. Steele (2005). Multilevel Event History Analysis Training Datasets, 2003-2005 [Dataset]. http://doi.org/10.5255/ukda-sn-5171-1
Explore at:
Unique identifier
https://doi.org/10.5255/ukda-sn-5171-1
Dataset updated
2005
Dataset provided by
DataCitehttps://www.datacite.org/
University of London, Institute of Education, Centre for Longitudinal Studies
Authors
F. Steele
Description
This study includes five data files and corresponding exercise instructions.

Four of the five data files and instructions were produced from the National Child Development Study datasets for an ESRC-funded workshop on Multilevel Event History Analysis, held in February 2005. The workshop data includes three files in ASCII DAT format and one in SPSS SAV format. Further information and documentation beyond that included in this study, and MLwiN software downloads are available from the Centre for Multilevel Modelling web site.

In addition, for the second edition of the study, example data and documentation for fitting multilevel multiprocess event history models using aML software were added to the dataset (the data file 'amlex.raw'). The aML syntax file that accompanies these data can also be found at the Centre for Multilevel Modelling web site noted above.

The project from which these data were produced was conducted under the ESRC Research Methods programme. It involved the development of multilevel simultaneous equations models for the analysis of correlated event histories. The research was motivated by a study of the interrelationships between partnership (marriage or cohabitation) durations and decisions about childbearing, using event history data from the 1958 and 1970 British Birth Cohort studies (in the case of this dataset, NCDS).

Additional aims and objectives of the project were to develop methodology for the analysis of complex event history data; provide means for implementing methodology in existing software; and provide social scientists with practical training in advanced event history analysis.

Facebook

Twitter

Click to copy link

Link copied

Cite

Cathie Marsh Centre For Census University Of Manchester (2011). Health Survey for England, 2003-2005: Multilevel Modelling Teaching Dataset [Dataset]. http://doi.org/10.5255/ukda-sn-6765-1

Health Survey for England, 2003-2005: Multilevel Modelling Teaching Dataset

Explore at:

484 scholarly articles cite this dataset (View in Google Scholar)

Unique identifier

https://doi.org/10.5255/ukda-sn-6765-1

Dataset updated

2011

Dataset provided by

DataCitehttps://www.datacite.org/
National Centre for Social Research

Authors

Cathie Marsh Centre For Census University Of Manchester

Description

The Health Survey for England, 2003-2005: Multilevel Modelling Teaching Dataset has been prepared as a resource for those interested in learning multilevel modelling techniques. It was first presented as part of a workshop entitled 'Introducing multilevel models and applying them to the Health Survey for England using MLwiN'. The HSE teaching dataset is available in both Stata and MLwIN formats and is accompanied by a practical guide that includes the multilevel modelling practical exercises. A separate document provides information on the teaching dataset and materials.

The main dataset is an edited version of the Health Survey for England (HSE) data from 2003, 2004 and 2005 (the full HSEs are at the UK Data Archive under SNs 5098, 5439 and 5675). Details of the recoding of HSE variables for the teaching dataset and how the aggregate data were produced can be found in the documentation.

WARNING – Users should note that this dataset is intended as a learning resource and should not be used for research purposes. In particular the dataset uses adult measures of Body Mass Index (BMI) for children and so the results from the data should not be reported in research contexts.

Clear search

Close search

Google apps

Main menu

Health Survey for England, 2003-2005: Multilevel Modelling Teaching Dataset

Multilevel modeling of time-series cross-sectional data reveals the dynamic...

Supplementary Material for: Analyzing Change: A Primer on Multilevel Models...

Replication data for: How many countries for multilevel modeling? A...

Dataset for “Multilevel Parallel Coevolution for Green Scheduling”

Missing data in the analysis of multilevel and dependent data (Example data...

multilevel-legal-reasoning

Multi-level marketing - list of names to be corrected

Replication Data for: Understanding, choosing, and unifying multilevel and...

Data from: Multilevel Modeling of Training Needs in Artificial Intelligence

Multilevel Monitoring System (MLMS) datasets for wells in the U.S....

Replication Data for: An introduction to multilevel regression and...

Data from: Analysis of Isocratic-Chromatographic-Retention Data using...

Results of multilevel mediation analysis.

Chapter 3-Binomial and Ordinal Multilevel Analysis (Dataset)

Parallel Hierarchical Adaptive Multilevel Project (PHAML)

Chapter 3-Binomial and Ordinal Multilevel Analysis (R-Script)

Multilevel Influences on HIV and Substance Use in a YMSM Cohort (RADAR),...

Accompanying simulated data for "Go multivariate: a Monte Carlo study of a...

Multilevel Event History Analysis Training Datasets, 2003-2005

Health Survey for England, 2003-2005: Multilevel Modelling Teaching Dataset