100+ datasets found

f
STATA data sheet
datasetcatalog.nlm.nih.gov
figshare.com
Updated Jun 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Benbarka, Siraj (2023). STATA data sheet [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001112819
Explore at:
Dataset updated
Jun 11, 2023
Authors
Benbarka, Siraj
Description
These are the STATA data sheets imported from excel. These are used directly for meta-analysis
Stata code for analysis
catalog.data.gov
datasets.ai
+1more
Updated Jan 19, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2021). Stata code for analysis [Dataset]. https://catalog.data.gov/dataset/stata-code-for-analysis
Explore at:
Dataset updated
Jan 19, 2021
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description
This is STATA software code for analysis on publicly available NHANES data
m
Example Stata syntax and data construction for negative binomial time series...
data.mendeley.com
Updated Nov 2, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sarah Price (2022). Example Stata syntax and data construction for negative binomial time series regression [Dataset]. http://doi.org/10.17632/3mj526hgzx.2
Explore at:
Unique identifier
https://doi.org/10.17632/3mj526hgzx.2
Dataset updated
Nov 2, 2022
Authors
Sarah Price
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We include Stata syntax (dummy_dataset_create.do) that creates a panel dataset for negative binomial time series regression analyses, as described in our paper "Examining methodology to identify patterns of consulting in primary care for different groups of patients before a diagnosis of cancer: an exemplar applied to oesophagogastric cancer". We also include a sample dataset for clarity (dummy_dataset.dta), and a sample of that data in a spreadsheet (Appendix 2).

The variables contained therein are defined as follows:

case: binary variable for case or control status (takes a value of 0 for controls and 1 for cases).

patid: a unique patient identifier.

time_period: A count variable denoting the time period. In this example, 0 denotes 10 months before diagnosis with cancer, and 9 denotes the month of diagnosis with cancer,

ncons: number of consultations per month.

period0 to period9: 10 unique inflection point variables (one for each month before diagnosis). These are used to test which aggregation period includes the inflection point.

burden: binary variable denoting membership of one of two multimorbidity burden groups.

We also include two Stata do-files for analysing the consultation rate, stratified by burden group, using the Maximum likelihood method (1_menbregpaper.do and 2_menbregpaper_bs.do).

Note: In this example, for demonstration purposes we create a dataset for 10 months leading up to diagnosis. In the paper, we analyse 24 months before diagnosis. Here, we study consultation rates over time, but the method could be used to study any countable event, such as number of prescriptions.
d
Download statistics GESIS Data Archive
da-ra.de
Updated Apr 27, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GESIS - Data Archive for the Social Sciences (2018). Download statistics GESIS Data Archive [Dataset]. http://doi.org/10.4232/1.12979
Explore at:
Unique identifier
https://doi.org/10.4232/1.12979
Dataset updated
Apr 27, 2018
Dataset provided by
GESIS Data Archive
da|ra
Authors
GESIS - Data Archive for the Social Sciences
Time period covered
Jan 1, 2004 - Dec 31, 2017
Description
General information: The data sets contain information on how often materials of studies available through GESIS: Data Archive for the Social Sciences were downloaded and/or ordered through one of the archive´s plattforms/services between 2004 and 2017.

Sources and plattforms: Study materials are accessible through various GESIS plattforms and services: Data Catalogue (DBK), histat, datorium, data service (and others).

Years available: - Data Catalogue: 2012-2017 - data service: 2006-2017 - datorium: 2014-2017 - histat: 2004-2017

Data sets: Data set ZA6899_Datasets_only_all_sources contains information on how often data files such as those with dta- (Stata) or sav- (SPSS) extension have been downloaded. Identification of data files is handled semi-automatically (depending on the plattform/serice). Multiple downloads of one file by the same user (identified through IP-address or username for registered users) on the same days are only counted as one download.

Data set ZA6899_Doc_and_Data_all_sources contains information on how often study materials have been downloaded. Multiple downloads of any file of the same study by the same user (identified through IP-address or username for registered users) on the same days are only counted as one download.

Both data sets are available in three formats: csv (quoted, semicolon-separated), dta (Stata v13, labeled) and sav (SPSS, labeled). All formats contain identical information.

Variables: Variables/columns in both data sets are identical. za_nr ´Archive study number´ version ´GESIS Archiv Version´ doi ´Digital Object Identifier´ StudyNo ´Study number of respective study´ Title ´English study title´ Title_DE ´German study title´ Access ´Access category (0, A, B, C, D, E)´ PubYear ´Publication year of last version of the study´ inZACAT ´Study is currently also available via ZACAT´ inHISTAT ´Study is currently also available via HISTAT´ inDownloads ´There are currently data files available for download for this study in DBK or datorium´ Total ´All downloads combined´ downloads_2004 ´downloads/orders from all sources combined in 2004´ [up to ...] downloads_2017 ´downloads/orders from all sources combined in 2017´ d_2004_dbk ´downloads from source dbk in 2004´ [up to ...] d_2017_dbk ´downloads from source dbk in 2017´ d_2004_histat ´downloads from source histat in 2004´ [up to ...] d_2017_histat ´downloads from source histat in 2017´ d_2004_dataservice ´downloads/orders from source dataservice in 2004´ [up to ...] d_2017_dataservice ´downloads/orders from source dataservice in 2017´

More information is available within the codebook.
U
Statistical Computing: Stata
dataverse-staging.rdmc.unc.edu
pdf +4
Updated Feb 27, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Catherine Zimmer; Catherine Zimmer (2014). Statistical Computing: Stata [Dataset]. https://dataverse-staging.rdmc.unc.edu/dataset.xhtml?persistentId=hdl:1902.29/11638
Explore at:
pdf(193387), tsv(126), text/plain; charset=us-ascii(146), pdf(318256), tsv(4026), text/plain; charset=us-ascii(443), pdf(352818), text/x-stata-syntax; charset=us-ascii(547), tsv(219941), tsv(75), text/x-stata-syntax; charset=us-ascii(211), tsv(3591), tsv(1152), tsv(2874), tsv(1157), xls(2549)Available download formats
Dataset updated
Feb 27, 2014
Dataset provided by
UNC Dataverse
Authors
Catherine Zimmer; Catherine Zimmer
License
https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/4.0/customlicense?persistentId=hdl:1902.29/11638https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/4.0/customlicense?persistentId=hdl:1902.29/11638
Description
This is a 3-part short course (held over three afternoons). Stata part 1 will offer an introduction to Stata for Windows. Part 2 will teach entering data in Stata, working with Stata do files, and show how to append, sort, and merge data sets in Stata. Part 3 teaches how to perform basic statistical procedures and how to draw sub samples from large datasets.
s
Data from: Data files used to study change dynamics in software systems
figshare.swinburne.edu.au
pdf
Updated Jul 22, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rajesh Vasa (2024). Data files used to study change dynamics in software systems [Dataset]. http://doi.org/10.25916/sut.26288227.v1
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.25916/sut.26288227.v1
Dataset updated
Jul 22, 2024
Dataset provided by
Swinburne
Authors
Rajesh Vasa
License
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Description
It is a widely accepted fact that evolving software systems change and grow. However, it is less well-understood how change is distributed over time, specifically in object oriented software systems. The patterns and techniques used to measure growth permit developers to identify specific releases where significant change took place as well as to inform them of the longer term trend in the distribution profile. This knowledge assists developers in recording systemic and substantial changes to a release, as well as to provide useful information as input into a potential release retrospective. However, these analysis methods can only be applied after a mature release of the code has been developed. But in order to manage the evolution of complex software systems effectively, it is important to identify change-prone classes as early as possible. Specifically, developers need to know where they can expect change, the likelihood of a change, and the magnitude of these modifications in order to take proactive steps and mitigate any potential risks arising from these changes. Previous research into change-prone classes has identified some common aspects, with different studies suggesting that complex and large classes tend to undergo more changes and classes that changed recently are likely to undergo modifications in the near future. Though the guidance provided is helpful, developers need more specific guidance in order for it to be applicable in practice. Furthermore, the information needs to be available at a level that can help in developing tools that highlight and monitor evolution prone parts of a system as well as support effort estimation activities. The specific research questions that we address in this chapter are: (1) What is the likelihood that a class will change from a given version to the next? (a) Does this probability change over time? (b) Is this likelihood project specific, or general? (2) How is modification frequency distributed for classes that change? (3) What is the distribution of the magnitude of change? Are most modifications minor adjustments, or substantive modifications? (4) Does structural complexity make a class susceptible to change? (5) Does popularity make a class more change-prone? We make recommendations that can help developers to proactively monitor and manage change. These are derived from a statistical analysis of change in approximately 55000 unique classes across all projects under investigation. The analysis methods that we applied took into consideration the highly skewed nature of the metric data distributions. The raw metric data (4 .txt files and 4 .log files in a .zip file measuring ~2MB in total) is provided as a comma separated values (CSV) file, and the first line of the CSV file contains the header. A detailed output of the statistical analysis undertaken is provided as log files generated directly from Stata (statistical analysis software).
d
DHS data extractors for Stata
search.dataone.org
dataverse.harvard.edu
Updated Nov 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Emily Oster (2023). DHS data extractors for Stata [Dataset]. http://doi.org/10.7910/DVN/RRX3QD
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/RRX3QD
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Emily Oster
Description
This package contains two files designed to help read individual level DHS data into Stata. The first file addresses the problem that versions of Stata before Version 7/SE will read in only up to 2047 variables and most of the individual files have more variables than that. The file will read in the .do, .dct and .dat file and output new .do and .dct files with only a subset of the variables specified by the user. The second file deals with earlier DHS surveys in which .do and .dct file do not exist and only .sps and .sas files are provided. The file will read in the .sas and .sps files and output a .dct and .do file. If necessary the first file can then be run again to select a subset of variables.
Data and Code (Stata format)
figshare.com
txt
Updated Aug 31, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
陈亮单 (2024). Data and Code (Stata format) [Dataset]. http://doi.org/10.6084/m9.figshare.26886763.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.26886763.v1
Dataset updated
Aug 31, 2024
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
陈亮单
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
file1: Regression models for intentional injury crimes.file2: Regression models for bribery and corruption.
m
The data and STATA commands for Chinese Agriculture in the Age of High-speed...
data.mendeley.com
Updated Jan 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yanyan Gao (2024). The data and STATA commands for Chinese Agriculture in the Age of High-speed Rail: Effects on Agricultural Value Added and Food Output [Dataset]. http://doi.org/10.17632/cbw6dr9p32.1
Explore at:
Unique identifier
https://doi.org/10.17632/cbw6dr9p32.1
Dataset updated
Jan 9, 2024
Authors
Yanyan Gao
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
China
Description
This data and commands replicate all tables and figures in the paper titled "Chinese Agriculture in the Age of High-speed Rail: Effects on Agricultural Value Added and Food Output" publihsed in Agribusiness, 2023, 39 (2), 387-405. If using the data in this paper, please cite Gao, Y., & Wang, X. (2023). Chinese agriculture in the age of high-speed rail: Effects on agricultural value added and food output. Agribusiness, 39, 387–405. https://doi.org/10.1002/agr.21771
Datasets for One to One Merge in Stata
kaggle.com
Updated Feb 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
iFinance Tutor (2023). Datasets for One to One Merge in Stata [Dataset]. https://www.kaggle.com/datasets/ifinancetutor/datasets-for-one-to-one-merge-in-stata/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 1, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
iFinance Tutor
Description
Dataset

This dataset was created by iFinance Tutor

Contents
m
Raw data used for regression analysis within stata format
data.mendeley.com
Updated Aug 8, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bealu Bekata (2022). Raw data used for regression analysis within stata format [Dataset]. http://doi.org/10.17632/bh98k2rjd4.1
Explore at:
Unique identifier
https://doi.org/10.17632/bh98k2rjd4.1
Dataset updated
Aug 8, 2022
Authors
Bealu Bekata
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is data used for regression model in Stata format
g
Stata code for analysis
gimi9.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stata code for analysis [Dataset]. https://gimi9.com/dataset/data-gov_stata-code-for-analysis/
Explore at:
Description
🇺🇸 미국
r
Revised STATA do-file and dataset
researchdata.edu.au
adelaide.figshare.com
Updated Dec 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Warnakulasooriya Lakmini Fernando; Stephanie McWhinnie (2024). Revised STATA do-file and dataset [Dataset]. http://doi.org/10.25909/27932961.V1
Explore at:
Unique identifier
https://doi.org/10.25909/27932961.V1
Dataset updated
Dec 5, 2024
Dataset provided by
The University of Adelaide
Authors
Warnakulasooriya Lakmini Fernando; Stephanie McWhinnie
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Revised STATA do-file and dataset prepared for journal article resubmission.
f
Dataset and Stata code
figshare.com
bin
Updated Oct 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Otto Simonsson (2023). Dataset and Stata code [Dataset]. http://doi.org/10.6084/m9.figshare.24316408.v1
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.24316408.v1
Dataset updated
Oct 16, 2023
Dataset provided by
figshare
Authors
Otto Simonsson
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset and Stata code
h
stata
huggingface.co
Updated Aug 1, 2006
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aden Haussmann (2006). stata [Dataset]. https://huggingface.co/datasets/adenhaus/stata
Explore at:
Dataset updated
Aug 1, 2006
Authors
Aden Haussmann
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Background

This dataset contains human evaluations of whether outputs on the TaTA dataset are a) understandable and b) attributable to the source tables. See TaTA: A Multilingual Table-to-Text Dataset for African Languages for more details. It can be used to train a learned metric, called StATA, to evaluate model performance on the TaTA dataset. Paper: https://www.arxiv.org/abs/2503.23204 The original can be found here.
f
Cleaned Stata dataset
figshare.com
bin
Updated Aug 11, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tarsizious Chikaonda (2022). Cleaned Stata dataset [Dataset]. http://doi.org/10.6084/m9.figshare.20431287.v1
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.20431287.v1
Dataset updated
Aug 11, 2022
Dataset provided by
figshare
Authors
Tarsizious Chikaonda
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Cleaned Dataset for the Project
H
Replication Data for: Changes in Marital Sorting: Theory and Evidence from...
dataverse.harvard.edu
Updated Jan 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Costas Meghir (2025). Replication Data for: Changes in Marital Sorting: Theory and Evidence from the US [Dataset]. http://doi.org/10.7910/DVN/WEWKGK
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/WEWKGK
Dataset updated
Jan 9, 2025
Dataset provided by
Harvard Dataverse
Authors
Costas Meghir
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
United States
Description
This supplementary material includes instructions and STATA do files that replicate the empirical results in the paper. The data used is the CPS and needs to be downloaded from the IPMUS website.
d
Replication Data for: Collaboration, Alphabetical Order and Gender...
search.dataone.org
dataverse.harvard.edu
Updated Nov 22, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wiborg, Vegard; Kjell Arne Brekke; Karine Nyborg (2023). Replication Data for: Collaboration, Alphabetical Order and Gender Discrimination [Dataset]. http://doi.org/10.7910/DVN/AZDWWT
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/AZDWWT
Dataset updated
Nov 22, 2023
Dataset provided by
Harvard Dataverse
Authors
Wiborg, Vegard; Kjell Arne Brekke; Karine Nyborg
Description
Dataset and Stata codes for the paper "Collaboration, Alphabetical Order and Gender Discrimination"
Value Judgment
figshare.com
bin
Updated Oct 18, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anna Nicolet; Paul F. M. Krabbe (2019). Value Judgment [Dataset]. http://doi.org/10.6084/m9.figshare.10000556.v1
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.10000556.v1
Dataset updated
Oct 18, 2019
Dataset provided by
Figsharehttp://figshare.com/
Authors
Anna Nicolet; Paul F. M. Krabbe
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is the dataset based on the survey responses of the general population and patients in the Netherlands
d
2 INDIVIDUAL ( Sections 1 2 3 5) v4 (stata 11)
catalog.data.gov
res1catalogd-o-tdatad-o-tgov.vcapture.xyz
+1more
Updated Oct 8, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office of Assistant Secretary for Policy (2022). 2 INDIVIDUAL ( Sections 1 2 3 5) v4 (stata 11) [Dataset]. https://catalog.data.gov/dataset/2-individual-sections-1-2-3-5-v4-stata-11
Explore at:
Dataset updated
Oct 8, 2022
Dataset provided by
Office of Assistant Secretary for Policy
Description
Restricted Use data from the ILAB Philippines study