28 datasets found
  1. d

    Current Population Survey (CPS)

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Damico, Anthony (2023). Current Population Survey (CPS) [Dataset]. http://doi.org/10.7910/DVN/AK4FDD
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Damico, Anthony
    Description

    analyze the current population survey (cps) annual social and economic supplement (asec) with r the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics ( bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups b y state - consider pooling multiple years. county-level is a no-no. despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be t reated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population. the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show. this new github repository contains three scripts: 2005-2012 asec - download all microdata.R down load the fixed-width file containing household, family, and person records import by separating this file into three tables, then merge 'em together at the person-level download the fixed-width file containing the person-level replicate weights merge the rectangular person-level file with the replicate weights, then store it in a sql database create a new variable - one - in the data table 2012 asec - analysis examples.R connect to the sql database created by the 'download all microdata' progr am create the complex sample survey object, using the replicate weights perform a boatload of analysis examples replicate census estimates - 2011.R connect to the sql database created by the 'download all microdata' program create the complex sample survey object, using the replicate weights match the sas output shown in the png file below 2011 asec replicate weight sas output.png statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document. click here to view these three scripts for more detail about the current population survey - annual social and economic supplement (cps-asec), visit: the census bureau's current population survey page the bureau of labor statistics' current population survey page the current population survey's wikipedia article notes: interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current populat ion survey to talk about america, subract a year from the data file name. as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research. confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D

  2. Post-trial follow-up after a randomized clinical trial of COVID-19...

    • figshare.com
    txt
    Updated Jul 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ignacio Esteban (2022). Post-trial follow-up after a randomized clinical trial of COVID-19 convalescent plasma - Stata do-file [Dataset]. http://doi.org/10.6084/m9.figshare.20311155.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 14, 2022
    Dataset provided by
    figshare
    Authors
    Ignacio Esteban
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Stata do-file for the study called: "Post-trial follow-up after a randomized clinical trial of COVID-19 convalescent plasma"

  3. t

    Data: discrimination and inter-group contact in post-conflict settings....

    • service.tib.eu
    Updated May 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Data: discrimination and inter-group contact in post-conflict settings. evidence from colombia - Vdataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/goe-doi-10-25625-le0wwv
    Explore at:
    Dataset updated
    May 16, 2025
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    In this folder you will find the raw data from the experiment that was conducted in August and September 2018 separately for each of the four universities in Colombia where the experiments were conducted. Data were gathered using the psytoolkit software. The final dataset is called “final.dta”. For the analysis we use the software Stata. Data were prepared and cleaned (see do-files on preparation and data cleaning). Variables are labelled. Additionally, a definition of each variable can be found in the appendix of the manuscript. All tables and figures can be replicated using the do-file “figures” and “tables”. For additional questions please contact: Kerstin Unfried: Kerstin.unfried@uni-goettingen.de

  4. Z

    Data and code release for Carleton, Cornetet, Huybers, Meng & Proctor (PNAS,...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Dec 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cornetet, Jules (2023). Data and code release for Carleton, Cornetet, Huybers, Meng & Proctor (PNAS, 2020), "Global evidence for ultraviolet radiation decreasing COVID-19 growth rates" [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3829621
    Explore at:
    Dataset updated
    Dec 26, 2023
    Dataset provided by
    Meng, Kyle C.
    Cornetet, Jules
    Proctor, Jonathan
    Carleton, Tamma
    Huybers, Peter
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This upload contains all replication material for "Global evidence for ultraviolet radiation decreasing COVID-19 growth rates" (PNAS, 2020). Please note that previous versions of this upload provided data and code for the pre-print version of the article, which changed somewhat through the peer review process.

    Authors: Tamma Carleton, Jules Cornetet, Peter Huybers, Kyle C. Meng, Jonathan Proctor.

    Code is located within CCHMP_covid_climate_code_release.zip, and is written in R, Stata, and Matlab. The working directory should be set to the repository folder at the top of each script (all other filepaths are relative).

    Please find the code needed to replicate the main findings of the paper described below:

    Plots of data: R and Stata scripts to make figures 1B, 2A/B/C, S1, S2, and S3, can be found within “code/analysis/data_plots/”.

    Regression analysis: Stata scripts to run the distributed lag regressions and plot the results in figures 2, 3C, S5, S6, S7, S8, S10, and S14, as well as Table S1, can be found within “code/analysis/regressions/”. R scripts for data analysis and plotting for figures 3A/B and S9 are also within "code/analysis/regressions/".

    Seasonal simulations: R and Stata scripts to replicate the seasonal simulation shown in figures 4, S4 and S11 can be found within “code/analysis/seasonal_sim/”.

    SEIR simulations: Matlab scripts to replicate the SEIR simulations shown in figures S12 and S13 can be found within “code/analysis/SEIR/”.

    Data are located within CCHMP_covid_climate_data_release.zip.

  5. Data files for Mind Upload Studies

    • figshare.com
    txt
    Updated Jul 10, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Laakasuo; Marianna Drosinou; Mika Koverola; Anton Kunnari; Juho Halonen; Noora Lehtonen; Jussi Palomäki (2018). Data files for Mind Upload Studies [Dataset]. http://doi.org/10.6084/m9.figshare.6016145.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 10, 2018
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Michael Laakasuo; Marianna Drosinou; Mika Koverola; Anton Kunnari; Juho Halonen; Noora Lehtonen; Jussi Palomäki
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Four anonymized datasets used in our study published in Palgrave Communications. Analyses, psychometrics and formation of dependent and independent variables have been described in the manuscript. Data for Study 3 is in Stata-13 format and contains 303 participants whom were analyzed. For the retained sample of 303 participants few missing values were imputed, as described in the manuscript. For full details regarding the non imputed data-set (not anonymized), please contact the authors if necessary. Stata 13.1 has a bug in its programming and it does not exporting .csv files correctly. Variables related to geolocation, participant numbers and time stamps have been removed. Also personal questions related to possible psychological problems. sexual orientation, occupational details have been removed. Many of the variables have been translated to English. We also collected political party affiliations and voting choices, which have been removed. Finland is a multi-party system and voting behavior, combined with age, gender, timestamps and location can be used to identify people. For full description of the complete collected data the interested parties are encouraged to contact the authors.Other exploratory variables not pertinent to the main study and not reported in the manuscript have also been removed, since this data will be used to explore other research questions and ideas to be published later. Once published, they will be made available as well.

  6. Dataset of optical CBF data during endovascular therapy. Original data in...

    • figshare.com
    xlsx
    Updated May 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christopher Favilla (2024). Dataset of optical CBF data during endovascular therapy. Original data in STATA format (.dta), but excel version has been provided as well. [Dataset]. http://doi.org/10.6084/m9.figshare.25438891.v2
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 1, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Christopher Favilla
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Endovascular therapy (EVT) has changed the landscape of acute stroke treatment in the context of large vessel occlusion (LVO). Still, procedural success is typically determined by the degree of large vessel recanalization, despite the fact that large vessel recanalization does not always result in microvascular reperfusion. To address this discrepancy, we performed bedside optical CBF monitoring (with diffuse correlation spectroscopy) during endovascular therapy. This allowed comparison of CBF pre vs post-recanalization.Note 3 files uploaded:The .dta file is a stata file which contains all variables labels which includes all necessary variables details.The .xlsx file database is in numerical format (i.e. without applying text labels).The .xlsx file data dictionary contains all variable names, variable labels, and code to translate the numerical values.

  7. H

    Replication Code for: "The Quality of Vote Tallies: Causes and Consequences"...

    • dataverse.harvard.edu
    Updated Jul 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cristian Challú; Enrique Seira; Alberto Simpser (2020). Replication Code for: "The Quality of Vote Tallies: Causes and Consequences" [Dataset]. http://doi.org/10.7910/DVN/4M0HEN
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 24, 2020
    Dataset provided by
    Harvard Dataverse
    Authors
    Cristian Challú; Enrique Seira; Alberto Simpser
    License

    https://dataverse.harvard.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.7910/DVN/4M0HENhttps://dataverse.harvard.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.7910/DVN/4M0HEN

    Description

    This upload conains two Stata do-files that process the raw data and reproduce tables and figures in the paper and in the online appendix. The readme.txt file discusses the data.

  8. d

    Replication Data for: The Origins of Persistent Current Account Imbalances...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manger, Mark (2023). Replication Data for: The Origins of Persistent Current Account Imbalances in the post-Bretton Woods Era [Dataset]. http://doi.org/10.7910/DVN/M7FYU8
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Manger, Mark
    Description

    Replication data, Stata code for analysis, and R code for figures

  9. g

    ABC News/Washington Post Drug Poll, February 1997 - Version 2

    • search.gesis.org
    Updated Feb 21, 1997
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GESIS search (1997). ABC News/Washington Post Drug Poll, February 1997 - Version 2 [Dataset]. http://doi.org/10.3886/ICPSR02175.v2
    Explore at:
    Dataset updated
    Feb 21, 1997
    Dataset provided by
    ICPSR - Interuniversity Consortium for Political and Social Research
    GESIS search
    License

    https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de454978https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de454978

    Description

    Abstract (en): This special topic poll, conducted February 20-24, 1997, solicited responses from parents and their teenage children, aged 12-17, on the topic of illegal drug use among America's youth. One parent and one child from each household were asked a series of questions covering illegal drugs, violence in school, underage drinking, academic challenges, and parent-child communication. Respondents were asked to assess their understanding of the presence of drugs and drug users in their local schools, throughout the community, across the nation, among the teen's peer group, and within their own family. A series of topics covered the availability and effectiveness of school-sponsored anti-drug programs. Parents were asked how their possible past and present use and/or experimentation with marijuana and other illegal drugs, alcohol, and tobacco products influenced the manner in which they approached drug use with their own children. Teenage respondents were asked for their reaction to the use of drugs and alcohol by their friends, the seriousness of the contemporary drug problem, and whether they believed that their parents had used or experimented with illegal drugs. Other questions asked about teenage respondents' plans after high school and whether they attended a public or private school. Demographic variables for parental respondents included age, race, sex, education level, household income, political party affiliation, and type of residential area (e.g., urban or rural). Demographic variables for teenage respondents included age, race, sex, residential area, and grade level in school. The data contain a weight variable (WEIGHT) that should be used in analyzing the data. This poll consists of "standard" national representative samples of the adult population with sample balancing of sex, race, age, and education. The weight variable contains two implied decimal places, and applies only to the parental respondents. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Created online analysis version with question text.. Persons aged 18 and over living in households with telephones in the contiguous 48 United States. Households were selected by random-digit dialing. Within households, the respondent selected was the adult living in the household who last had a birthday and who was home at the time of the interview. 2007-02-27 SAS, SPSS, and Stata setup files, and SAS and Stata supplemental files have been added to this data collection. Respondent names were removed from the data file and the CASEID variable was created for use with online analysis.2006-11-10 SAS, SPSS, and Stata setup files have been added to this data collection. telephone interview (1) The data available for download are not weighted and users will need to weight the data prior to analysis. (2) Original reports using these data may be found via the ABC News Web site. (3) According to the data collection instrument, code 3 in the variable P_EDUC also included respondents who answered that they had attended a technical college. (4) The CASEID variable was created for use with online analysis.

  10. n

    General Household Survey, Panel 2023-2024 - Nigeria

    • microdata.nigerianstat.gov.ng
    • catalog.ihsn.org
    • +2more
    Updated Dec 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Bureau of Statistics (NBS) (2024). General Household Survey, Panel 2023-2024 - Nigeria [Dataset]. https://microdata.nigerianstat.gov.ng/index.php/catalog/82
    Explore at:
    Dataset updated
    Dec 6, 2024
    Dataset provided by
    National Bureau of Statistics, Nigeria
    Authors
    National Bureau of Statistics (NBS)
    Time period covered
    2023 - 2024
    Area covered
    Nigeria
    Description

    Abstract

    The General Household Survey-Panel (GHS-Panel) is implemented in collaboration with the World Bank Living Standards Measurement Study (LSMS) team as part of the Integrated Surveys on Agriculture (ISA) program. The objectives of the GHS-Panel include the development of an innovative model for collecting agricultural data, interinstitutional collaboration, and comprehensive analysis of welfare indicators and socio-economic characteristics. The GHS-Panel is a nationally representative survey of approximately 5,000 households, which are also representative of the six geopolitical zones. The 2023/24 GHS-Panel is the fifth round of the survey with prior rounds conducted in 2010/11, 2012/13, 2015/16 and 2018/19. The GHS-Panel households were visited twice: during post-planting period (July - September 2023) and during post-harvest period (January - March 2024).

    Geographic coverage

    National

    Analysis unit

    • Households • Individuals • Agricultural plots • Communities

    Universe

    The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The original GHS‑Panel sample was fully integrated with the 2010 GHS sample. The GHS sample consisted of 60 Primary Sampling Units (PSUs) or Enumeration Areas (EAs), chosen from each of the 37 states in Nigeria. This resulted in a total of 2,220 EAs nationally. Each EA contributed 10 households to the GHS sample, resulting in a sample size of 22,200 households. Out of these 22,200 households, 5,000 households from 500 EAs were selected for the panel component, and 4,916 households completed their interviews in the first wave.

    After nearly a decade of visiting the same households, a partial refresh of the GHS‑Panel sample was implemented in Wave 4 and maintained for Wave 5. The refresh was conducted to maintain the integrity and representativeness of the sample. The refresh EAs were selected from the same sampling frame as the original GHS‑Panel sample in 2010. A listing of households was conducted in the 360 EAs, and 10 households were randomly selected in each EA, resulting in a total refresh sample of approximately 3,600 households.

    In addition to these 3,600 refresh households, a subsample of the original 5,000 GHS‑Panel households from 2010 were selected to be included in the new sample. This “long panel” sample of 1,590 households was designed to be nationally representative to enable continued longitudinal analysis for the sample going back to 2010. The long panel sample consisted of 159 EAs systematically selected across Nigeria’s six geopolitical zones.

    The combined sample of refresh and long panel EAs in Wave 5 that were eligible for inclusion consisted of 518 EAs based on the EAs selected in Wave 4. The combined sample generally maintains both the national and zonal representativeness of the original GHS‑Panel sample.

    Sampling deviation

    Although 518 EAs were identified for the post-planting visit, conflict events prevented interviewers from visiting eight EAs in the North West zone of the country. The EAs were located in the states of Zamfara, Katsina, Kebbi and Sokoto. Therefore, the final number of EAs visited both post-planting and post-harvest comprised 157 long panel EAs and 354 refresh EAs. The combined sample is also roughly equally distributed across the six geopolitical zones.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    The GHS-Panel Wave 5 consisted of three questionnaires for each of the two visits. The Household Questionnaire was administered to all households in the sample. The Agriculture Questionnaire was administered to all households engaged in agricultural activities such as crop farming, livestock rearing, and other agricultural and related activities. The Community Questionnaire was administered to the community to collect information on the socio-economic indicators of the enumeration areas where the sample households reside.

    GHS-Panel Household Questionnaire: The Household Questionnaire provided information on demographics; education; health; labour; childcare; early child development; food and non-food expenditure; household nonfarm enterprises; food security and shocks; safety nets; housing conditions; assets; information and communication technology; economic shocks; and other sources of household income. Household location was geo-referenced in order to be able to later link the GHS-Panel data to other available geographic data sets (forthcoming).

    GHS-Panel Agriculture Questionnaire: The Agriculture Questionnaire solicited information on land ownership and use; farm labour; inputs use; GPS land area measurement and coordinates of household plots; agricultural capital; irrigation; crop harvest and utilization; animal holdings and costs; household fishing activities; and digital farming information. Some information is collected at the crop level to allow for detailed analysis for individual crops.

    GHS-Panel Community Questionnaire: The Community Questionnaire solicited information on access to infrastructure and transportation; community organizations; resource management; changes in the community; key events; community needs, actions, and achievements; social norms; and local retail price information.

    The Household Questionnaire was slightly different for the two visits. Some information was collected only in the post-planting visit, some only in the post-harvest visit, and some in both visits.

    The Agriculture Questionnaire collected different information during each visit, but for the same plots and crops.

    The Community Questionnaire collected prices during both visits, and different community level information during the two visits.

    Cleaning operations

    CAPI: Wave five exercise was conducted using Computer Assisted Person Interview (CAPI) techniques. All the questionnaires (household, agriculture, and community questionnaires) were implemented in both the post-planting and post-harvest visits of Wave 5 using the CAPI software, Survey Solutions. The Survey Solutions software was developed and maintained by the Living Standards Measurement Unit within the Development Economics Data Group (DECDG) at the World Bank. Each enumerator was given a tablet which they used to conduct the interviews. Overall, implementation of survey using Survey Solutions CAPI was highly successful, as it allowed for timely availability of the data from completed interviews.

    DATA COMMUNICATION SYSTEM: The data communication system used in Wave 5 was highly automated. Each field team was given a mobile modem which allowed for internet connectivity and daily synchronization of their tablets. This ensured that head office in Abuja had access to the data in real-time. Once the interview was completed and uploaded to the server, the data was first reviewed by the Data Editors. The data was also downloaded from the server, and Stata dofile was run on the downloaded data to check for additional errors that were not captured by the Survey Solutions application. An excel error file was generated following the running of the Stata dofile on the raw dataset. Information contained in the excel error files were then communicated back to respective field interviewers for their action. This monitoring activity was done on a daily basis throughout the duration of the survey, both in the post-planting and post-harvest.

    DATA CLEANING: The data cleaning process was done in three main stages. The first stage was to ensure proper quality control during the fieldwork. This was achieved in part by incorporating validation and consistency checks into the Survey Solutions application used for the data collection and designed to highlight many of the errors that occurred during the fieldwork.

    The second stage cleaning involved the use of Data Editors and Data Assistants (Headquarters in Survey Solutions). As indicated above, once the interview is completed and uploaded to the server, the Data Editors review completed interview for inconsistencies and extreme values. Depending on the outcome, they can either approve or reject the case. If rejected, the case goes back to the respective interviewer’s tablet upon synchronization. Special care was taken to see that the households included in the data matched with the selected sample and where there were differences, these were properly assessed and documented. The agriculture data were also checked to ensure that the plots identified in the main sections merged with the plot information identified in the other sections. Additional errors observed were compiled into error reports that were regularly sent to the teams. These errors were then corrected based on re-visits to the household on the instruction of the supervisor. The data that had gone through this first stage of cleaning was then approved by the Data Editor. After the Data Editor’s approval of the interview on Survey Solutions server, the Headquarters also reviews and depending on the outcome, can either reject or approve.

    The third stage of cleaning involved a comprehensive review of the final raw data following the first and second stage cleaning. Every variable was examined individually for (1) consistency with other sections and variables, (2) out of range responses, and (3) outliers. However, special care was taken to avoid making strong assumptions when resolving potential errors. Some minor errors remain in the data where the diagnosis and/or solution were unclear to the data cleaning team.

    Response

  11. d

    Replication Data for: Moral Hazard and the Energy Efficiency Gap: Theory and...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maher, Joseph (2023). Replication Data for: Moral Hazard and the Energy Efficiency Gap: Theory and Evidence [Dataset]. http://doi.org/10.7910/DVN/I4LNJU
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Maher, Joseph
    Description

    Code and data to produce final results and summary statistics using Stata 15.1. There are four code files (and a master do file to call them in sequence) and three data sets to produce the results in the paper. All data are included in a single .ZIP file (uncompressed files exceeded upload limits).

  12. H

    National Health and Nutrition Examination Survey (NHANES)

    • dataverse.harvard.edu
    Updated May 30, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anthony Damico (2013). National Health and Nutrition Examination Survey (NHANES) [Dataset]. http://doi.org/10.7910/DVN/IMWQPJ
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 30, 2013
    Dataset provided by
    Harvard Dataverse
    Authors
    Anthony Damico
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    analyze the national health and nutrition examination survey (nhanes) with r nhanes is this fascinating survey where doctors and dentists accompany survey interviewers in a little mobile medical center that drives around the country. while the survey folks are interviewing people, the medical professionals administer laboratory tests and conduct a real doctor's examination. the b lood work and medical exam allow researchers like you and me to answer tough questions like, "how many people have diabetes but don't know they have diabetes?" conducting the lab tests and the physical isn't cheap, so a new nhanes data set becomes available once every two years and only includes about twelve thousand respondents. since the number of respondents is so small, analysts often pool multiple years of data together. the replication scripts below give a few different examples of how multiple years of data can be pooled with r. the survey gets conducted by the centers for disease control and prevention (cdc), and generalizes to the united states non-institutional, non-active duty military population. most of the data tables produced by the cdc include only a small number of variables, so importation with the foreign package's read.xport function is pretty straightforward. but that makes merging the appropriate data sets trickier, since it might not be clear what to pull for which variables. for every analysis, start with the table with 'demo' in the name -- this file includes basic demographics, weighting, and complex sample survey design variables. since it's quick to download the files directly from the cdc's ftp site, there's no massive ftp download automation script. this new github repository co ntains five scripts: 2009-2010 interview only - download and analyze.R download, import, save the demographics and health insurance files onto your local computer load both files, limit them to the variables needed for the analysis, merge them together perform a few example variable recodes create the complex sample survey object, using the interview weights run a series of pretty generic analyses on the health insurance ques tions 2009-2010 interview plus laboratory - download and analyze.R download, import, save the demographics and cholesterol files onto your local computer load both files, limit them to the variables needed for the analysis, merge them together perform a few example variable recodes create the complex sample survey object, using the mobile examination component (mec) weights perform a direct-method age-adjustment and matc h figure 1 of this cdc cholesterol brief replicate 2005-2008 pooled cdc oral examination figure.R download, import, save, pool, recode, create a survey object, run some basic analyses replicate figure 3 from this cdc oral health databrief - the whole barplot replicate cdc publications.R download, import, save, pool, merge, and recode the demographics file plus cholesterol laboratory, blood pressure questionnaire, and blood pressure laboratory files match the cdc's example sas and sudaan syntax file's output for descriptive means match the cdc's example sas and sudaan synta x file's output for descriptive proportions match the cdc's example sas and sudaan syntax file's output for descriptive percentiles replicate human exposure to chemicals report.R (user-contributed) download, import, save, pool, merge, and recode the demographics file plus urinary bisphenol a (bpa) laboratory files log-transform some of the columns to calculate the geometric means and quantiles match the 2007-2008 statistics shown on pdf page 21 of the cdc's fourth edition of the report click here to view these five scripts for more detail about the national health and nutrition examination survey (nhanes), visit: the cdc's nhanes homepage the national cancer institute's page of nhanes web tutorials notes: nhanes includes interview-only weights and interview + mobile examination component (mec) weights. if you o nly use questions from the basic interview in your analysis, use the interview-only weights (the sample size is a bit larger). i haven't really figured out a use for the interview-only weights -- nhanes draws most of its power from the combination of the interview and the mobile examination component variables. if you're only using variables from the interview, see if you can use a data set with a larger sample size like the current population (cps), national health interview survey (nhis), or medical expenditure panel survey (meps) instead. confidential to sas, spss, stata, sudaan users: why are you still riding around on a donkey after we've invented the internal combustion engine? time to transition to r. :D

  13. f

    Data from: S1 Data set -

    • figshare.com
    • plos.figshare.com
    xls
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raymond Bernard Kihumuro; Peace Kellen; Sarah Chun; Edith K. Wakida; Celestino Obua; Herbert E. Ainamani (2023). S1 Data set - [Dataset]. http://doi.org/10.1371/journal.pone.0293258.s003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Raymond Bernard Kihumuro; Peace Kellen; Sarah Chun; Edith K. Wakida; Celestino Obua; Herbert E. Ainamani
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundWorldwide, there is a growing concern about the rising number of people with declining cognitive functioning. However, findings on this phenomenon are inconclusive. Our study aimed to assess the prevalence of cognitive impairment and the associated factors in women with a history of pregnancy complications in rural southwestern Uganda.MethodsThis was a cross-sectional study carried out among women above 40 years of age in the greater Kabale district of southwestern Uganda between March and April 2022. Study participants were identified using a consecutive sampling method. Predictor variables included pregnancy complications and other social demographic factors that were assessed by semi-structured interviews while cognitive functioning as an outcome variable was assessed by Montreal Cognitive Assessment (MoCA-B) tool. Data were analyzed using STATA at a 95% Confidence level. Logistic regression analyses were selected for statistical modelling while odds ratios were calculated to assess the strength of associations between the predictor and outcome variables.ResultsIn total, 75% (212/280) of participants had some form of cognitive impairment, with 45% (123/280) falling into mild CI, 31% (86/280) moderate CI and 4% (10/280) severe CI. Twenty-three percent (68/280) of participants fell into category of normal cognitive functioning. Participants with >65 years of age had higher odds of developing cognitive impairment (OR = 2.94; 95%CI: 0.96–9.04, p = 0.06) than those with < 65 years of age. Protective factors to cognitive impairment include delivering from a health facility (OR = 0.31,95% CI:0.16–0.60, p = < .001), primary and post primary levels of education (OR = 0.05; 95% CI: 0.02–0.13, p

  14. d

    Replication Data for: Where You Stand Depends on Where You Sit:...

    • search.dataone.org
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chen, Tsung-yen (2023). Replication Data for: Where You Stand Depends on Where You Sit: Inconsistencies in Taiwan Legislators’ Positions on Importing US Meat [Dataset]. http://doi.org/10.7910/DVN/PFUS9M
    Explore at:
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Chen, Tsung-yen
    Area covered
    Taiwan
    Description

    This study delves into the intricate political dynamics that influence legislators’ policy stances concerning the import of US meat into Taiwan over the last decade. It specifically centers on the instances of US beef importation in 2012 and US pork importation in 2021. Within this folder, you will find two datasets along with a Stata .do file, all of which are instrumental for the analysis of quantitative data as presented in the paper. Additionally, the folder encompasses a spreadsheet that facilitates the creation of Figure 5.

  15. m

    Data from: HIV risk and factors associated with use of novel prevention...

    • data.mendeley.com
    Updated Feb 10, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ivan Segawa (2022). HIV risk and factors associated with use of novel prevention interventions among female students at Makerere University [Dataset]. http://doi.org/10.17632/dmbt2sjmvp.1
    Explore at:
    Dataset updated
    Feb 10, 2022
    Authors
    Ivan Segawa
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A cross-sectional study; data was collected using an online self-administered questionnaire shared via email and social media (WhatsApp). Participants were selected from the 10 colleges of Makerere University using quota sampling. Data on socio-demographic characteristics, sexual behaviors, and use of HIV self-testing, pre-or post-exposure prophylaxis was collected. Data was collated in Google spreadsheets, cleaned in Microsoft Excel, and analyzed using STATA 14. Descriptive statistics were used to characterize HIV risk and logistic regression to evaluate factors associated with the use of novel prevention interventions and to estimate odds ratios and 95% confidence intervals.

  16. f

    Sociodemographic and other relevant characteristics of study participants (N...

    • plos.figshare.com
    xls
    Updated Jun 20, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dorcas Obiri-Yeboah; Yaw Asante Awuku; George Adjei; Obed Cudjoe; Anna Hayfron Benjamin; Evans Obboh; Daniel Amoako-Sakyi (2023). Sociodemographic and other relevant characteristics of study participants (N = 711). [Dataset]. http://doi.org/10.1371/journal.pone.0219148.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 20, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Dorcas Obiri-Yeboah; Yaw Asante Awuku; George Adjei; Obed Cudjoe; Anna Hayfron Benjamin; Evans Obboh; Daniel Amoako-Sakyi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sociodemographic and other relevant characteristics of study participants (N = 711).

  17. f

    Stata “do-file” containing the code used to run the analyses in the...

    • figshare.com
    • plos.figshare.com
    txt
    Updated Dec 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiayao Lei; Kate Cuschieri; Hasit Patel; Alexandra Lawrence; Katie Deats; Peter Sasieni; Anita W. W. Lim (2024). Stata “do-file” containing the code used to run the analyses in the manuscript using the data from “S1 Dataset.” [Dataset]. http://doi.org/10.1371/journal.pmed.1004494.s006
    Explore at:
    txtAvailable download formats
    Dataset updated
    Dec 12, 2024
    Dataset provided by
    PLOS Medicine
    Authors
    Jiayao Lei; Kate Cuschieri; Hasit Patel; Alexandra Lawrence; Katie Deats; Peter Sasieni; Anita W. W. Lim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Stata “do-file” containing the code used to run the analyses in the manuscript using the data from “S1 Dataset.”

  18. d

    The Australian Voter Experience (AVE) dataset

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Norris, Pippa; Nai, Alessandro; Karp, Jeffrey (2023). The Australian Voter Experience (AVE) dataset [Dataset]. http://doi.org/10.7910/DVN/FEBKDE
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Norris, Pippa; Nai, Alessandro; Karp, Jeffrey
    Area covered
    Australia
    Description

    The Electoral Integrity Project at Harvard University and the University of Sydney (www.electoralintegrityproject.com) developed the AVE data, release 1.0. The dataset contains information from a three-wave panel survey designed to gather the views of a representative sample of ordinary Australians just before and after the 2nd July 2016 Australian federal elections. The survey monitored Australian voters’ experience at the polls, perceptions of the integrity and convenience of the registration and voting process, patterns of civic engagement, public confidence in electoral administration, and attitudes towards reforms, such as civic education campaigns and convenience voting facilities. Respondents were initially contacted in the week before the election between 28 June and 1 July and completed an online questionnaire lasting approximately 15 minutes. This forms the pre-election base line survey (wave 1). The same individuals were contacted again after the election to complete a longer survey, an average of 25 minutes in length. Respondents in wave 2 were contacted between 4 July and 19 July, with two thirds completing the survey after the first week. About six weeks later, the same respondents were interviewed again (wave 3) beginning on 23 August and ending on 13 September. The initial sample contains 2,139 valid responses for the first wave of questionnaires, 1,838 for the second wave (an 86 percent retention rate), and 1,543 for the third wave (84 percent retention rate). Overall, 72 percent of the respondents were carried over from the pre-election wave to the final wave. The following files can be accessed: a) dataset in Stata and SPSS formats; b) codebook; c) questionnaire. The EIP acknowledges support from the Kathleen Fitzpatrick Australian Laureate from the Australian Research Council (ARC ref: FL110100093). **** EIP further publications: BOOKS • LeDuc, Lawrence, Richard Niemi and Pippa Norris. Eds. 2014. Comparing Democracies 4: Elections and Voting in a Changing World. London: Sage Publications. • Nai, Alessandro and Walter, Annemarie. Eds. 2015 New Perspectives on Negative Campaigning: Why Attack Politics Matters. Colchester: ECPR Press. • Norris, Pippa, Richard W. Frank and Ferran Martínez i Coma. Eds. 2014. Advancing Electoral Integrity. New York: Oxford University Press. • Norris, Pippa, Richard W. Frank and Ferran Martínez i Coma. Eds. 2015. Contentious Elections: From Ballots to the Barricades. New York: Routledge. • Norris, Pippa. 2014. Why Electoral Integrity Matters. New York: Cambridge University Press. • Norris, Pippa. 2015. Why Elections Fail. New York: Cambridge University Press. • Norris, Pippa and Andrea Abel van Es. Eds. 2016. Checkbook Elections? Political Finance in Comparative Perspective. Oxford University Press. ARTICLES AND CHAPTERS • W. Frank. 2013. ‘Assessing the quality of elections.’ Journal of Democracy. 24(4): 124-135.• Lago, Ignacio and Martínez i Coma, Ferran. 2016. ‘Challenge or Consent? Understanding Losers’ Reactions in Mass Elections’. Government and Opposition doi:10.1071/gov.3015.31 • Martínez i Coma, Ferran and Lago, Ignacio. 2016. 'Gerrymandering in Comparative Perspective’ Party Politics DOI: 10.1177/1354068816642806 • Norris, Pippa. 2013. ‘Does the world agree about standards of electoral integrity? Evidence for the diffusion of global norms.’ Special issue of Electoral Studies. 32(4):576-588. • Norris, Pippa. 2013. ‘The new research agenda studying electoral integrity’. Special issue of Electoral Studies. 32(4): 563-575.57 • Norris, Pippa. 2014. ‘Electoral integrity and political legitimacy.’ In Comparing Democracies 4. Lawrence LeDuc, Richard Niemi and Pippa Norris. Eds. London: Sage. • Norris, Pippa, Richard W. Frank and Ferran Martínez i Coma. 2014. ‘Measuring electoral integrity: A new dataset.’ PS: Political Science & Politics. 47(4): 789-798. • Norris, Pippa. 2016 (forthcoming). ‘Electoral integrity in East Asia.’ Routledge Handbook on Democratization in East Asia. Tun-jen Cheng and Yun-han Chu. Eds. Routledge: New York. • Norris, Pippa. 2016 (forthcoming). ‘Electoral transitions: Stumbling out of the gate.’ In Rebooting Transitology – Democratization in the 21st Century. Mohammad-Mahmoud Ould Mohamedou and Timothy D. Sisk. Eds. • Pietsch, Juliet; Michael Miller and Jeffrey Karp. 2015. ‘Public support for democracy in transitional regimes.’ Journal of Elections, Public Opinion and Parties. 25(1): 1–9. DOI: 10.1080/17457289.2014. • Smith, Rodney. 2016 (forthcoming). ‘Confidence in paper-based and electronic voting channels: Evidence from Australia.’ Australian Journal of Political Science. ID: 1093091 DOI: 10.1080/10361146.2015.1093091 dx.doi.org/10.1080/07907184.2015.1099097 • Van Ham, Carolien and Staffan Lindberg. 2015. ‘From sticks to carrots: Electoral manipulation in Africa, 1986-2012’, Gover... Visit https://dataone.org/datasets/sha256%3A9efcfe40123531a7f785369bae96a30beb0f41c1ce7334bc7c398a54be5e69f5 for complete metadata about this dataset.

  19. d

    Dodd Frank financial reform at the Commodity Futures Trading Commission...

    • dataone.org
    • search.dataone.org
    Updated Mar 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Konrad Posch; Thomas Nath; J. Nicholas Ziegler (2024). Dodd Frank financial reform at the Commodity Futures Trading Commission (CFTC): Public comments, January 14th, 2010 – July 16th, 2014 [Dataset]. http://doi.org/10.6078/D1610G
    Explore at:
    Dataset updated
    Mar 7, 2024
    Dataset provided by
    Dryad Digital Repository
    Authors
    Konrad Posch; Thomas Nath; J. Nicholas Ziegler
    Time period covered
    Jan 10, 2024
    Description

    This dataset includes a complete record of the 36,066 public comments submitted to the Commodity Futures Trading Commission (CFTC) in response to notices of proposed rule-making (NPRMs) implementing the Dodd-Frank Act over a 42-month period (January 14, 2010 to July 16, 2014). The data was exported from the agency’s internal database by the CFTC and provided to the authors by email correspondence following a cold call to the CFTC public relations department. The source internal database is maintained by the CFTC as part of its internal compliance with the Administrative Procedures Act (APA) and includes all rule-making notices that appear in the Federal Register. Owing to the salience and publicity of the Dodd-Frank Act, the CFTC made a special tag in its database for all comments submitted in response to rules proposed under the authority of the Dodd-Frank Act. This database thus includes all comments which the CFTC considers relevant to the Dodd-Frank reform. In short, the CFTC gave t..., This dataset was exported by the CFTC from their internal database of public comments in response to NPRMs. The uploaded file is the exact raw data generated by the CTFC and provided to the authors. An updated version of the data file including the author's classifications based on the organization value will be uploaded when the related work is accepted for publication., , # Dodd Frank Financial Reform at the CFTC - Public Comments, January 14th, 2010 to July 16th, 2014

    Description of the data and file structure

    NOTE: The Comment Text ( and variables) are longer than the maximum character count of Microsoft Excel cells (32,767 characters). All analysis should take this into account and import the .txt file directly into your analysis program (R, Stata, etc.) rather than attempt to edit or modify the data in Excel before using computational analysis.

    There are two files provided:

    1. DoddFrankCommentsAll(uncompressed).txt - the full raw data file from the CFTC internal database of all 36,066 comments on NPRMs
    2. (2014-07-30) CFTC Original Codebook.xlsx - the codebook provided by the CTFC with the raw data. Originally provided as email text, formatted in Excel by authors.

    Codebook:Â

    | Variable | Explanation ...

  20. d

    Replication Code for: Local Adaptation and Unintended Coastal Vulnerability:...

    • dataone.org
    • dataverse.harvard.edu
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gopalakrishnan, Sathya (2023). Replication Code for: Local Adaptation and Unintended Coastal Vulnerability: The Effect of Beach Nourishment on Residential Development in North Carolina [Dataset]. http://doi.org/10.7910/DVN/DANYFZ
    Explore at:
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Gopalakrishnan, Sathya
    Area covered
    North Carolina
    Description

    Two STATA files with code to replicate the duration analysis 1. duration_estimation.do: This file estimates first-stage OLS and second-stage duration model with dual IV. Log file for the estimation of the main model is included. 2. duration_simulation.do: The file bootstraps parameters for the main model, and performs the accept/reject for a baseline counterfactual with no changes. Log file is included Because the data are proprietary, we cannot upload the complete dataset. A 10% random sample can be provided upon request.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Damico, Anthony (2023). Current Population Survey (CPS) [Dataset]. http://doi.org/10.7910/DVN/AK4FDD

Current Population Survey (CPS)

Explore at:
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Damico, Anthony
Description

analyze the current population survey (cps) annual social and economic supplement (asec) with r the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics ( bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups b y state - consider pooling multiple years. county-level is a no-no. despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be t reated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population. the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show. this new github repository contains three scripts: 2005-2012 asec - download all microdata.R down load the fixed-width file containing household, family, and person records import by separating this file into three tables, then merge 'em together at the person-level download the fixed-width file containing the person-level replicate weights merge the rectangular person-level file with the replicate weights, then store it in a sql database create a new variable - one - in the data table 2012 asec - analysis examples.R connect to the sql database created by the 'download all microdata' progr am create the complex sample survey object, using the replicate weights perform a boatload of analysis examples replicate census estimates - 2011.R connect to the sql database created by the 'download all microdata' program create the complex sample survey object, using the replicate weights match the sas output shown in the png file below 2011 asec replicate weight sas output.png statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document. click here to view these three scripts for more detail about the current population survey - annual social and economic supplement (cps-asec), visit: the census bureau's current population survey page the bureau of labor statistics' current population survey page the current population survey's wikipedia article notes: interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current populat ion survey to talk about america, subract a year from the data file name. as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research. confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D

Search
Clear search
Close search
Google apps
Main menu