100+ datasets found
  1. Macroeconomic Time Series for the United States, United Kingdom, Germany,...

    • icpsr.umich.edu
    ascii, sas, spss +1
    Updated Mar 26, 2007
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Bureau of Economic Research (2007). Macroeconomic Time Series for the United States, United Kingdom, Germany, and France [Dataset]. http://doi.org/10.3886/ICPSR07644.v2
    Explore at:
    spss, sas, stata, asciiAvailable download formats
    Dataset updated
    Mar 26, 2007
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    National Bureau of Economic Research
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/7644/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/7644/terms

    Time period covered
    1785 - 1968
    Area covered
    United States, France, Global, Germany, United Kingdom
    Description

    This collection contains an array of economic time series data pertaining to the United States, the United Kingdom, Germany, and France, primarily between the 1920s and the 1960s, and including some time series from the 18th and 19th centuries. These data were collected by the National Bureau of Economic Research (NBER), and they constitute a research resource of importance to economists as well as to political scientists, sociologists, and historians. Under a grant from the National Science Foundation, ICPSR and the National Bureau of Economic Research converted this collection (which existed heretofore only on handwritten sheets stored in New York) into fully accessible, readily usable, and completely documented machine-readable form. The NBER collection -- containing an estimated 1.6 million entries -- is divided into 16 major categories: (1) construction, (2) prices, (3) security markets, (4) foreign trade, (5) income and employment, (6) financial status of business, (7) volume of transactions, (8) government finance, (9) distribution of commodities, (10) savings and investments, (11) transportation and public utilities, (12) stocks of commodities, (13) interest rates, and (14) indices of leading, coincident, and lagging indicators, (15) money and banking, and (16) production of commodities. Data from all categories are available in Parts 1-22. The economic variables are usually observations on the entire nation or large subsets of the nation. Frequently, however, and especially in the United States, separate regional and metropolitan data are included in other variables. This makes cross-sectional analysis possible in many cases. The time span of variables in these files may be as short as one year or as long as 160 years. Most data pertain to the first half of the 20th century. Many series, however, extend into the 19th century, and a few reach into the 18th. The oldest series, covering brick production in England and Wales, begins in 1785, and the most recent United States data extend to 1968. The unit of analysis is an interval of time -- a year, a quarter, or a month. The bulk of observations are monthly, and most series of monthly data contain annual values or totals.

  2. n

    Multilevel modeling of time-series cross-sectional data reveals the dynamic...

    • data.niaid.nih.gov
    zip
    Updated Mar 6, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kodai Kusano (2020). Multilevel modeling of time-series cross-sectional data reveals the dynamic interaction between ecological threats and democratic development [Dataset]. http://doi.org/10.5061/dryad.547d7wm3x
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 6, 2020
    Dataset provided by
    University of Nevada, Reno
    Authors
    Kodai Kusano
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    What is the relationship between environment and democracy? The framework of cultural evolution suggests that societal development is an adaptation to ecological threats. Pertinent theories assume that democracy emerges as societies adapt to ecological factors such as higher economic wealth, lower pathogen threats, less demanding climates, and fewer natural disasters. However, previous research confused within-country processes with between-country processes and erroneously interpreted between-country findings as if they generalize to within-country mechanisms. In this article, we analyze a time-series cross-sectional dataset to study the dynamic relationship between environment and democracy (1949-2016), accounting for previous misconceptions in levels of analysis. By separating within-country processes from between-country processes, we find that the relationship between environment and democracy not only differs by countries but also depends on the level of analysis. Economic wealth predicts increasing levels of democracy in between-country comparisons, but within-country comparisons show that democracy declines as countries become wealthier over time. This relationship is only prevalent among historically wealthy countries but not among historically poor countries, whose wealth also increased over time. By contrast, pathogen prevalence predicts lower levels of democracy in both between-country and within-country comparisons. Our longitudinal analyses identifying temporal precedence reveal that not only reductions in pathogen prevalence drive future democracy, but also democracy reduces future pathogen prevalence and increases future wealth. These nuanced results contrast with previous analyses using narrow, cross-sectional data. As a whole, our findings illuminate the dynamic process by which environment and democracy shape each other.

    Methods Our Time-Series Cross-Sectional data combine various online databases. Country names were first identified and matched using R-package “countrycode” (Arel-Bundock, Enevoldsen, & Yetman, 2018) before all datasets were merged. Occasionally, we modified unidentified country names to be consistent across datasets. We then transformed “wide” data into “long” data and merged them using R’s Tidyverse framework (Wickham, 2014). Our analysis begins with the year 1949, which was occasioned by the fact that one of the key time-variant level-1 variables, pathogen prevalence was only available from 1949 on. See our Supplemental Material for all data, Stata syntax, R-markdown for visualization, supplemental analyses and detailed results (available at https://osf.io/drt8j/).

  3. d

    Replication Data for: Chapter 17: Time-Series Cross-Section

    • search.dataone.org
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fortin-Rittberger, Jessica (2023). Replication Data for: Chapter 17: Time-Series Cross-Section [Dataset]. http://doi.org/10.7910/DVN/TZDH4N
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Fortin-Rittberger, Jessica
    Description

    Companion files for: 2014. Jessica Fortin-Rittberger. “Time-Series Cross-Section” in Henning Best and Christof Wolf (Eds.), The SAGE Handbook of Regression Analysis and Causal Inference, Sage Publishers. DOI: http://dx.doi.org/10.4135/9781446288146.n17 data file (Norris, P. (2009). Democracy timeseries data release 3.0. http://www.pippanorris.com/) and Stata do file

  4. H

    Replication Data for: Generalized Synthetic Control Method: Causal Inference...

    • dataverse.harvard.edu
    Updated Aug 26, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yiqing Xu (2016). Replication Data for: Generalized Synthetic Control Method: Causal Inference with Interactive Fixed Effects Models [Dataset]. http://doi.org/10.7910/DVN/8AKACJ
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 26, 2016
    Dataset provided by
    Harvard Dataverse
    Authors
    Yiqing Xu
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This replication file contains data and source code to replicate the results in "Generalized Synthetic Control Method: Causal Inference with Interactive Fixed Effects Models" by Yiqing Xu

  5. Interpretation and identification of within-unit and cross-sectional...

    • plos.figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Kropko; Robert Kubinec (2023). Interpretation and identification of within-unit and cross-sectional variation in panel data models [Dataset]. http://doi.org/10.1371/journal.pone.0231349
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jonathan Kropko; Robert Kubinec
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    While fixed effects (FE) models are often employed to address potential omitted variables, we argue that these models’ real utility is in isolating a particular dimension of variance from panel data for analysis. In addition, we show through novel mathematical decomposition and simulation that only one-way FE models cleanly capture either the over-time or cross-sectional dimensions in panel data, while the two-way FE model unhelpfully combines within-unit and cross-sectional variation in a way that produces un-interpretable answers. In fact, as we show in this paper, if we begin with the interpretation that many researchers wrongly assign to the two-way FE model—that it represents a single estimate of X on Y while accounting for unit-level heterogeneity and time shocks—the two-way FE specification is statistically unidentified, a fact that statistical software packages like R and Stata obscure through internal matrix processing.

  6. Data from: The Gaussian Graphical Model in Cross-Sectional and Time-Series...

    • tandf.figshare.com
    pdf
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sacha Epskamp; Lourens J. Waldorp; René Mõttus; Denny Borsboom (2023). The Gaussian Graphical Model in Cross-Sectional and Time-Series Data [Dataset]. http://doi.org/10.6084/m9.figshare.6144422.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    Sacha Epskamp; Lourens J. Waldorp; René Mõttus; Denny Borsboom
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We discuss the Gaussian graphical model (GGM; an undirected network of partial correlation coefficients) and detail its utility as an exploratory data analysis tool. The GGM shows which variables predict one-another, allows for sparse modeling of covariance structures, and may highlight potential causal relationships between observed variables. We describe the utility in three kinds of psychological data sets: data sets in which consecutive cases are assumed independent (e.g., cross-sectional data), temporally ordered data sets (e.g., n = 1 time series), and a mixture of the 2 (e.g., n > 1 time series). In time-series analysis, the GGM can be used to model the residual structure of a vector-autoregression analysis (VAR), also termed graphical VAR. Two network models can then be obtained: a temporal network and a contemporaneous network. When analyzing data from multiple subjects, a GGM can also be formed on the covariance structure of stationary means—the between-subjects network. We discuss the interpretation of these models and propose estimation methods to obtain these networks, which we implement in the R packages graphicalVAR and mlVAR. The methods are showcased in two empirical examples, and simulation studies on these methods are included in the supplementary materials.

  7. w

    National Panel Survey 2008-2015, Uniform Panel Dataset - Tanzania

    • microdata.worldbank.org
    Updated Mar 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Bureau of Statistics (2021). National Panel Survey 2008-2015, Uniform Panel Dataset - Tanzania [Dataset]. https://microdata.worldbank.org/catalog/3814
    Explore at:
    Dataset updated
    Mar 17, 2021
    Dataset authored and provided by
    National Bureau of Statistics
    Time period covered
    2008 - 2015
    Area covered
    Tanzania
    Description

    Abstract

    Panel data possess several advantages over conventional cross-sectional and time-series data, including their power to isolate the effects of specific actions, treatments, and general policies often at the core of large-scale econometric development studies. While the concept of panel data alone provides the capacity for modeling the complexities of human behavior, the notion of universal panel data – in which time- and situation-driven variances leading to variations in tools, and thus results, are mitigated – can further enhance exploitation of the richness of panel information.

    This Basic Information Document (BID) provides a brief overview of the Tanzania National Panel Survey (NPS), but focuses primarily on the theoretical development and application of panel data, as well as key elements of the universal panel survey instrument and datasets generated by the four rounds of the NPS. As this Basic Information Document (BID) for the UPD does not describe in detail the background, development, or use of the NPS itself, the round-specific NPS BIDs should supplement the information provided here.

    The NPS Uniform Panel Dataset (UPD) consists of both survey instruments and datasets, meticulously aligned and engineered with the aim of facilitating the use of and improving access to the wealth of panel data offered by the NPS. The NPS-UPD provides a consistent and straightforward means of conducting not only user-driven analyses using convenient, standardized tools, but also for monitoring MKUKUTA, FYDP II, and other national level development indicators reported by the NPS.

    The design of the NPS-UPD combines the four completed rounds of the NPS – NPS 2008/09 (R1), NPS 2010/11 (R2), NPS 2012/13 (R3), and NPS 2014/15 (R4) – into pooled, module-specific survey instruments and datasets. The panel survey instruments offer the ease of comparability over time, with modifications and variances easily identifiable as well as those aspects of the questionnaire which have remained identical and offer consistent information. By providing all module-specific data over time within compact, pooled datasets, panel datasets eliminate the need for user-generated merges between rounds and present data in a clear, logical format, increasing both the usability and comprehension of complex data.

    Geographic coverage

    Designed for analysis of key indicators at four primary domains of inference, namely: Dar es Salaam, other urban, rural, Zanzibar.

    Analysis unit

    • Households
    • Individuals

    Universe

    The universe includes all households and individuals in Tanzania with the exception of those residing in military barracks or other institutions.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    While the same sample of respondents was maintained over the first three rounds of the NPS, longitudinal surveys tend to suffer from bias introduced by households leaving the survey over time; i.e. attrition. Although the NPS maintains a highly successful recapture rate (roughly 96% retention at the household level), minimizing the escalation of this selection bias, a refresh of longitudinal cohorts was done for the NPS 2014/15 to ensure proper representativeness of estimates while maintaining a sufficient primary sample to maintain cohesion within panel analysis. A newly completed Population and Housing Census (PHC) in 2012, providing updated population figures along with changes in administrative boundaries, emboldened the opportunity to realign the NPS sample and abate collective bias potentially introduced through attrition.

    To maintain the panel concept of the NPS, the sample design for NPS 2014/2015 consisted of a combination of the original NPS sample and a new NPS sample. A nationally representative sub-sample was selected to continue as part of the “Extended Panel” while an entirely new sample, “Refresh Panel”, was selected to represent national and sub-national domains. Similar to the sample in NPS 2008/2009, the sample design for the “Refresh Panel” allows analysis at four primary domains of inference, namely: Dar es Salaam, other urban areas on mainland Tanzania, rural mainland Tanzania, and Zanzibar. This new cohort in NPS 2014/2015 will be maintained and tracked in all future rounds between national censuses.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The format of the NPS-UPD survey instrument is similar to previously disseminated NPS survey instruments. Each module has a questionnaire and clearly identifies if the module collects information at the individual or household level. Within each module-specific questionnaire of the NPS-UPD survey instrument, there are five distinct sections, arranged vertically: (1) the UPD - “U” on the survey instrument, (2) R4, (3), R3, (4) R2, and (5) R1 – the latter 4 sections presenting each questionnaire in its original form at time of its respective dissemination.

    The uppermost section of each module’s questionnaire (“U”) represents the model universal panel questionnaire, with questions generated from the comprehensive listing of questions across all four rounds of the NPS and codes generated from the comprehensive collection of codes. The following sections are arranged vertically by round, considering R4 as most recent. While not all rounds will have data reported for each question in the UPD and not each question will have reports for each of the UPD codes listed, the NPS-UPD survey instrument represents the visual, all-inclusive set of information collected by the NPS over time.

    The four round-specific sections (R4, R3, R2, R1) are aligned with their UPD-equivalent question, visually presenting their contribution to compatibility with the UPD. Each round-specific section includes the original round-specific variable names, response codes and skip patterns (corresponding to their respective round-specific NPS data sets, and despite their variance from other rounds or from the comprehensive UPD code listing)4.

  8. r

    QoG Social Policy Dataset - Wide Time-Series CS Data

    • researchdata.se
    Updated Aug 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan Teorell; Richard Svensson; Marcus Samanni; Staffan Kumlin; Stefan Dahlberg; Bo Rothstein; Sören Holmberg (2024). QoG Social Policy Dataset - Wide Time-Series CS Data [Dataset]. https://researchdata.se/en/catalogue/dataset/ext0004-3
    Explore at:
    Dataset updated
    Aug 6, 2024
    Dataset provided by
    University of Gothenburg
    Authors
    Jan Teorell; Richard Svensson; Marcus Samanni; Staffan Kumlin; Stefan Dahlberg; Bo Rothstein; Sören Holmberg
    Time period covered
    1970 - 2005
    Area covered
    United Kingdom, Sweden, Slovenia, Lithuania, Austria, Belgium, Hungary, Canada, Mexico, Finland
    Description

    The QoG Institute is an independent research institute within the Department of Political Science at the University of Gothenburg. Overall 30 researchers conduct and promote research on the causes, consequences and nature of Good Governance and the Quality of Government - that is, trustworthy, reliable, impartial, uncorrupted and competent government institutions.

    The main objective of our research is to address the theoretical and empirical problem of how political institutions of high quality can be created and maintained. A second objective is to study the effects of Quality of Government on a number of policy areas, such as health, the environment, social policy, and poverty.

    The dataset was created as part of a research project titled “Quality of Government and the Conditions for Sustainable Social Policy”. The aim of the dataset is to promote cross-national comparative research on social policy output and its correlates, with a special focus on the connection between social policy and Quality of Government (QoG).

    The data comes in three versions: one cross-sectional dataset, and two cross-sectional time-series datasets for a selection of countries. The two combined datasets are called “long” (year 1946-2009) and “wide” (year 1970-2005).

    The data contains six types of variables, each provided under its own heading in the codebook: Social policy variables, Tax system variables, Social Conditions, Public opinion data, Political indicators, Quality of government variables.

    QoG Social Policy Dataset can be downloaded from the Data Archive of the QoG Institute at http://qog.pol.gu.se/data/datadownloads/data-archive Its variables are now included in QoG Standard.

    Purpose:

    The primary aim of QoG is to conduct and promote research on corruption. One aim of the QoG Institute is to make publicly available cross-national comparative data on QoG and its correlates. The aim of the QoG Social Policy Dataset is to promote cross-national comparative research on social policy output and its correlates, with a special focus on the connection between social policy and Quality of Government (QoG).

    The dataset combining cross-sectional data and time-series data for a selection of 40 countries. The dataset is specifically tailored for the analysis of public opinion data over time, instead uses country as its unit of observation, and one variable for every 5th year from 1970-2005 (or, one per module of each public opinion data source).

    Samanni, Marcus. Jan Teorell, Staffan Kumlin, Stefan Dahlberg, Bo Rothstein, Sören Holmberg & Richard Svensson. 2012. The QoG Social Policy Dataset, version 4Apr12. University of Gothenburg:The Quality of Government Institute. http://www.qog.pol.gu.se

  9. H

    Replication Data for: Change-point Detection and Regularization in Time...

    • dataverse.harvard.edu
    Updated Aug 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jong Hee Park; Soichiro Yamauchi (2022). Replication Data for: Change-point Detection and Regularization in Time Series Cross Sectional Data Analysis [Dataset]. http://doi.org/10.7910/DVN/MCQTYC
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 16, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Jong Hee Park; Soichiro Yamauchi
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Researchers of time series cross sectional (TSCS) data regularly face the change-point problem, which re- quires them to discern between significant parametric shifts that can be deemed structural changes and minor parametric shifts that must be considered noise. In this paper, we develop a general Bayesian method for change-point detection in high dimensional data and present its application in the context of the fixed-effect model. Our proposed method, hidden Markov Bayesian bridge model (HMBB), jointly estimates high dimensional regime-specific parameters and hidden regime transitions in a unified way. We apply our method to Alvarez, Garrett, and Lange (1991)’s study of the relationship between government partisanship and economic growth and Allee and Scalera (2012)’s study of membership effects in international organizations. In both applications, we found that the proposed method successfully identify substantively meaningful temporal heterogeneity in parameters of regression models.

  10. g

    Macroeconomic Time Series for the United States, United Kingdom, Germany,...

    • search.gesis.org
    Updated Mar 26, 2007
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Bureau of Economic Research (2007). Macroeconomic Time Series for the United States, United Kingdom, Germany, and France - Version 2 [Dataset]. http://doi.org/10.3886/ICPSR07644.v2
    Explore at:
    Dataset updated
    Mar 26, 2007
    Dataset provided by
    ICPSR - Interuniversity Consortium for Political and Social Research
    GESIS search
    Authors
    National Bureau of Economic Research
    License

    https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de441876https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de441876

    Area covered
    United Kingdom, United States, France, Germany
    Description

    Abstract (en): This collection contains an array of economic time series data pertaining to the United States, the United Kingdom, Germany, and France, primarily between the 1920s and the 1960s, and including some time series from the 18th and 19th centuries. These data were collected by the National Bureau of Economic Research (NBER), and they constitute a research resource of importance to economists as well as to political scientists, sociologists, and historians. Under a grant from the National Science Foundation, ICPSR and the National Bureau of Economic Research converted this collection (which existed heretofore only on handwritten sheets stored in New York) into fully accessible, readily usable, and completely documented machine-readable form. The NBER collection -- containing an estimated 1.6 million entries -- is divided into 16 major categories: (1) construction, (2) prices, (3) security markets, (4) foreign trade, (5) income and employment, (6) financial status of business, (7) volume of transactions, (8) government finance, (9) distribution of commodities, (10) savings and investments, (11) transportation and public utilities, (12) stocks of commodities, (13) interest rates, and (14) indices of leading, coincident, and lagging indicators, (15) money and banking, and (16) production of commodities. Data from all categories are available in Parts 1-22. The economic variables are usually observations on the entire nation or large subsets of the nation. Frequently, however, and especially in the United States, separate regional and metropolitan data are included in other variables. This makes cross-sectional analysis possible in many cases. The time span of variables in these files may be as short as one year or as long as 160 years. Most data pertain to the first half of the 20th century. Many series, however, extend into the 19th century, and a few reach into the 18th. The oldest series, covering brick production in England and Wales, begins in 1785, and the most recent United States data extend to 1968. The unit of analysis is an interval of time -- a year, a quarter, or a month. The bulk of observations are monthly, and most series of monthly data contain annual values or totals. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Performed consistency checks.; Standardized missing values.; Checked for undocumented or out-of-range codes.. Time series of economic statistics pertaining to France, Germany, the United Kingdom, and the United States between 1785 and 1968. 2007-03-26 This study, updated from OSIRIS, now includes SAS, SPSS, and Stata setup files, SAS transport (XPORT) files, SPSS portable files, a Stata system files, and an updated codebook. Funding insitution(s): National Science Foundation. The data were collected between the 1920s and the 1970s, but it is unclear from the documentation as to the exact start and end dates.

  11. d

    Replication Data for: Matching Methods for Causal Inference with Time-Series...

    • search.dataone.org
    Updated Nov 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Imai, Kosuke; Kim, In Song; Wang, Erik (2023). Replication Data for: Matching Methods for Causal Inference with Time-Series Cross-Section Data [Dataset]. http://doi.org/10.7910/DVN/ZTDHVE
    Explore at:
    Dataset updated
    Nov 13, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Imai, Kosuke; Kim, In Song; Wang, Erik
    Description

    Matching methods improve the validity of causal inference by reducing model dependence and offering intuitive diagnostics. While they have become a part of the standard tool kit across disciplines, matching methods are rarely used when analyzing time-series cross-sectional data. We fill this methodological gap. In the proposed approach, we first match each treated observation with control observations from other units in the same time period that have an identical treatment history up to the pre-specified number of lags. We use standard matching and weighting methods to further refine this matched set so that the treated and matched control observations have similar covariate values. Assessing the quality of matches is done by examining covariate balance. Finally, we estimate both short-term and long-term average treatment effects using the difference-in-differences estimator, accounting for a time trend. We illustrate the proposed methodology through simulation and empirical studies. An open-source software package is available for implementing the proposed methods.

  12. f

    National Panel Survey- Universal Panel Questionnaire, 2008-2015 - United...

    • microdata.fao.org
    Updated Nov 8, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Bureau of Statistics (2022). National Panel Survey- Universal Panel Questionnaire, 2008-2015 - United Republic of Tanzania [Dataset]. https://microdata.fao.org/index.php/catalog/1772
    Explore at:
    Dataset updated
    Nov 8, 2022
    Dataset authored and provided by
    National Bureau of Statistics
    Time period covered
    2008 - 2015
    Area covered
    Tanzania
    Description

    Abstract

    Panel data possess several advantages over conventional cross-sectional and time-series data, including their power to isolate the effects of specific actions, treatments, and general policies often at the core of large-scale econometric development studies. While the concept of panel data alone provides the capacity for modelling the complexities of human behaviour, the notion of universal panel data - in which time- and situation-driven variances leading to variations in tools, and thus results, are mitigated - can further enhance exploitation of the richness of panel information. The NPS Universal Panel Questionnaire (UPQ) consists of both survey instruments and datasets, meticulously aligned and engineered with the aim of facilitating the use of and improving access to the wealth of panel data offered by the NPS. The NPS-UPQ provides a consistent and straightforward means of conducting not only user-driven analyses using convenient, standardized tools, but also for monitoring MKUKUTA, FYDP II, and other national level development indicators reported by the NPS.

    The design of the NPS-UPQ combines the four completed rounds of the NPS - NPS 2008/09 (R1), NPS 2010/11 (R2), NPS 2012/13 (R3), and NPS 2014/15 (R4) - into pooled, module-specific survey instruments and datasets. The panel survey instruments offer the ease of comparability over time, with modifications and variances easily identifiable as well as those aspects of the questionnaire which have remained identical and offer consistent information. By providing all module-specific data over time within compact, pooled datasets, panel datasets eliminate the need for user-generated merges between rounds and present data in a clear, logical format, increasing both the usability and comprehension of complex data.

    Geographic coverage

    Regional coverage

    Analysis unit

    Households

    Universe

    The universe includes all households and individuals in Tanzania with the exception of those residing in military barracks or other institutions.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    SAMPLING PROCEDURE While the same sample of respondents was maintained over the first three rounds of the NPS, longitudinal surveys tend to suffer from bias introduced by households leaving the survey over time, i.e. attrition. Although the NPS maintains a highly successful recapture rate (roughly 96% retention at the household level), minimizing the escalation of this selection bias, a refresh of longitudinal cohorts was done for the NPS 2014/15 to ensure proper representativeness of estimates while maintaining a sufficient primary sample to maintain cohesion within panel analysis. A newly completed Population and Housing Census (PHC) in 2012, providing updated population figures along with changes in administrative boundaries, emboldened the opportunity to realign the NPS sample and abate collective bias potentially introduced through attrition.

    To maintain the panel concept of the NPS, the sample design for NPS 2014/2015 consisted of a combination of the original NPS sample and a new NPS sample. A nationally representative sub-sample was selected to continue as part of the “Extended Panel” while an entirely new sample, “Refresh Panel”, was selected to represent national and sub-national domains. Similar to the sample in NPS 2008/2009, the sample design for the “Refresh Panel” allows analysis at four primary domains of inference, namely: Dar es Salaam, other urban areas on mainland Tanzania, rural mainland Tanzania, and Zanzibar. This new cohort in NPS 2014/2015 will be maintained and tracked in all future rounds between national censuses.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The format of the NPS-UPQ survey instrument is similar to previously disseminated NPS survey instruments. Each module has a questionnaire and clearly identifies if the module collects information at the individual or household level. Within each module-specific questionnaire of the NPS-UPQ survey instrument, there are five distinct sections, arranged vertically: (1) the UPQ - “U” on the survey instrument, (2) R4, (3), R3, (4) R2, and (5) R1 – the latter 4 sections presenting each questionnaire in its original form at time of its respective dissemination.

    The uppermost section of each module’s questionnaire (“U”) represents the model universal panel questionnaire, with questions generated from the comprehensive listing of questions across all four rounds of the NPS and codes generated from the comprehensive collection of codes. The following sections are arranged vertically by round, considering R4 as most recent. While not all rounds will have data reported for each question in the UPQ and not each question will have reports for each of the UPQ codes listed, the NPS-UPQ survey instrument represents the visual, all-inclusive set of information collected by the NPS over time.

    The four round-specific sections (R4, R3, R2, R1) are aligned with their UPQ-equivalent question, visually presenting their contribution to compatibility with the UPQ. Each round-specific section includes the original round-specific variable names, response codes and skip patterns (corresponding to their respective round-specific NPS data sets, and despite their variance from other rounds or from the comprehensive UPQ code listing)4.

    • Household identification;
    • Survey staff details;
    • Household member roster;
    • Education,
    • Health,
    • Labour;
    • Food outside the household;
    • Subject welfare;
    • Food security;
    • Housing, water and sanitation;
    • Consumption of food over the past one week;
    • Non-food expenditures (past one week & one month);
    • Non-food expenditures (past twelve months);
    • Household assets;
    • Family/household non-farm enterprises;
    • Assistance and groups;
    • Credit;
    • Finance;
    • Recent shocks to household welfare;
    • Deaths in the household;
    • Household recontact information;
    • Filter questions;
    • Anthropometry.
  13. d

    Multilevel modeling of time-series cross-sectional data reveals the dynamic...

    • datadryad.org
    zip
    Updated Mar 6, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kodai Kusano (2020). Multilevel modeling of time-series cross-sectional data reveals the dynamic interaction between ecological threats and democratic development [Dataset]. http://doi.org/10.5061/dryad.547d7wm3x
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 6, 2020
    Dataset provided by
    Dryad
    Authors
    Kodai Kusano
    Time period covered
    Oct 13, 2019
    Description

    Our Time-Series Cross-Sectional data combine various online databases. Country names were first identified and matched using R-package “countrycode” (Arel-Bundock, Enevoldsen, & Yetman, 2018) before all datasets were merged. Occasionally, we modified unidentified country names to be consistent across datasets. We then transformed “wide” data into “long” data and merged them using R’s Tidyverse framework (Wickham, 2014). Our analysis begins with the year 1949, which was occasioned by the fact that one of the key time-variant level-1 variables, pathogen prevalence was only available from 1949 on. See our Supplemental Material for all data, Stata syntax, R-markdown for visualization, supplemental analyses and detailed results (available at https://osf.io/drt8j/).

  14. B

    Replication Data and Code for: Income convergence among U.S. states:...

    • borealisdata.ca
    Updated Mar 9, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jac C. Heckelman (2022). Replication Data and Code for: Income convergence among U.S. states: cross-sectional and time series evidence [Dataset]. http://doi.org/10.5683/SP3/IWSM0I
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 9, 2022
    Dataset provided by
    Borealis
    Authors
    Jac C. Heckelman
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The data and programs replicate tables and figures from "Income convergence among U.S. states: cross-sectional and time series evidence", by Heckelman. Please see the ReadMe file for additional details.

  15. m

    Data for: An Output Gap Measure for the Euro Area: Exploiting Country-Level...

    • data.mendeley.com
    Updated Apr 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuel González-Astudillo (2021). Data for: An Output Gap Measure for the Euro Area: Exploiting Country-Level and Cross-Sectional Data Heterogeneity [Dataset]. http://doi.org/10.17632/7ndfgjbfgw.1
    Explore at:
    Dataset updated
    Apr 26, 2021
    Authors
    Manuel González-Astudillo
    License

    Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
    License information was derived automatically

    Description

    Macroeconomic indicators of the euro area.

  16. Time Series Air Flow Rate Data

    • kaggle.com
    zip
    Updated Dec 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vitthal Madane (2022). Time Series Air Flow Rate Data [Dataset]. https://www.kaggle.com/datasets/vitthalmadane/airflow14july
    Explore at:
    zip(28264 bytes)Available download formats
    Dataset updated
    Dec 11, 2022
    Authors
    Vitthal Madane
    Description

    Dataset represents Air flow to room to maintain room conditions with respect to date and time. Air Flow is Measured in Cubic Feet per Minute called as CFM. Air Flow through duct is function of Air Velocity and Area of Cross-section thorough which it is flowing.

  17. Data from: Comparative analysis of death by suicide in Brazil and in the...

    • scielo.figshare.com
    png
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Abuabara; Allan Abuabara; Carin Albino Luçolli Tonchuk (2023). Comparative analysis of death by suicide in Brazil and in the United States: descriptive, cross-sectional time series study [Dataset]. http://doi.org/10.6084/m9.figshare.20007110.v1
    Explore at:
    pngAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    SciELOhttp://www.scielo.org/
    Authors
    Alexander Abuabara; Allan Abuabara; Carin Albino Luçolli Tonchuk
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Brazil, United States
    Description

    ABSTRACT CONTEXT AND OBJECTIVE: The World Health Organization recognizes suicide as a public health priority. Increased knowledge of suicide risk factors is needed in order to be able to adopt effective prevention strategies. The aim of this study was to analyze and compare the association between the Gini coefficient (which is used to measure inequality) and suicide death rates over a 14-year period (2000-2013) in Brazil and in the United States (US). The hypothesis put forward was that reduction of income inequality is accompanied by reduction of suicide rates. DESIGN AND SETTING: Descriptive cross-sectional time-series study in Brazil and in the US. METHODS: Population, death and suicide death data were extracted from the DATASUS database in Brazil and from the National Center for Health Statistics in the US. Gini coefficient data were obtained from the World Development Indicators. Time series analysis was performed on Brazilian and American official data regarding the number of deaths caused by suicide between 2000 and 2013 and the Gini coefficients of the two countries. The suicide trends were examined and compared. RESULTS: Brazil and the US present converging Gini coefficients, mainly due to reduction of inequality in Brazil over the last decade. However, suicide rates are not converging as hypothesized, but are in fact rising in both countries. CONCLUSION: The hypothesis that reduction of income inequality is accompanied by reduction of suicide rates was not verified.

  18. Replication Data for: Democratization and Gini index: Panel data analysis...

    • search.datacite.org
    Updated 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LEIZHEN ZANG; Xiong Feng (2019). Replication Data for: Democratization and Gini index: Panel data analysis based on random forest method [Dataset]. http://doi.org/10.7910/dvn/w2cxvu
    Explore at:
    Dataset updated
    2019
    Dataset provided by
    DataCite
    Harvard Dataverse
    Authors
    LEIZHEN ZANG; Xiong Feng
    Description

    The mechanism for the association between democratic development and the wealth gap has always been the focus of political and economic research, yet with no consistent conclusion. The reasons for that often are, 1) challenges to generalize the results obtained from analyzing a single country’s time series studies or multinational cross-section data analysis, and 2) deviations in research results caused by missing values or variable selection in panel data analysis. When it comes to the latter one, there are two factors contribute to it. One is that the accuracy of estimation is interfered with the presence of missing values in variables, another is that subjective discretion that must be exercised to select suitable proxies amongst many candidates, which are likely to cause variable selection bias. In order to solve these problems, this study is the pioneeringly research to utilize the machine learning method to interpolate missing values efficiently through the random forest model in this topic, and effectively analyzed cross-country data from 151 countries covering the period 1993–2017. Since this paper measures the importance of different variables to the dependent variable, more appropriate and important variables could be selected to construct a complete regression model. Results from different models come to a consensus that the promotion of democracy can significantly narrow the gap between the rich and the poor, with marginally decreasing effect with respect to wealth. In addition, the study finds out that this mechanism exists only in non-colonial nations or presidential states. Finally, this paper discusses the potential theoretical and policy implications of results.

  19. Arroyo de los Pinos Sediment Monitoring Research Station One Minute Average...

    • catalog.data.gov
    Updated Jan 16, 2026
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bureau of Reclamation (2026). Arroyo de los Pinos Sediment Monitoring Research Station One Minute Average Stage Time Series Data [Dataset]. https://catalog.data.gov/dataset/arroyo-de-los-pinos-sediment-monitoring-research-station-one-minute-average-stage-time-ser
    Explore at:
    Dataset updated
    Jan 16, 2026
    Dataset provided by
    United States Bureau of Reclamationhttp://www.usbr.gov/
    Description

    Average stage measured at the monitored cross-section

  20. w

    General Household Survey - Panel 2010-2019, Uniform Panel Data - Nigeria

    • microdata.worldbank.org
    Updated Nov 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Bureau of Statistics (NBS) (2025). General Household Survey - Panel 2010-2019, Uniform Panel Data - Nigeria [Dataset]. https://microdata.worldbank.org/index.php/catalog/5835
    Explore at:
    Dataset updated
    Nov 3, 2025
    Dataset authored and provided by
    National Bureau of Statistics (NBS)
    Time period covered
    2010 - 2019
    Area covered
    Nigeria
    Description

    Abstract

    Panel data possess several advantages over conventional cross-sectional and time-series data, including their power to isolate the effects of specific actions, treatments, and general policies often at the core of large-scale econometric development studies. While the concept of panel data alone provides the capacity for modeling the complexities of human behavior, the notion of universal panel data – in which time- and situation-driven variances leading to variations in tools, and thus results, are mitigated – can further enhance exploitation of the richness of panel information.

    The Basic Information Document (BID) provides a brief overview of the Nigerian General Household Survey (GHS) but focuses primarily on the theoretical development and application of panel data, as well as key elements of the universal panel survey instrument and datasets generated by the four rounds of the GHS. As the BID does not describe in detail the background, development, or use of the GHS itself, the wave-specific GHS BIDs should supplement the information provided here.

    The Nigeria Universal Panel Data (NUPD) consists of both survey instruments and datasets from the two survey visits of the GHS - Post-Planting (PP) and Post-Harvest (PH) - meticulously aligned and engineered with the aim of facilitating the use of and improving access to the wealth of panel data offered by the GHS. The NUPD provides a consistent and straightforward means of conducting user-driven analyses using convenient, standardized tools.

    The design of the NUPD combines the four completed Waves of the GHS Household Post-Planting and Post-Harvest Surveys – Wave 1 (2010/11), Wave 2 (2012/13), Wave 3 (2015/16), and Wave 4 (2018/19) – into pooled, module-specific survey instruments and datasets. The panel survey instruments offer the ease of comparability over time, with modifications and variances easily identifiable as well as those aspects of the questionnaire which have remained identical and offer consistent information. By providing all module-specific data over time within compact, pooled datasets, panel datasets eliminate the need for user-generated merges between rounds and present data in a clear, logical format, increasing both the usability and comprehension of complex data.

    Geographic coverage

    National

    Analysis unit

    • Households
    • Individuals

    Universe

    The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    Please see the GHS BIDs for each round for detailed descriptions of the sample design used in each round and their respective implementation efforts as this is a compilation of datasets from all previous waves.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The larger GHS-Panel project consists of three questionnaires (Household Questionnaire, Agriculture Questionnaire, Community Questionnaire) for each of the two visits (Post-Planting and Post-Harvest). The GHS-NUPD only consists of the Household Questionnaire.

    GHS-Panel Household Questionnaire: The Household Questionnaire provides information on demographics; education; health (including anthropometric measurement for children); labor; food and non-food expenditure; household nonfarm income-generating activities; food security and shocks; safety nets; housing conditions; assets; information and communication technology; and other sources of household income.

    The Household Questionnaire is slightly different for the two visits. Some information was collected only in the post-planting visit, some only in the post-harvest visit, and some in both visits.

    Cleaning operations

    Please see the GHS BIDs for each round for detailed descriptions of data editing and additional data processing efforts as this is a compilation of datasets from all previous waves.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Bureau of Economic Research (2007). Macroeconomic Time Series for the United States, United Kingdom, Germany, and France [Dataset]. http://doi.org/10.3886/ICPSR07644.v2
Organization logo

Macroeconomic Time Series for the United States, United Kingdom, Germany, and France

Explore at:
spss, sas, stata, asciiAvailable download formats
Dataset updated
Mar 26, 2007
Dataset provided by
Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
Authors
National Bureau of Economic Research
License

https://www.icpsr.umich.edu/web/ICPSR/studies/7644/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/7644/terms

Time period covered
1785 - 1968
Area covered
United States, France, Global, Germany, United Kingdom
Description

This collection contains an array of economic time series data pertaining to the United States, the United Kingdom, Germany, and France, primarily between the 1920s and the 1960s, and including some time series from the 18th and 19th centuries. These data were collected by the National Bureau of Economic Research (NBER), and they constitute a research resource of importance to economists as well as to political scientists, sociologists, and historians. Under a grant from the National Science Foundation, ICPSR and the National Bureau of Economic Research converted this collection (which existed heretofore only on handwritten sheets stored in New York) into fully accessible, readily usable, and completely documented machine-readable form. The NBER collection -- containing an estimated 1.6 million entries -- is divided into 16 major categories: (1) construction, (2) prices, (3) security markets, (4) foreign trade, (5) income and employment, (6) financial status of business, (7) volume of transactions, (8) government finance, (9) distribution of commodities, (10) savings and investments, (11) transportation and public utilities, (12) stocks of commodities, (13) interest rates, and (14) indices of leading, coincident, and lagging indicators, (15) money and banking, and (16) production of commodities. Data from all categories are available in Parts 1-22. The economic variables are usually observations on the entire nation or large subsets of the nation. Frequently, however, and especially in the United States, separate regional and metropolitan data are included in other variables. This makes cross-sectional analysis possible in many cases. The time span of variables in these files may be as short as one year or as long as 160 years. Most data pertain to the first half of the 20th century. Many series, however, extend into the 19th century, and a few reach into the 18th. The oldest series, covering brick production in England and Wales, begins in 1785, and the most recent United States data extend to 1968. The unit of analysis is an interval of time -- a year, a quarter, or a month. The bulk of observations are monthly, and most series of monthly data contain annual values or totals.

Search
Clear search
Close search
Google apps
Main menu