37 datasets found

J
The Employment Effects of the Minimum Wage: A Selection Ratio Approach to...
journaldata.zbw.eu
csv, rtf, stata data +1
Updated Dec 20, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
David Slichter; David Slichter (2022). The Employment Effects of the Minimum Wage: A Selection Ratio Approach to Measuring Treatment Effects (replication data) [Dataset]. http://doi.org/10.15456/jae.2022349.0653510847
Explore at:
csv(4311446), stata data(4372766), stata do(5231), stata do(4393), stata do(9425), csv(513370056), rtf(4784), stata data(371859711)Available download formats
Unique identifier
https://doi.org/10.15456/jae.2022349.0653510847
Dataset updated
Dec 20, 2022
Dataset provided by
ZBW - Leibniz Informationszentrum Wirtschaft
Authors
David Slichter; David Slichter
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Replication files for David Slichter, "The Employment Effects of the Minimum Wage: A Selection Ratio Approach to Measuring Treatment Effects,” Journal of Applied Econometrics, forthcoming

Firstly, I’ve provided a .do file called sr.do which contains general code for implementing the selection ratio approach, with detailed instructions written as comments in the code.

For the minimum wage application, the main data file is mw_final.dta. A .csv version is also provided. Observations are a county in a time period. I have added self-explanatory variable labels for most variables. A few variables warrant a clearer explanation:

adj1-adj14: List of FIPS codes of all counties which are adjacent to the county in question. Each variables holds one adjacent county, and counties with fewer than 14 neighbors will have missing values for some of these variables.

change, logchange: Minimum wage this quarter - minimum wage last quarter, measured either in dollars or in logs.

time, t1-t108: The variable "time" converts years and quarters into a univariate time period, with time=1 in 1990Q1 and time=108 in 2016Q4. t1-t108 are indicators for each of these time periods.

lnemp_1418, lnearnbeg_1418, lnsep_1418, lnhira_1418, lnchurn_1418: Logs of employment, earnings, separations, hires, and churn, respectively, for 14-18 year olds.

gt1-gt6: Dummies for inclusion in each of the six comparisons used for the main (i.e., not spillover-robust) analysis. All treated counties which neighbor a control country take value 1 for each of these variables; all other treated counties take value 0. Among control counties, gt1=1 if the county neighbors a treated county and 0 otherwise, gt2=1 if the county has gt1=0 but neighbors a gt1=1 county, gt3=1 if county has gt1=gt2=0 but neighbors a gt2=1 county, etc.

h2-h6: Dummies for inclusion in each of the first spillover-robust (i.e., excluding border counties only) comparisons. Among control counties, h2-h6 are equal to gt2-gt6. Among treated counties, h2-h6 are equal to 1 if the treated county has gt1=0 but borders a gt1=1 county, and 0 otherwise.

k3-k6: Dummies for inclusion in each of the second spillover-robust (i.e., excluding two layers) comparisons. Among control counties, these variables are equal to gt3-gt6. Among treated counties, all observations take value 1 except those with gt1=1 or h2=1.

The data sources are as follows. The minimum wage law series is taken from David Neumark's website (https://www.economics.uci.edu/~dneumark/datasets.html). The economic variables are taken from the QWI, which I accessed via the Ithaca Virtual RDC. County adjacency files were downloaded from the Census Bureau (https://www.census.gov/geo/reference/county-adjacency.html).

The file main.do then runs the analyses. The resulting output file containing results is results.dta.

For the incumbency application, the main data file is incumb_final.dta. A .csv version is also provided. This file is drawn from Caughey and Sekhon's (2011) data; see their description of most variables here: https://doi.org/10.7910/DVN/8EYYA2

The key added variables are _IDistancea1-_IDistancea50, which are dummies for inclusion in the 50 comparisons used in the paper. Treated observations (i.e., Democratic wins) with margin of victory below 5 points have each of these variables equal to 1. Control observations have these variables equal to 1 if they fall within the margin of victory range, e.g., _IDistancea9=1 for control observations with Republican margin of victory between 8 and 9 points. Note that these variables are redefined by the code for the analyses of treatment effects away from the discontinuity. Lastly, there is a variable called RepWin which is the treatment variable when treatment is defined as a Republican winning.

The file sr_incumb.do then performs the analysis.

Please contact me with any questions at slichter@binghamton.edu.
c
Statistical Regression Methods in Education Teaching Datasets: Longitudinal...
datacatalogue.cessda.eu
beta.ukdataservice.ac.uk
Updated Nov 28, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cadwallader, S., University of Warwick; Strand, S., University of Warwick (2024). Statistical Regression Methods in Education Teaching Datasets: Longitudinal Study of Young People in England, 2004-2006 [Dataset]. http://doi.org/10.5255/UKDA-SN-6660-1
Explore at:
Unique identifier
https://doi.org/10.5255/UKDA-SN-6660-1
Dataset updated
Nov 28, 2024
Dataset provided by
Institute of Education
Authors
Cadwallader, S., University of Warwick; Strand, S., University of Warwick
Area covered
England
Variables measured
Individuals, Families/households, National
Measurement technique
Compilation or synthesis of existing material
Description
Abstract copyright UK Data Service and data collection copyright owner.

These teaching datasets, comprising a sub-set of a large-scale longitudinal study, the Longitudinal Study of Young People in England (LSYPE), were created as part of the NCRM Developing Statistical Modelling in the Social Sciences: Lancaster-Warwick-Stirling Node Phase 2 project, funded by the Economic and Social Research Council (ESRC). During the project, a web site was created with the aim to provide a web-based training resource about the use of statistical regression methods in educational research. The content is designed to teach users how to perform a variety of regression analyses using SPSS, starting with foundation material in basic statistics and working through to more complex multiple linear, logistic and ordinal regression models. Along with illustrated modules the site contains demonstration videos, interactive quizzes and SPSS exercises and examples that use these LSYPE teaching data. Further information and documentation may be found at the web site, Using Statistical Methods in Education Research. Throughout the site modules users are invited to use the datasets for either following the examples or performing exercises. Prospective users of the data will be directed to register an account in order to download the data.

The full LSYPE study is held at the Archive under SN 5545. The teaching datasets include information drawn from Wave 1 of LSYPE, conducted in 2004, with GCSE results matched from Wave 3 (2006). Further information about the NCRM Node project covering this study may be found on the Developing Statistical Modelling in the Social Sciences ESRC award web page.

Documentation
There is currently no discrete documentation currently available with these teaching datasets; users should consult the web site noted above. Documentation covering the main LSYPE study is available with SN 5545.

For the second edition (July 2011), updated versions of the SPSS data files were deposited to resolve minor anomalies.

Main Topics:

The teaching datasets include variables covering LSYPE respondents' educational test results, academic achievement and school life, and demographic/household characteristics including ethnic group, gender, social class and socio-economic status, computer ownership, private education, and mothers' occupational status and educational background.
H
Replication Data for: "Regression Discontinuity Designs Using Covariates"
dataverse.harvard.edu
Updated Jun 30, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sebastian Calonico; Matias Cattaneo; Max Farrell; Rocio Titiunik (2020). Replication Data for: "Regression Discontinuity Designs Using Covariates" [Dataset]. http://doi.org/10.7910/DVN/LPZLBF
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/LPZLBF
Dataset updated
Jun 30, 2020
Dataset provided by
Harvard Dataverse
Authors
Sebastian Calonico; Matias Cattaneo; Max Farrell; Rocio Titiunik
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Calonico, Sebastian, Cattaneo, Matias D., Farrell, Max H., and Titiunik, Rocio, (2019) "Regression Discontinuity Designs Using Covariates." Review of Economics and Statistics 101:3, 442-451.
d
Replication Data for: Inference in High-Dimensional Regression Models...
search.dataone.org
dataverse.harvard.edu
Updated Nov 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cha, Jooyoung; Chiang, Harold D.; Sasaki, Yuya (2023). Replication Data for: Inference in High-Dimensional Regression Models without the Exact or Lp Sparsity [Dataset]. http://doi.org/10.7910/DVN/DFBV7K
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/DFBV7K
Dataset updated
Nov 8, 2023
Dataset provided by
Harvard Dataverse
Authors
Cha, Jooyoung; Chiang, Harold D.; Sasaki, Yuya
Description
Review of Economics and Statistics: Forthcoming.. Visit https://dataone.org/datasets/sha256%3Ad00937e0e95caca90195351492ee3df98fa25094069700fa52605c182a3a5a0c for complete metadata about this dataset.
A
‘Boston House Prices-Advanced Regression Techniques’ analyzed by Analyst-2
analyst-2.ai
Updated Feb 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘Boston House Prices-Advanced Regression Techniques’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-boston-house-prices-advanced-regression-techniques-bae0/fd606ebf/?iid=003-689&v=presentation
Explore at:
Dataset updated
Feb 13, 2022
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Boston
Description
Analysis of ‘Boston House Prices-Advanced Regression Techniques’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/fedesoriano/the-boston-houseprice-data on 13 February 2022.

--- Dataset description provided by original source is as follows ---

Similar Datasets

Gender Pay Gap Dataset: LINK

California Housing Prices Data (5 new features!): LINK

Company Bankruptcy Prediction: LINK

Context

The Boston house-price data of Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978.

Attribute Information

Input features in order: 1) CRIM: per capita crime rate by town 2) ZN: proportion of residential land zoned for lots over 25,000 sq.ft. 3) INDUS: proportion of non-retail business acres per town 4) CHAS: Charles River dummy variable (1 if tract bounds river; 0 otherwise) 5) NOX: nitric oxides concentration (parts per 10 million) [parts/10M] 6) RM: average number of rooms per dwelling 7) AGE: proportion of owner-occupied units built prior to 1940 8) DIS: weighted distances to five Boston employment centres 9) RAD: index of accessibility to radial highways 10) TAX: full-value property-tax rate per $10,000 [$/10k] 11) PTRATIO: pupil-teacher ratio by town 12) B: The result of the equation B=1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town 13) LSTAT: % lower status of the population

Output variable: 1) MEDV: Median value of owner-occupied homes in $1000's [k$]

Source

StatLib - Carnegie Mellon University

Relevant Papers

Harrison, David & Rubinfeld, Daniel. (1978). Hedonic housing prices and the demand for clean air. Journal of Environmental Economics and Management. 5. 81-102. 10.1016/0095-0696(78)90006-2. LINK

Belsley, David A. & Kuh, Edwin. & Welsch, Roy E. (1980). Regression diagnostics: identifying influential data and sources of collinearity. New York: Wiley LINK

--- Original source retains full ownership of the source dataset ---
m
Results of univariate gamma regression models for direct costs and per...
data.mendeley.com
narcis.nl
Updated Feb 19, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paula Becker (2020). Results of univariate gamma regression models for direct costs and per capita direct costs according to results of ASSIST for alcohol, cannabis, and cocaine/crack [Dataset]. http://doi.org/10.17632/mnj98x2ghc.1
Explore at:
Unique identifier
https://doi.org/10.17632/mnj98x2ghc.1
Dataset updated
Feb 19, 2020
Authors
Paula Becker
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This material brings data on the results of univariate gamma regression model for direct costs, which was the first stage of inferential analysis using the linear regression model so that we could analyse which variables could be interacting with total direct cost per capita. The first table shows these data and precedes the multivariate analysis described in the article. The second table shows a more detailed descreptive analysis of per capita direct costs according to the current drug use pattern (evaluated by ASSIST alcohol, cannabis and cocaine/crack), including mean, standard deviation, minimum, maximum, first quartile, median, third quartile and the p value according to Kruskal-Wallis test. These data make reference to the article by Dr. Paula Becker e Dr. Denise Razzouk called " Relationships between age of onset of drug use, use pattern, and direct health costs in a sample of adults’ drug dependents in treatment at a Brazilian community mental health service ".
o
Data and Code for: When Do Politicians Appeal Broadly? The Economic...
openicpsr.org
delimited
Updated Apr 27, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Moya Chin (2022). Data and Code for: When Do Politicians Appeal Broadly? The Economic Consequences of Electoral Rules in Brazil [Dataset]. http://doi.org/10.3886/E168901V1
Explore at:
delimitedAvailable download formats
Unique identifier
https://doi.org/10.3886/E168901V1
Dataset updated
Apr 27, 2022
Dataset provided by
American Economic Association
Authors
Moya Chin
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
1996 - 2016
Area covered
Brazil
Description
Replication package to construct the analysis for "When Do Politicians Appeal Broadly? The Economic Consequences of Electoral Rules in Brazil." It uses datasets constructed from four data sources: 1) Brazil municipal election data; 2) Brazil demographic censuses; 3) Brazil school census; and 4) nighttime lights.
f
Regression analysis for economic growth by OLS.
figshare.com
plos.figshare.com
xls
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Henry Laverde-Rojas; Juan C. Correa; Klaus Jaffe; Mario I. Caicedo (2023). Regression analysis for economic growth by OLS. [Dataset]. http://doi.org/10.1371/journal.pone.0213651.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0213651.t001
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Henry Laverde-Rojas; Juan C. Correa; Klaus Jaffe; Mario I. Caicedo
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Regression analysis for economic growth by OLS.
f
GWR model parameters description.
plos.figshare.com
xls
Updated May 29, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GWR model parameters description. [Dataset]. https://plos.figshare.com/articles/dataset/GWR_model_parameters_description_/25926624
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0300443.t002
Dataset updated
May 29, 2024
Dataset provided by
PLOS ONE
Authors
Haidong Zhong; Bifeng Wang; Shaozhong Zhang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The digital economy (DE) has become a major breakthrough in promoting industrial upgrading and an important engine for high-quality economic growth. However, most studies have neglected the important driving effect of regional economic and social (RES) development on DE. In this paper, we discuss the mechanism of RES development promoting the development of DE, and establish a demand-driven regional DE development model to express the general idea. With the help of spatial analysis toolbox in ArcGIS software, the spatial development characteristics of DE in the Yangtze River Delta City Cluster (YRDCC) is explored. We find the imbalance of spatial development is very significant in YRDCC, no matter at the provincial level or city level. Quantitative analysis reveals that less than 1% likelihood that the imbalanced or clustered pattern of DE development in YRDCC could be the result of random chance. Geographically weighted regression (GWR) analysis with publicly available dataset of YRDCC indicates RES development significantly promotes the development of DE.
f
Sample size of each data set.
figshare.com
xls
Updated Nov 7, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xiaofeng Xu; Zhaoyuan Chen; Shixiang Chen (2023). Sample size of each data set. [Dataset]. http://doi.org/10.1371/journal.pone.0293303.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0293303.t003
Dataset updated
Nov 7, 2023
Dataset provided by
PLOS ONE
Authors
Xiaofeng Xu; Zhaoyuan Chen; Shixiang Chen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Urban economic competitiveness is a fundamental indicator for assessing the level of urban development and serves as an effective approach for understanding regional disparities. Traditional economic competitiveness research that relies solely on traditional regression models and assumes feature relationship theory tends to fall short in fully exploring the intricate interrelationships and nonlinear associations among features. As a result, the study of urban economic disparities remains limited to a narrow range of urban features, which is insufficient for comprehending cities as complex systems. The ability of deep learning neural networks to automatically construct models of nonlinear relationships among complex features provides a new approach to research in this issue. In this study, a complex urban feature dataset comprising 1008 features was constructed based on statistical data from 283 prefecture-level cities in China. Employing a machine learning approach based on convolutional neural network (CNN), a novel analytical model is constructed to capture the interrelationships among urban features, which is applied to achieve accurate classification of urban economic competitiveness. In addition, considering the limited number of samples in the dataset owing to the fixed number of cities, this study developed a data augmentation approach based on deep convolutional generative adversarial network (DCGAN) to further enhance the accuracy and generalization ability of the model. The performance of the CNN classification model was effectively improved by adding the generated samples to the original sample dataset. This study provides a precise and stable analytical model for investigating disparities in regional development. In the meantime, it offers a feasible solution to the limited sample size issue in the application of deep learning in urban research.
f
Regression analysis.
figshare.com
plos.figshare.com
xls
Updated Jun 14, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Guowei Li; Zhe Luo; Muhammad Anwar; Yuqiu Lu; Xiantao Wang; Xuening Liu (2023). Regression analysis. [Dataset]. http://doi.org/10.1371/journal.pone.0235462.t006
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0235462.t006
Dataset updated
Jun 14, 2023
Dataset provided by
PLOS ONE
Authors
Guowei Li; Zhe Luo; Muhammad Anwar; Yuqiu Lu; Xiantao Wang; Xuening Liu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Regression analysis.
d
Replication Data for: Improving Estimation Efficiency via...
search.dataone.org
dataverse.harvard.edu
Updated Dec 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jiang, Liang; Linton, Oliver B.; Tang, Haihan; Zhang, Yichong (2023). Replication Data for: Improving Estimation Efficiency via Regression-Adjustment in Covariate-Adaptive Randomizations with Imperfect Compliance [Dataset]. http://doi.org/10.7910/DVN/DNF4QC
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/DNF4QC
Dataset updated
Dec 16, 2023
Dataset provided by
Harvard Dataverse
Authors
Jiang, Liang; Linton, Oliver B.; Tang, Haihan; Zhang, Yichong
Description
Review Economics and Statistics: Forthcoming. Visit https://dataone.org/datasets/sha256%3A5c6d9a57835ff9097126708e2b53bc66c569250f049fb86a1172478089de8737 for complete metadata about this dataset.
d
Replication Data for: Exploring the Spatial Impact of Sci-Tech Finance on...
dataone.org
dataverse.harvard.edu
Updated Sep 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Li Yang; Cheng Hong; Wang Qiliang (2024). Replication Data for: Exploring the Spatial Impact of Sci-Tech Finance on High-Quality Economic Development: A Regression Estimation Utilizing Entropy Weight Method, Text Analysis, and Spatial Autoregressive Model [Dataset]. http://doi.org/10.7910/DVN/OPQEAT
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/OPQEAT
Dataset updated
Sep 24, 2024
Dataset provided by
Harvard Dataverse
Authors
Li Yang; Cheng Hong; Wang Qiliang
Description
This dataset utilizes data from Chinese provinces spanning the period from 2004 to 2022,. It employs the entropy weighting method and text analysis to construct indicators that analyze the spatial impact of science and technology finance (Sci-tech finance) on high-quality economic development, using a spatial autoregressive model.
P
Real Estate Price Prediction Dataset
paperswithcode.com
Updated Mar 7, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Real Estate Price Prediction Dataset [Dataset]. https://paperswithcode.com/dataset/real-estate-price-prediction
Explore at:
Dataset updated
Mar 7, 2025
Description
Problem Statement

👉 Download the case studies here

Investors and buyers in the real estate market faced challenges in accurately assessing property values and market trends. Traditional valuation methods were time-consuming and lacked precision, making it difficult to make informed investment decisions. A real estate firm sought a predictive analytics solution to provide accurate property price forecasts and market insights.

Challenge

Developing a real estate price prediction system involved addressing the following challenges:

Collecting and processing vast amounts of data, including historical property prices, economic indicators, and location-specific factors.

Accounting for diverse variables such as neighborhood quality, proximity to amenities, and market demand.

Ensuring the model’s adaptability to changing market conditions and economic fluctuations.

Solution Provided

A real estate price prediction system was developed using machine learning regression models and big data analytics. The solution was designed to:

Analyze historical and real-time data to predict property prices accurately.

Provide actionable insights on market trends, enabling better investment strategies.

Identify undervalued properties and potential growth areas for investors.

Development Steps

Data Collection

Collected extensive datasets, including property listings, sales records, demographic data, and economic indicators.

Preprocessing

Cleaned and structured data, removing inconsistencies and normalizing variables such as location, property type, and size.

Model Development

Built regression models using techniques such as linear regression, decision trees, and gradient boosting to predict property prices. Integrated feature engineering to account for location-specific factors, amenities, and market trends.

Validation

Tested the models using historical data and cross-validation to ensure high prediction accuracy and robustness.

Deployment

Implemented the prediction system as a web-based platform, allowing users to input property details and receive price estimates and market insights.

Continuous Monitoring & Improvement

Established a feedback loop to update models with new data and refine predictions as market conditions evolved.

Results

Increased Prediction Accuracy

The system delivered highly accurate property price forecasts, improving investor confidence and decision-making.

Informed Investment Decisions

Investors and buyers gained valuable insights into market trends and property values, enabling better strategies and reduced risks.

Enhanced Market Insights

The platform provided detailed analytics on neighborhood trends, demand patterns, and growth potential, helping users identify opportunities.

Scalable Solution

The system scaled seamlessly to include new locations, property types, and market dynamics.

Improved User Experience

The intuitive platform design made it easy for users to access predictions and insights, boosting engagement and satisfaction.
d
Replication Data for: Looking retrospectively at the 2018 Italian general...
dataone.org
dataverse.unimi.it
Updated Nov 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Giuliani, Marco (2023). Replication Data for: Looking retrospectively at the 2018 Italian general election: the state of the economy and the presence of foreigners [Dataset]. http://doi.org/10.7910/DVN/FHR8KQ
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/FHR8KQ
Dataset updated
Nov 9, 2023
Dataset provided by
Harvard Dataverse
Authors
Giuliani, Marco
Description
Municipal and provincial Italian electoral data 2018, together with economic and immigration data at the same level. Shapefiles dataset for maps and spatial regression models Scholars agree that two major issues oriented voting behaviours during the Italian general election of 2018. The first was the state of the economy, which had not yet recovered from the lowest points reached during the Great Recession, but had nevertheless exhibited some marginal improvement. The second issue originated from another crisis, the refugee and asylum emergency, which contributed to increasing the presence of foreigners in Italy and the salience of the migration issue. The article investigates the impact of these two types of problem on the 2018 election results by using aggregated objective data at the municipal level. It finds confirmation of the two issues’ impact on retrospective punishment of the incumbent Democratic Party also when using spatial regression models distinguishing the direct influence and the spill-over effects of the poor state of the economy and an increase in the size of the foreign population.
H
Replication data for: Barriers to Entry in the Airline Industry: A...
dataverse.harvard.edu
Updated Feb 22, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Replication data for: Barriers to Entry in the Airline Industry: A Multi-Dimensional Regression-Discontinuity Analysis of AIR-21 [Dataset]. https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/27289
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/27289
Dataset updated
Feb 22, 2016
Dataset provided by
Harvard Dataverse
Authors
Connan Snider; Jonathan Williams
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Snider, Connan, and Williams, Jonathan W., (2015) "Barriers to Entry in the Airline Industry: A Multi-Dimensional Regression-Discontinuity Analysis of AIR-21." Review of Economics and Statistics 97:5, 1002-1022.
Regional differences in unpaid household service work: definitions and...
cy.ons.gov.uk
ons.gov.uk
xlsx
Updated Nov 23, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics (2018). Regional differences in unpaid household service work: definitions and detailed regression results [Dataset]. https://cy.ons.gov.uk/economy/nationalaccounts/satelliteaccounts/datasets/regionaldifferencesinunpaidhouseholdserviceworkdefinitionsanddetailedregressionresults
Explore at:
xlsxAvailable download formats
Dataset updated
Nov 23, 2018
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
The first two tables give detailed descriptions of the definitions used in the report. The latter two tables contain the coefficient results from the regression models run in the analysis.
d
Replication Data for: The Causal Effects of R&D Grants: Evidence from a...
search.dataone.org
Updated Nov 8, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Santoleri, Pietro; Mina, Andrea; Di Minin, Alberto; Martelli, Irene (2023). Replication Data for: The Causal Effects of R&D Grants: Evidence from a Regression Discontinuity [Dataset]. http://doi.org/10.7910/DVN/WBQJKD
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/WBQJKD
Dataset updated
Nov 8, 2023
Dataset provided by
Harvard Dataverse
Authors
Santoleri, Pietro; Mina, Andrea; Di Minin, Alberto; Martelli, Irene
Description
Review of Economics and Statistics: Forthcoming. Visit https://dataone.org/datasets/sha256%3A6b7c9ddc73f32ceb5e078ce9a3433aae5552b5e5a6fa8dfb7f13d399d7d1aceb for complete metadata about this dataset.
o
Replication data for: Big Data: New Tricks for Econometrics
openicpsr.org
Updated May 1, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hal R. Varian (2014). Replication data for: Big Data: New Tricks for Econometrics [Dataset]. http://doi.org/10.3886/E113925V1
Explore at:
Unique identifier
https://doi.org/10.3886/E113925V1
Dataset updated
May 1, 2014
Dataset provided by
American Economic Association
Authors
Hal R. Varian
Time period covered
May 1, 2014
Description
Computers are now involved in many economic transactions and can capture data associated with these transactions, which can then be manipulated and analyzed. Conventional statistical and econometric techniques such as regression often work well, but there are issues unique to big datasets that may require different tools. First, the sheer size of the data involved may require more powerful data manipulation tools. Second, we may have more potential predictors than appropriate for estimation, so we need to do some kind of variable selection. Third, large datasets may allow for more flexible relationships than simple linear models. Machine learning techniques such as decision trees, support vector machines, neural nets, deep learning, and so on may allow for more effective ways to model complex relationships. In this essay, I will describe a few of these tools for manipulating and analyzing big data. I believe that these methods have a lot to offer and should be more widely known and used by economists.
d
Logistic Regression Samples - Forest harvest patterns on private lands in...
catalog.data.gov
data.usgs.gov
+1more
Updated Jul 6, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Logistic Regression Samples - Forest harvest patterns on private lands in the Cascade Mountains, Washington, USA [Dataset]. https://catalog.data.gov/dataset/logistic-regression-samples-forest-harvest-patterns-on-private-lands-in-the-cascade-mounta
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Area covered
Cascade Range, Washington, United States
Description
Forests in Washington State generate substantial economic revenue from commercial timber harvesting on private lands. To investigate the rates, causes, and spatial and temporal patterns of forest harvest on private tracts throughout the central Cascade Mountain area, we relied on a new generation of annual land-use/land-cover (LULC) products created from the application of the Continuous Change Detection and Classification (CCDC) algorithm to Landsat satellite imagery collected from 1985 to 2014. We calculated metrics of landscape pattern using patches of intact and harvested forest patches identified in each annual layer to identify changes throughout the time series. Patch dynamics revealed four distinct eras of logging trends that align with prevailing regulations and economic conditions. We used multiple logistic regression to determine the biophysical and anthropogenic factors that influence fine-scale selection of harvest stands in each time period. Results show that private forestland became significantly reduced and more fragmented from 1985 to 2014. Variables linked to parameters of site conditions, location, climate, and vegetation greenness consistently distinguished harvest selection for each distinct era. This study demonstrates the utility of annual LULC data for investigating the underlying factors that influence land cover change.

Facebook

Twitter

Click to copy link

Link copied

Cite

David Slichter; David Slichter (2022). The Employment Effects of the Minimum Wage: A Selection Ratio Approach to Measuring Treatment Effects (replication data) [Dataset]. http://doi.org/10.15456/jae.2022349.0653510847

The Employment Effects of the Minimum Wage: A Selection Ratio Approach to Measuring Treatment Effects (replication data)

Explore at:

csv(4311446), stata data(4372766), stata do(5231), stata do(4393), stata do(9425), csv(513370056), rtf(4784), stata data(371859711)Available download formats

Unique identifier

https://doi.org/10.15456/jae.2022349.0653510847

Dataset updated

Dec 20, 2022

Dataset provided by

ZBW - Leibniz Informationszentrum Wirtschaft

Authors

David Slichter; David Slichter

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Replication files for David Slichter, "The Employment Effects of the Minimum Wage: A Selection Ratio Approach to Measuring Treatment Effects,” Journal of Applied Econometrics, forthcoming

Firstly, I’ve provided a .do file called sr.do which contains general code for implementing the selection ratio approach, with detailed instructions written as comments in the code.

For the minimum wage application, the main data file is mw_final.dta. A .csv version is also provided. Observations are a county in a time period. I have added self-explanatory variable labels for most variables. A few variables warrant a clearer explanation:

adj1-adj14: List of FIPS codes of all counties which are adjacent to the county in question. Each variables holds one adjacent county, and counties with fewer than 14 neighbors will have missing values for some of these variables.

change, logchange: Minimum wage this quarter - minimum wage last quarter, measured either in dollars or in logs.

time, t1-t108: The variable "time" converts years and quarters into a univariate time period, with time=1 in 1990Q1 and time=108 in 2016Q4. t1-t108 are indicators for each of these time periods.

lnemp_1418, lnearnbeg_1418, lnsep_1418, lnhira_1418, lnchurn_1418: Logs of employment, earnings, separations, hires, and churn, respectively, for 14-18 year olds.

gt1-gt6: Dummies for inclusion in each of the six comparisons used for the main (i.e., not spillover-robust) analysis. All treated counties which neighbor a control country take value 1 for each of these variables; all other treated counties take value 0. Among control counties, gt1=1 if the county neighbors a treated county and 0 otherwise, gt2=1 if the county has gt1=0 but neighbors a gt1=1 county, gt3=1 if county has gt1=gt2=0 but neighbors a gt2=1 county, etc.

h2-h6: Dummies for inclusion in each of the first spillover-robust (i.e., excluding border counties only) comparisons. Among control counties, h2-h6 are equal to gt2-gt6. Among treated counties, h2-h6 are equal to 1 if the treated county has gt1=0 but borders a gt1=1 county, and 0 otherwise.

k3-k6: Dummies for inclusion in each of the second spillover-robust (i.e., excluding two layers) comparisons. Among control counties, these variables are equal to gt3-gt6. Among treated counties, all observations take value 1 except those with gt1=1 or h2=1.

The data sources are as follows. The minimum wage law series is taken from David Neumark's website (https://www.economics.uci.edu/~dneumark/datasets.html). The economic variables are taken from the QWI, which I accessed via the Ithaca Virtual RDC. County adjacency files were downloaded from the Census Bureau (https://www.census.gov/geo/reference/county-adjacency.html).

The file main.do then runs the analyses. The resulting output file containing results is results.dta.

For the incumbency application, the main data file is incumb_final.dta. A .csv version is also provided. This file is drawn from Caughey and Sekhon's (2011) data; see their description of most variables here: https://doi.org/10.7910/DVN/8EYYA2

The key added variables are _IDistancea1-_IDistancea50, which are dummies for inclusion in the 50 comparisons used in the paper. Treated observations (i.e., Democratic wins) with margin of victory below 5 points have each of these variables equal to 1. Control observations have these variables equal to 1 if they fall within the margin of victory range, e.g., _IDistancea9=1 for control observations with Republican margin of victory between 8 and 9 points. Note that these variables are redefined by the code for the analyses of treatment effects away from the discontinuity. Lastly, there is a variable called RepWin which is the treatment variable when treatment is defined as a Republican winning.

The file sr_incumb.do then performs the analysis.

Please contact me with any questions at slichter@binghamton.edu.

Clear search

Close search

Google apps

Main menu

The Employment Effects of the Minimum Wage: A Selection Ratio Approach to...

Statistical Regression Methods in Education Teaching Datasets: Longitudinal...

Replication Data for: "Regression Discontinuity Designs Using Covariates"

Replication Data for: Inference in High-Dimensional Regression Models...

‘Boston House Prices-Advanced Regression Techniques’ analyzed by Analyst-2

Similar Datasets

Context

Attribute Information

Source

Relevant Papers

Results of univariate gamma regression models for direct costs and per...

Data and Code for: When Do Politicians Appeal Broadly? The Economic...

Regression analysis for economic growth by OLS.

GWR model parameters description.

Sample size of each data set.

Regression analysis.

Replication Data for: Improving Estimation Efficiency via...

Replication Data for: Exploring the Spatial Impact of Sci-Tech Finance on...

Real Estate Price Prediction Dataset

Replication Data for: Looking retrospectively at the 2018 Italian general...

Replication data for: Barriers to Entry in the Airline Industry: A...

Regional differences in unpaid household service work: definitions and...

Replication Data for: The Causal Effects of R&D Grants: Evidence from a...

Replication data for: Big Data: New Tricks for Econometrics

Logistic Regression Samples - Forest harvest patterns on private lands in...

The Employment Effects of the Minimum Wage: A Selection Ratio Approach to Measuring Treatment Effects (replication data)See More Versions

The Employment Effects of the Minimum Wage: A Selection Ratio Approach to Measuring Treatment Effects (replication data)