A more recent web map on this same topic is available for ArcGIS Online subscribers here.This map shows the socioeconomic status of each census tract. Data come from the US Census Bureau's 2011-2015 American Community Survey. Neighborhood Socioeconomic Status, over and above individual socioeconomic status, is a predictor of many health outcomes. The Neighborhood Socioeconomic Status (NSES) Index is on a scale from 0 to 100 with 50 being the national average around 2010. The Index incorporates the following indicators (fields in this layer's attribute table):Median Household Income (from Table B19013)Percent of individuals with income below the Federal Poverty Line (from Table S1701)The educational attainment of adults (age 25+) (from Table B15003)Unemployment Rate (from Table S2301)Percent of households with children under the age of 18 that are "female-headed" (no male present) (from Table B11005)NSES = log(median household income) + (-1.129 * (log(percent of female-headed households))) + (-1.104 * (log(unemployment rate))) + (-1.974 * (log(percent below poverty))) + .451*((high school grads)+(2*(bachelor's degree holders)))To learn more about how the NSES Index was developed, please explore this journal articleMiles, Jeremy and Weden, Margaret; Lavery, Diana; Escarce, José; Kathleen Cagney; Shih, Regina. 2016. “Constructing a Time-Invariant Measure of the Socio-Economic Status of U.S. Census Tracts.” Journal of Urban Health, vol. 93, issue no.1, pp. 213-232. or this PPT presentation presented at the University of Texas at San Antonio's Applied Demography Conference in 2014.
https://www.icpsr.umich.edu/web/ICPSR/studies/38528/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/38528/terms
These datasets contain measures of socioeconomic and demographic characteristics by U.S. census tract for the years 1990-2022 and ZIP code tabulation area (ZCTA) for the years 2008-2022. Example measures include population density; population distribution by race, ethnicity, age, and income; income inequality by race and ethnicity; and proportion of population living below the poverty level, receiving public assistance, and female-headed or single parent families with kids. The datasets also contain a set of theoretically derived measures capturing neighborhood socioeconomic disadvantage and affluence, as well as a neighborhood index of Hispanic, foreign born, and limited English.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here are Yost indexes for census tracts and block groups in the United States for various years from 1990-2019. The Yost index is a composite index of socioeconomic status that consists of a percentile score from 1 (highest SES) to 100 (lowest SES). Data for 1990 and 2000 include the 50 US states plus the District of Columbia. For years after 2000, the data additionally include Puerto Rico. To rescale the index to geographic units smaller than the US, the score column may be used, where scores range from about -1.8 for the highest SES to 1.8 for the lowest SES.More about the Yost index can be found here: Yost K, Perkins C, Cohen R, Morris C, Wright W. Socioeconomic status and breast cancer incidence in California for different race/ethnic groups. Cancer Causes and Control 2001; 12(8): 703–711.
Yu M, Tatalovich Z, Gibson JT, Cronin KA. Using a composite index of socioeconomic status to investigate health disparities while protecting the confidentiality of cancer registry data. Cancer Causes and Control. 2014; 25(1): 81-92.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We extend our previous work with the Yost Index by adding 90% confidence intervals to the index values. These were calculated using the variance replicate estimates published in association with the American Community Survey of the United States Census Bureau.
In the file yost-tract-2015-2019.csv, the data fields consists of 11-digit geographic ID built from FIPS codes (2 digit state, 3 digit county, 6 digit census tract); Yost index, 90% lower confidence interval; 90% upper confidence interval. Data is provided for 72,793 census tracts for which sufficient data were available. The Yost Index ranges from 1 (lowest socioeconomic position) to 100 (highest socioeconomic position).
For those only interested in using the index as we have calculated it, the file yost-tract-2015-2019 is the only file you need. The other 368 files here are provided for anyone who wishes to replicate our results using the R program yost-conf-intervals.R. The program presumes the user is running Windows machine and that all files reside in a folder called C:/yostindex. The R program requires a number of packages, all of which are specified in lines 10-22 of the program.
Details of this project were published in Boscoe FP, Liu B, LaFantasie J, Niu L, Lee FF. Estimating uncertainty in a socioeconomic index derived from the American Community Survey. SSM-Population Health 2022; 18: 101078. Full text
Additional years of data following this format are planned to be added to this repository in time.
This data package has the purpose to offer data for socio-economic indicators and to cover as much as possible the entire this indicator category with regard to the indicator type and to the geographic level. The major sources of the data are the U.S. Census Bureau and the U.S. Bureau for Labor Statistics. Another used sources of data are the U.S. Department of Housing and Urban Development and the U.S. Department of Housing and the U.S. Department Of Agriculture (Economic Research Service).
[[NOTE: Placeholder for DOI. Data still in preparation. Data are currently only accessible to qualified reviewers.]] This dataset includes the home mailing addresses of all participants (registrants with at least one courseware action) in MITx and HarvardX courses. For U.S. residents, These mailing addresses can be parsed and geo-matched with data from the US Census to develop a suite of socioeconomic status indicators, including median neighborhood income and neighborhood level of education. We also include self-reported survey data about parental level of education, and we include an indicator for whether or not the participant earned a certificate. These data provide insight into how SES factors predict student enrollment and completion in MOOCs. REQUEST ACCESS by filling out form at http://vpal.harvard.edu/access-vpal-research-data
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Background: Socioeconomic status (SES) is an important determinant of health and potential modifier of the effects of environmental contaminants. There has been a lack of comprehensive indices for measuring overall SES in Canada. Here, a more comprehensive SES index is developed aiming to support future studies exploring health outcomes related to environmental pollution in Canada. Methods: SES variables (n=22, Census Canada 2006) were selected based on: cultural identities, housing characteristics, variables identified in Canadian environmental injustice studies and a previous deprivation index (Pampalon index). Principal component analysis with a single varimax rotation (factor loadings=¦60¦) was performed on SES variables for 52974 census dissemination areas (DA). The final index was created by averaging the factor scores per DA according to the three components retained. The index was validated by examining its association with preterm birth (gestational age<37 weeks), term low birth weight (LBW, <2500 g), small for gestational age (SGA, <10 percentile of birth weight for gestational age) and PM2.5 (particulate matter=2.5 µm) exposures in Edmonton, Alberta (1999–2008). Results: Index values exhibited a relatively normal distribution (median=0.11, mean=0.0, SD=0.58) across Canada. Values in Alberta tended to be higher than in Newfoundland and Labrador, Northwest Territories and Nunavut (Pearson chi-square p<0.001 across provinces). Lower quintiles of our index and the Pampalon’s index confirmed know associations with a higher prevalence of LBW, SGA, preterm birth and PM2.5 exposure. Results with our index exhibited greater statistical significance and a more consistent gradient of PM2.5 levels and prevalence of pregnancy outcomes. Conclusions: Our index reflects more dimensions of SES than an earlier index and it performed superiorly in capturing gradients in prevalence of pregnancy outcomes. It can be used for future research involving environmental pollution and health in Canada. These metadata can also be found on SAGE's searchable metadata website: http://sagemetadata.policywise.com/nada/index.php/catalog/14
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here are the raw data and R code used in the paper "A comparison of two neighborhood-level socioeconomic indices in the United States" by Boscoe and Li currently under review. The raw data and data dictionary are exactly as they were obtained from the National Historical Geographic Information System (NHGIS). The data comprise the 7 American Community Survey variables used to construct the Yost Index at the block group level for the period 2011-2015.
This dataset contains a selection of six socioeconomic indicators of public health significance and a “hardship index,” by Chicago community area, for the years 2008 – 2012. The indicators are the percent of occupied housing units with more than one person per room (i.e., crowded housing); the percent of households living below the federal poverty level; the percent of persons in the labor force over the age of 16 years that are unemployed; the percent of persons over the age of 25 years without a high school diploma; the percent of the population under 18 or over 64 years of age (i.e., dependency); and per capita income. Indicators for Chicago as a whole are provided in the final row of the table. See the full dataset description for more information at: https://data.cityofchicago.org/api/views/fwb8-6aw5/files/A5KBlegGR2nWI1jgP6pjJl32CTPwPbkl9KU3FxlZk-A?download=true&filename=P:\EPI\OEPHI\MATERIALS\REFERENCES\ECONOMIC_INDICATORS\Dataset_Description_socioeconomic_indicators_2012_FOR_PORTAL_ONLY.pdf
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Socioeconomic Status Social Vulnerability Index 2014-2018’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/2c5de914-693a-405e-80f7-5abb6b7dd3d3 on 28 January 2022.
--- Dataset description provided by original source is as follows ---
Socioeconomic Status is one of the themes of the Social Vulnerability Index from the CDC. This data set is data from the Social Vulnerability Index. Only the Socioeconomic Status columns are represented in this dataset.
ATSDR’s Geospatial Research, Analysis & Services Program (GRASP) created Centers for Disease Control and Prevention Social Vulnerability Index (CDC SVI or simply SVI, hereafter) to help public health officials and emergency response planners identify and map the communities that will most likely need support before, during, and after a hazardous event.
SVI indicates the relative vulnerability of every U.S. Census tract. Census tracts are subdivisions of counties for which the Census collects statistical data. SVI ranks the tracts on 15 social factors, including unemployment, minority status, and disability, and further groups them into four related themes. Thus, each tract receives a ranking for each Census variable and for each of the four themes, as well as an overall ranking.
In addition to tract-level rankings, SVI 2018 also has corresponding rankings at the county level. Notes below that describe “tract” methods also refer to county methods.
--- Original source retains full ownership of the source dataset ---
This dataset contains a selection of six socioeconomic indicators of public health significance and a “hardship index,” by Chicago community area, for the years 2007 – 2011. The indicators are the percent of occupied housing units with more than one person per room (i.e., crowded housing); the percent of households living below the federal poverty level; the percent of persons in the labor force over the age of 16 years that are unemployed; the percent of persons over the age of 25 years without a high school diploma; the percent of the population under 18 or over 64 years of age (i.e., dependency); and per capita income. Indicators for Chicago as a whole are provided in the final row of the table. See the full dataset description for more information at https://data.cityofchicago.org/api/assets/8D10B9D1-CCA3-4E7E-92C7-5125E9AB46E9.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Socioeconomic Status of CDC Social Vulnerability Index’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/a70b39b7-1151-42dd-b333-d8923d20f22a on 28 January 2022.
--- Dataset description provided by original source is as follows ---
Socioeconomic Status is one of the themes of the Social Vulnerability Index from the CDC. This data set is data from the Social Vulnerability Index. Only the Socioeconomic Status columns are represented in this dataset. Years of data in collection 2014, 2016, 2018
ATSDR’s Geospatial Research, Analysis & Services Program (GRASP) created Centers for Disease Control and Prevention Social Vulnerability Index (CDC SVI or simply SVI, hereafter) to help public health officials and emergency response planners identify and map the communities that will most likely need support before, during, and after a hazardous event.
SVI indicates the relative vulnerability of every U.S. Census tract. Census tracts are subdivisions of counties for which the Census collects statistical data. SVI ranks the tracts on 15 social factors, including unemployment, minority status, and disability, and further groups them into four related themes. Thus, each tract receives a ranking for each Census variable and for each of the four themes, as well as an overall ranking.
In addition to tract-level rankings, SVI 2018 also has corresponding rankings at the county level. Notes below that describe “tract” methods also refer to county methods.
--- Original source retains full ownership of the source dataset ---
What is CDC Social Vulnerability Index?ATSDR’s Geospatial Research, Analysis & Services Program (GRASP) created the Social Vulnerability Index (SVI) to help emergency response planners and public health officials identify and map the communities that will most likely need support before, during, and after a hazardous event.SVI uses U.S Census Data to determine the social vulnerability of every county and tract. CDC SVI ranks each county and tract on 16 social factors, including poverty, lack of vehicle access, and crowded housing, and groups them into four related themes:Theme 1 - Socioeconomic StatusTheme 2 - Household CharacteristicsTheme 3 - Racial & Ethnic Minority StatusTheme 4 - Housing Type & Transportation VariablesFor a detailed description of variable uses, please refer to the full SVI 2020 Documentation.RankingsWe ranked counties and tracts for the entire United States against one another. This feature layer can be used for mapping and analysis of relative vulnerability of counties in multiple states, or across the U.S. as a whole. Rankings are based on percentiles. Percentile ranking values range from 0 to 1, with higher values indicating greater vulnerability. For each county and tract, we generated its percentile rank among all counties and tracts for 1) the sixteen individual variables, 2) the four themes, and 3) its overall position. Overall Rankings:We totaled the sums for each theme, ordered the counties, and then calculated overall percentile rankings. Please note: taking the sum of the sums for each theme is the same as summing individual variable rankings.The overall tract summary ranking variable is RPL_THEMES. Theme rankings:For each of the four themes, we summed the percentiles for the variables comprising each theme. We ordered the summed percentiles for each theme to determine theme-specific percentile rankings. The four summary theme ranking variables are: Socioeconomic Status - RPL_THEME1Household Characteristics - RPL_THEME2Racial & Ethnic Minority Status - RPL_THEME3Housing Type & Transportation - RPL_THEME4FlagsCounties and tracts in the top 10%, i.e., at the 90th percentile of values, are given a value of 1 to indicate high vulnerability. Counties and tracts below the 90th percentile are given a value of 0. For a theme, the flag value is the number of flags for variables comprising the theme. We calculated the overall flag value for each county as the total number of all variable flags. SVI Informational VideosIntroduction to CDC Social Vulnerability Index (SVI)Methods for CDC Social Vulnerability Index (SVI)More Questions?CDC SVI 2020 Full DocumentationSVI Home PageContact the SVI Coordinator
The following report outlines the workflow used to optimize your Find Outliers result:Initial Data Assessment.There were 1684 valid input features.POVERTY Properties:Min0.0000Max91.8000Mean18.9902Std. Dev.12.7152There were 22 outlier locations; these will not be used to compute the optimal fixed distance band.Scale of AnalysisThe optimal fixed distance band was based on the average distance to 30 nearest neighbors: 3709.0000 Meters.Outlier AnalysisCreating the random reference distribution with 499 permutations.There are 1155 output features statistically significant based on a FDR correction for multiple testing and spatial dependence.There are 68 statistically significant high outlier features.There are 84 statistically significant low outlier features.There are 557 features part of statistically significant low clusters.There are 446 features part of statistically significant high clusters.OutputPink output features are part of a cluster of high POVERTY values.Light Blue output features are part of a cluster of low POVERTY values.Red output features represent high outliers within a cluster of low POVERTY values.Blue output features represent low outliers within a cluster of high POVERTY values.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains measures of socioeconomic and demographic characteristics by US census tract 1990-2010. Example measures include population density; population distribution by race, ethnicity, age, and income; and proportion of population living below the poverty level, receiving public assistance, and female-headed families. The dataset also contains a set of index variables to represent neighborhood disadvantage and affluence.
What is CDC Social Vulnerability Index?ATSDR’s Geospatial Research, Analysis & Services Program (GRASP) created the Social Vulnerability Index (SVI) to help emergency response planners and public health officials identify and map the communities that will most likely need support before, during, and after a hazardous event.SVI uses U.S Census Data to determine the social vulnerability of every county and tract. CDC SVI ranks each county and tract on 16 social factors, including poverty, lack of vehicle access, and crowded housing, and groups them into four related themes:Theme 1 - Socioeconomic StatusTheme 2 - Household CharacteristicsTheme 3 - Racial & Ethnic Minority StatusTheme 4 - Housing Type & Transportation VariablesFor a detailed description of variable uses, please refer to the full SVI 2020 Documentation.RankingsWe ranked counties and tracts for the entire United States against one another. This feature layer can be used for mapping and analysis of relative vulnerability of counties in multiple states, or across the U.S. as a whole. Rankings are based on percentiles. Percentile ranking values range from 0 to 1, with higher values indicating greater vulnerability. For each county and tract, we generated its percentile rank among all counties and tracts for 1) the sixteen individual variables, 2) the four themes, and 3) its overall position. Overall Rankings:We totaled the sums for each theme, ordered the counties, and then calculated overall percentile rankings. Please note: taking the sum of the sums for each theme is the same as summing individual variable rankings.The overall tract summary ranking variable is RPL_THEMES. Theme rankings:For each of the four themes, we summed the percentiles for the variables comprising each theme. We ordered the summed percentiles for each theme to determine theme-specific percentile rankings. The four summary theme ranking variables are: Socioeconomic Status - RPL_THEME1Household Characteristics - RPL_THEME2Racial & Ethnic Minority Status - RPL_THEME3Housing Type & Transportation - RPL_THEME4FlagsCounties and tracts in the top 10%, i.e., at the 90th percentile of values, are given a value of 1 to indicate high vulnerability. Counties and tracts below the 90th percentile are given a value of 0. For a theme, the flag value is the number of flags for variables comprising the theme. We calculated the overall flag value for each county as the total number of all variable flags. SVI Informational VideosIntroduction to CDC Social Vulnerability Index (SVI)Methods for CDC Social Vulnerability Index (SVI)More Questions?CDC SVI 2020 Full DocumentationSVI Home PageContact the SVI Coordinator
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
In 2021, 20.1% of people from the Indian ethnic group were in higher managerial and professional occupations – the highest percentage out of all ethnic groups in this socioeconomic group.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Country Socioeconomic Status Scores: 1880-2010’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/sdorius/globses on 14 February 2022.
--- Dataset description provided by original source is as follows ---
This dataset contains estimates of the socioeconomic status (SES) position of each of 149 countries covering the period 1880-2010. Measures of SES, which are in decades, allow for a 130 year time-series analysis of the changing position of countries in the global status hierarchy. SES scores are the average of each country’s income and education ranking and are reported as percentile rankings ranging from 1-99. As such, they can be interpreted similarly to other percentile rankings, such has high school standardized test scores. If country A has an SES score of 55, for example, it indicates that 55 percent of the world’s people live in a country with a lower average income and education ranking than country A. ISO alpha and numeric country codes are included to allow users to merge these data with other variables, such as those found in the World Bank’s World Development Indicators Database and the United Nations Common Database.
See here for a working example of how the data might be used to better understand how the world came to look the way it does, at least in terms of status position of countries.
VARIABLE DESCRIPTIONS: UNID: ISO numeric country code (used by the United Nations) WBID: ISO alpha country code (used by the World Bank) SES: Socioeconomic status score (percentile) based on GDP per capita and educational attainment (n=174) country: Short country name year: Survey year SES: Socioeconomic status score (1-99) for each of 174 countries gdppc: GDP per capita: Single time-series (imputed) yrseduc: Completed years of education in the adult (15+) population popshare: Total population shares
DATA SOURCES:
The dataset was compiled by Shawn Dorius (sdorius@iastate.edu) from a large number of data sources, listed below.
GDP per Capita:
1. Maddison, Angus. 2004. 'The World Economy: Historical Statistics'. Organization for Economic Co-operation and Development: Paris. Maddison population data in 000s; GDP & GDP per capita data in (1990 Geary-Khamis dollars, PPPs of currencies and average prices of commodities). Maddison data collected from: http://www.ggdc.net/MADDISON/Historical_Statistics/horizontal-file_02-2010.xls.
2. World Development Indicators Database
Years of Education
1. Morrisson and Murtin.2009. 'The Century of Education'. Journal of Human Capital(3)1:1-42. Data downloaded from http://www.fabricemurtin.com/
2. Cohen, Daniel & Marcelo Cohen. 2007. 'Growth and human capital: Good data, good results' Journal of economic growth 12(1):51-76. Data downloaded from http://soto.iae-csic.org/Data.htm
3. Barro, Robert and Jong-Wha Lee, 2013, "A New Data Set of Educational Attainment in the World, 1950-2010." Journal of Development Economics, vol 104, pp.184-198. Data downloaded from http://www.barrolee.com/
Total Population
1. Maddison, Angus. 2004. 'The World Economy: Historical Statistics'. Organization for Economic Co-operation and Development: Paris. 13.
2. United Nations Population Division. 2009.
--- Original source retains full ownership of the source dataset ---
Data on county socioeconomic status for 2,132 US counties and each county’s average annual cardiovascular mortality rate (CMR) and total PM2.5 concentration for 21 years (1990-2010). County CMR, PM2.5, and socioeconomic data were obtained from the U.S. National Center for Health Statistics, U.S. Environmental Protection Agency’s Community Multiscale Air Quality modeling system, and the U.S. Census, respectively. A socioeconomic index was created using seven county-level measures from the 1990 US census using factor analysis. Quintiles of this index were used to generate categories of county socioeconomic status. This dataset is associated with the following publication: Wyatt, L., G. Peterson, T. Wade, L. Neas, and A. Rappold. The contribution of improved air quality to reduced cardiovascular mortality: Declines in socioeconomic differences over time. ENVIRONMENT INTERNATIONAL. Elsevier B.V., Amsterdam, NETHERLANDS, 136: 105430, (2020).
A more recent web map on this same topic is available for ArcGIS Online subscribers here.This map shows the socioeconomic status of each census tract. Data come from the US Census Bureau's 2011-2015 American Community Survey. Neighborhood Socioeconomic Status, over and above individual socioeconomic status, is a predictor of many health outcomes. The Neighborhood Socioeconomic Status (NSES) Index is on a scale from 0 to 100 with 50 being the national average around 2010. The Index incorporates the following indicators (fields in this layer's attribute table):Median Household Income (from Table B19013)Percent of individuals with income below the Federal Poverty Line (from Table S1701)The educational attainment of adults (age 25+) (from Table B15003)Unemployment Rate (from Table S2301)Percent of households with children under the age of 18 that are "female-headed" (no male present) (from Table B11005)NSES = log(median household income) + (-1.129 * (log(percent of female-headed households))) + (-1.104 * (log(unemployment rate))) + (-1.974 * (log(percent below poverty))) + .451*((high school grads)+(2*(bachelor's degree holders)))To learn more about how the NSES Index was developed, please explore this journal articleMiles, Jeremy and Weden, Margaret; Lavery, Diana; Escarce, José; Kathleen Cagney; Shih, Regina. 2016. “Constructing a Time-Invariant Measure of the Socio-Economic Status of U.S. Census Tracts.” Journal of Urban Health, vol. 93, issue no.1, pp. 213-232. or this PPT presentation presented at the University of Texas at San Antonio's Applied Demography Conference in 2014.