The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—and provides the largest and richest source of individual level and household data.
All manuscripts (and other items you'd like to publish) must be submitted to
phsdatacore@stanford.edu for approval prior to journal submission.
We will check your cell sizes and citations.
For more information about how to cite PHS and PHS datasets, please visit:
https:/phsdocs.developerhub.io/need-help/citing-phs-data-core
Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.
In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.
The historic US 1920 census data was collected in January 1920. Enumerators collected data traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data was collected. Household members absent on the day of data collected were either listed to the household with the help of other household members or were scheduled for the last census subdivision.
Notes
We provide household and person data separately so that it is convenient to explore the descriptive statistics on each level. In order to obtain a full dataset, merge the household and person on the variables SERIAL and SERIALP. In order to create a longitudinal dataset, merge datasets on the variable HISTID.
Households with more than 60 people in the original data were broken up for processing purposes. Every person in the large households are considered to be in their own household. The original large households can be identified using the variable SPLIT, reconstructed using the variable SPLITHID, and the original count is found in the variable SPLITNUM.
Coded variables derived from string variables are still in progress. These variables include: occupation and industry.
Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, MORTGAGE, FARM, CLASSWKR, OCC1950, IND1950, MARST, RACE, SEX, RELATE, MTONGUE. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.
Most inconsistent information was not edited for this release, thus there are observations outside of the universe for some variables. In particular, the variables GQ, and GQTYPE have known inconsistencies and will be improved with the next release.
%3C!-- --%3E
This dataset was created on 2020-01-10 18:46:34.647
by merging multiple datasets together. The source datasets for this version were:
IPUMS 1920 households: This dataset includes all households from the 1920 US census.
IPUMS 1920 persons: This dataset includes all individuals from the 1920 US census.
IPUMS 1920 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.
In the past four centuries, the population of the United States has grown from a recorded 350 people around the Jamestown colony of Virginia in 1610, to an estimated 331 million people in 2020. The pre-colonization populations of the indigenous peoples of the Americas have proven difficult for historians to estimate, as their numbers decreased rapidly following the introduction of European diseases (namely smallpox, plague and influenza). Native Americans were also omitted from most censuses conducted before the twentieth century, therefore the actual population of what we now know as the United States would have been much higher than the official census data from before 1800, but it is unclear by how much. Population growth in the colonies throughout the eighteenth century has primarily been attributed to migration from the British Isles and the Transatlantic slave trade; however it is also difficult to assert the ethnic-makeup of the population in these years as accurate migration records were not kept until after the 1820s, at which point the importation of slaves had also been illegalized. Nineteenth century In the year 1800, it is estimated that the population across the present-day United States was around six million people, with the population in the 16 admitted states numbering at 5.3 million. Migration to the United States began to happen on a large scale in the mid-nineteenth century, with the first major waves coming from Ireland, Britain and Germany. In some aspects, this wave of mass migration balanced out the demographic impacts of the American Civil War, which was the deadliest war in U.S. history with approximately 620 thousand fatalities between 1861 and 1865. The civil war also resulted in the emancipation of around four million slaves across the south; many of whose ancestors would take part in the Great Northern Migration in the early 1900s, which saw around six million black Americans migrate away from the south in one of the largest demographic shifts in U.S. history. By the end of the nineteenth century, improvements in transport technology and increasing economic opportunities saw migration to the United States increase further, particularly from southern and Eastern Europe, and in the first decade of the 1900s the number of migrants to the U.S. exceeded one million people in some years. Twentieth and twenty-first century The U.S. population has grown steadily throughout the past 120 years, reaching one hundred million in the 1910s, two hundred million in the 1960s, and three hundred million in 2007. In the past century, the U.S. established itself as a global superpower, with the world's largest economy (by nominal GDP) and most powerful military. Involvement in foreign wars has resulted in over 620,000 further U.S. fatalities since the Civil War, and migration fell drastically during the World Wars and Great Depression; however the population continuously grew in these years as the total fertility rate remained above two births per woman, and life expectancy increased (except during the Spanish Flu pandemic of 1918).
Since the Second World War, Latin America has replaced Europe as the most common point of origin for migrants, with Hispanic populations growing rapidly across the south and border states. Because of this, the proportion of non-Hispanic whites, which has been the most dominant ethnicity in the U.S. since records began, has dropped more rapidly in recent decades. Ethnic minorities also have a much higher birth rate than non-Hispanic whites, further contributing to this decline, and the share of non-Hispanic whites is expected to fall below fifty percent of the U.S. population by the mid-2000s. In 2020, the United States has the third-largest population in the world (after China and India), and the population is expected to reach four hundred million in the 2050s.
This dataset includes all households from the 1920 US census.
These data on 19th- and early 20th-century police department and arrest behavior were collected between 1975 and 1978 for a study of police and crime in the United States. Raw and aggregated time-series data are presented in Parts 1 and 3 on 23 American cities for most years during the period 1860-1920. The data were drawn from annual reports of police departments found in the Library of Congress or in newspapers and legislative reports located elsewhere. Variables in Part 1, for which the city is the unit of analysis, include arrests for drunkenness, conditional offenses and homicides, persons dismissed or held, police personnel, and population. Part 3 aggregates the data by year and reports some of these variables on a per capita basis, using a linear interpolation from the last decennial census to estimate population. Part 2 contains data for 267 United States cities for the period 1880-1890 and was generated from the 1880 federal census volume, REPORT ON THE DEFECTIVE, DEPENDENT, AND DELINQUENT CLASSES, published in 1888, and from the 1890 federal census volume, SOCIAL STATISTICS OF CITIES. Information includes police personnel and expenditures, arrests, persons held overnight, trains entering town, and population.
The world's population first reached one billion people in 1803, and reach eight billion in 2023, and will peak at almost 11 billion by the end of the century. Although it took thousands of years to reach one billion people, it did so at the beginning of a phenomenon known as the demographic transition; from this point onwards, population growth has skyrocketed, and since the 1960s the population has increased by one billion people every 12 to 15 years. The demographic transition sees a sharp drop in mortality due to factors such as vaccination, sanitation, and improved food supply; the population boom that follows is due to increased survival rates among children and higher life expectancy among the general population; and fertility then drops in response to this population growth. Regional differences The demographic transition is a global phenomenon, but it has taken place at different times across the world. The industrialized countries of Europe and North America were the first to go through this process, followed by some states in the Western Pacific. Latin America's population then began growing at the turn of the 20th century, but the most significant period of global population growth occurred as Asia progressed in the late-1900s. As of the early 21st century, almost two thirds of the world's population live in Asia, although this is set to change significantly in the coming decades. Future growth The growth of Africa's population, particularly in Sub-Saharan Africa, will have the largest impact on global demographics in this century. From 2000 to 2100, it is expected that Africa's population will have increased by a factor of almost five. It overtook Europe in size in the late 1990s, and overtook the Americas a decade later. In contrast to Africa, Europe's population is now in decline, as birth rates are consistently below death rates in many countries, especially in the south and east, resulting in natural population decline. Similarly, the population of the Americas and Asia are expected to go into decline in the second half of this century, and only Oceania's population will still be growing alongside Africa. By 2100, the world's population will have over three billion more than today, with the vast majority of this concentrated in Africa. Demographers predict that climate change is exacerbating many of the challenges that currently hinder progress in Africa, such as political and food instability; if Africa's transition is prolonged, then it may result in further population growth that would place a strain on the region's resources, however, curbing this growth earlier would alleviate some of the pressure created by climate change.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
This dataset includes all individuals from the 1920 US census.
Sources: U.S. Census Bureau, Census 2020; generated by CCRPC staff; using 2020 Census Demographic Data Map Viewer; https://www.census.gov/library/visualizations/2021/geo/demographicmapviewer.html; (18 August 2021); U.S. Census Bureau; Census 2000, Summary File 1, Table DP-1; generated by CCRPC staff; using American FactFinder; http://factfinder2.census.gov; (30 December 2015). U.S. Census Bureau; Census 2010, Summary File 1, Table P1; generated by CCRPC staff; using American FactFinder; http://factfinder2.census.gov; (30 December 2015). U.S. Census Bureau; 1980 Census of Population, Volume 1: Characteristics of the Population, Chapter A: Number of Inhabitants, Part 15: Illinois, PC80-1-A15, Table 2, Land Area and Population: 1930-1980. U.S. Census Bureau; Fourteenth Census of the United States; State Compendium Illinois, Table 1. - Area and Population of Counties: 1850 to 1920; https://www.census.gov/library/publications/1924/dec/state-compendium.html; (23 August 2018).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Bloomingdale population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Bloomingdale. The dataset can be utilized to understand the population distribution of Bloomingdale by age. For example, using this dataset, we can identify the largest age group in Bloomingdale.
Key observations
The largest age group in Bloomingdale, IL was for the group of age 35-39 years with a population of 1,920 (8.56%), according to the 2021 American Community Survey. At the same time, the smallest age group in Bloomingdale, IL was the 80-84 years with a population of 605 (2.70%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Age groups:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Bloomingdale Population by Age. You can refer the same here
This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Lexington city population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Lexington city. The dataset can be utilized to understand the population distribution of Lexington city by age. For example, using this dataset, we can identify the largest age group in Lexington city.
Key observations
The largest age group in Lexington city, VA was for the group of age 20-24 years with a population of 1,920 (26.35%), according to the 2021 American Community Survey. At the same time, the smallest age group in Lexington city, VA was the 10-14 years with a population of 94 (1.29%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Age groups:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Lexington city Population by Age. You can refer the same here
The estimated population of the U.S. was approximately 334.9 million in 2023, and the largest age group was adults aged 30 to 34. There were 11.88 million males in this age category and around 11.64 million females. Which U.S. state has the largest population? The population of the United States continues to increase, and the country is the third most populous in the world behind China and India. The gender distribution has remained consistent for many years, with the number of females narrowly outnumbering males. In terms of where the residents are located, California was the state with the highest population in 2023. The U.S. population by race and ethnicity The United States is well known the world over for having a diverse population. In 2023, the number of Black or African American individuals was estimated to be 45.76 million, which represented an increase of over four million since the 2010 census. The number of Asian residents has increased at a similar rate during the same time period and the Hispanic population in the U.S. has also continued to grow.
In 1800, the present-day region of Mexico had a population of just over six million people. Mexico gained its independence from the Spanish crown in 1821, and population growth remained steady for the next 85 years. Growth then halted with with the Panic of 1907, an American financial crisis whose ripple effects in Mexico would set the stage for the Mexican Revolution in 1910. This revolution would see population flatline at just over fifteen million between 1910 and 1920, as widespread conflict and result in the death of between 1.7 to 2.7 million over the decade, and the coinciding 1918 Spanish Flu epidemic would see the loss of another 300,000 in this time period. Following the end of both the Mexican Revolution and the Spanish Flu epidemic in 1920, the population of Mexico would begin to increase rapidly as modernization would see mortality rates fall and standards of living rise throughout the country. This growth has continued steadily into the 21st century, and in 2020, Mexico is estimated to have a population of just under 129 million.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Lexington population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Lexington. The dataset can be utilized to understand the population distribution of Lexington by age. For example, using this dataset, we can identify the largest age group in Lexington.
Key observations
The largest age group in Lexington, VA was for the group of age 20-24 years with a population of 1,920 (26.35%), according to the 2021 American Community Survey. At the same time, the smallest age group in Lexington, VA was the 10-14 years with a population of 94 (1.29%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Age groups:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Lexington Population by Age. You can refer the same here
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Chart and table of population level and growth rate for the state of California from 1900 to 2024.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Pickens County population distribution across 18 age groups. It lists the population in each age group along with the percentage population relative of the total population for Pickens County. The dataset can be utilized to understand the population distribution of Pickens County by age. For example, using this dataset, we can identify the largest age group in Pickens County.
Key observations
The largest age group in Pickens County, SC was for the group of age 20-24 years with a population of 15,311 (11.81%), according to the 2021 American Community Survey. At the same time, the smallest age group in Pickens County, SC was the 85+ years with a population of 1,920 (1.48%). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Age groups:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Pickens County Population by Age. You can refer the same here
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
This US History GeoInquiry is designed to enhance teaching the "Dust Bowl" in US History classes. The activity uses a web-based map and is tied to the C3 Framework.In this activity, teachers will lead students as they explore the Dust Bowl region using population change, agriculture, and precipitation data. Learning outcomes:
Students will be able to analyze the effect of climate on population.Students will be able to analyze the change in California’s population relative to the
change in the Dust Bowl states’ population during the 1920s and 1930s. Find more US History GeoInquiries here or explore all GeoInquiries at https://www.esri.com/geoinquiries
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—and provides the largest and richest source of individual level and household data.
All manuscripts (and other items you'd like to publish) must be submitted to
phsdatacore@stanford.edu for approval prior to journal submission.
We will check your cell sizes and citations.
For more information about how to cite PHS and PHS datasets, please visit:
https:/phsdocs.developerhub.io/need-help/citing-phs-data-core
Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.
In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.
The historic US 1920 census data was collected in January 1920. Enumerators collected data traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data was collected. Household members absent on the day of data collected were either listed to the household with the help of other household members or were scheduled for the last census subdivision.
Notes
We provide household and person data separately so that it is convenient to explore the descriptive statistics on each level. In order to obtain a full dataset, merge the household and person on the variables SERIAL and SERIALP. In order to create a longitudinal dataset, merge datasets on the variable HISTID.
Households with more than 60 people in the original data were broken up for processing purposes. Every person in the large households are considered to be in their own household. The original large households can be identified using the variable SPLIT, reconstructed using the variable SPLITHID, and the original count is found in the variable SPLITNUM.
Coded variables derived from string variables are still in progress. These variables include: occupation and industry.
Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, MORTGAGE, FARM, CLASSWKR, OCC1950, IND1950, MARST, RACE, SEX, RELATE, MTONGUE. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.
Most inconsistent information was not edited for this release, thus there are observations outside of the universe for some variables. In particular, the variables GQ, and GQTYPE have known inconsistencies and will be improved with the next release.
%3C!-- --%3E
This dataset was created on 2020-01-10 18:46:34.647
by merging multiple datasets together. The source datasets for this version were:
IPUMS 1920 households: This dataset includes all households from the 1920 US census.
IPUMS 1920 persons: This dataset includes all individuals from the 1920 US census.
IPUMS 1920 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.