Historical record of Arlington population as captured by the 1920 census record.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—and provides the largest and richest source of individual level and household data.
All manuscripts (and other items you'd like to publish) must be submitted to
phsdatacore@stanford.edu for approval prior to journal submission.
We will check your cell sizes and citations.
For more information about how to cite PHS and PHS datasets, please visit:
https:/phsdocs.developerhub.io/need-help/citing-phs-data-core
Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.
In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.
The historic US 1920 census data was collected in January 1920. Enumerators collected data traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data was collected. Household members absent on the day of data collected were either listed to the household with the help of other household members or were scheduled for the last census subdivision.
Notes
We provide household and person data separately so that it is convenient to explore the descriptive statistics on each level. In order to obtain a full dataset, merge the household and person on the variables SERIAL and SERIALP. In order to create a longitudinal dataset, merge datasets on the variable HISTID.
Households with more than 60 people in the original data were broken up for processing purposes. Every person in the large households are considered to be in their own household. The original large households can be identified using the variable SPLIT, reconstructed using the variable SPLITHID, and the original count is found in the variable SPLITNUM.
Coded variables derived from string variables are still in progress. These variables include: occupation and industry.
Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, MORTGAGE, FARM, CLASSWKR, OCC1950, IND1950, MARST, RACE, SEX, RELATE, MTONGUE. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.
Most inconsistent information was not edited for this release, thus there are observations outside of the universe for some variables. In particular, the variables GQ, and GQTYPE have known inconsistencies and will be improved with the next release.
%3C!-- --%3E
This dataset was created on 2020-01-10 18:46:34.647
by merging multiple datasets together. The source datasets for this version were:
IPUMS 1920 households: This dataset includes all households from the 1920 US census.
IPUMS 1920 persons: This dataset includes all individuals from the 1920 US census.
IPUMS 1920 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Historical population as enumerated and corrected from 1790 through 2020. North Carolina was one of the 13 original States and by the time of the 1790 census had essentially its current boundaries. The Census is mandated by the United States Constitution and was first completed for 1790. The population has been counted every ten years hence, with some limitations. In 1790 census coverage included most of the State, except for areas in the west, parts of which were not enumerated until 1840. The population for 1810 includes Walton County, enumerated as part of Georgia although actually within North Carolina. Historical populations shown here reflect the population of the respective named county and not necessarily the population of the area of the county as it was defined for a particular census. County boundaries shown in maps reflect boundaries as defined in 2020. Historic boundaries for some counties may include additional geographic areas or may be smaller than the current geographic boundaries. Notes below list the county or counties with which the population of a currently defined county were enumerated historically (Current County: Population counted in). The current 100 counties have been in place since the 1920 Census, although some modifications to the county boundaries have occurred since that time. For historical county boundaries see: Atlas of Historical County Boundaries Project (newberry.org)County Notes: Note 1: Total for 1810 includes population (1,026) of Walton County, reported as a Georgia county but later determined to be situated in western North Carolina. Total for 1890 includes 2 Indians in prison, not reported by county. Note 2: Alexander: *Iredell, Burke, Wilkes. Note 3: Avery: *Caldwell, Mitchell, Watauga. Note 4: Buncombe: *Burke, Rutherford; see also note 22. Note 5: Caldwell: *Burke, Wilkes, Yancey. Note 6: Cleveland: *Rutherford, Lincoln. Note 7: Columbus: *Bladen, Brunswick. Note 8: Dare: *Tyrrell, Currituck, Hyde. Note 9: Hoke: *Cumberland, Robeson. Note 10: Jackson: *Macon, Haywood. Note 11: Lee: *Moore, Chatham. Note 12: Lenoir: *Dobbs (Greene); Craven. Note 13: McDowell: *Burke, Rutherford. Note 14: Madison: *Buncombe, Yancey. Note 15: Mitchell: *Yancey, Watauga. Note 16: Pamlico: *Craven, Beaufort. Note 17: Polk: *Rutherford, Henderson. Note 18: Swain: *Jackson, Macon. Note 19: Transylvania: *Henderson, Jackson. Note 20: Union: *Mecklenburg, Anson. Note 21: Vance: *Granville, Warren, Franklin. Note 22: Walton: Created in 1803 as a Georgia county and reported in 1810 as part of Georgia; abolished after a review of the State boundary determined that its area was located in North Carolina. By 1820 it was part of Buncombe County. Note 23: Watauga: *Ashe, Yancey, Wilkes; Burke. Note 24: Wilson: *Edgecombe, Nash, Wayne, Johnston. Note 25: Yancey: *Burke, Buncombe. Note 26: Alleghany: *Ashe. Note 27: Haywood: *Buncombe. Note 28: Henderson: *Buncombe. Note 29: Person: Caswell. Note 30: Clay: Cherokee. Note 31: Graham: Cherokee. Note 32: Harnett: Cumberland. Note 33: Macon: Haywood.
Note 34: Catawba: Lincoln. Note 35: Gaston: Lincoln. Note 36: Cabarrus: Mecklenburg.
Note 37: Stanly: Montgomery. Note 38: Pender: New Hanover. Note 39: Alamance: Orange.
Note 40: Durham: Orange, Wake. Note 41: Scotland: Richmond. Note 42: Davidson: Rowan. Note 43: Davie: Rowan.Note 44: Forsyth: Stokes. Note 45: Yadkin: Surry.
Note 46: Washington: Tyrrell.Note 47: Ashe: Wilkes. Part III. Population of Counties, Earliest Census to 1990The 1840 population of Person County, NC should be 9,790. The 1840 population of Perquimans County, NC should be 7,346.
1920 United States Federal Census contains records from Philadelphia, Pennsylvania, USA by Fourteenth Census of the United States, 1920. (NARA microfilm publication T625, 2076 rolls). Records of the Bureau of the Census, Record Group 29. National Archives, Washington, D.C. Year: 1920; Census Place: Philadelphia Ward 42, Philadelphia, Pennsylvania; Roll: T625_1643; Page: 13A; Enumeration District: 1564 - .
This dataset includes all households from the 1920 US census.
https://www.icpsr.umich.edu/web/ICPSR/studies/4344/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/4344/terms
The data comprising the Puerto Rico Census Project, 1920 contain individual and household records drawn from the 1920 Puerto Rican Population Census. The data include variables containing basic demographic information such as age, sex, race, marital status, number of children born and surviving, family size, place of birth, immigration status, county and neighborhood of residence, urban/rural status, and citizenship. The data also describe language proficiency, literacy, school attendance, and disabilities (blind or deaf) of the individuals. Other variables provide data on occupation, industry, ownership of residence, status of mortgage, and farm ownership. There are four classifications of variables belonging to this dataset: original input variables, coded variables, constructed variables, and quality flag variables. The original input variables contain the raw data collected by the enumerators. The coded variables are variables that were recoded by the University of Wisconsin Survey Center (UWSC) as part of the Puerto Rico Census Project. Constructed variables were produced by UWSC to capture additional relevant information. For example, one constructed variable measures literacy by combining separate variables containing data on whether the individual could read and if they could write. Finally, quality flag variables were created by UWSC to indicate whether it could be logically deduced that individual records had been hand edited by the Census Office.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
The 1st Population Census. In order to clarify the state of Japan’s population and households, the population census has been conducted in Japan almost every five years.More details on the "Population Census of Japan" overall including other years can be found here: https://d-infra.ier.hit-u.ac.jp/Japanese/statistical-yb/b001.html. Unlike in recent censuses, individuals were not counted at their place of usual residence but at the place where they were at 0:00h on October 1st.
This dataset includes all individuals from the 1920 US census.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Japan Population Census: Age 30 to 34 Years data was reported at 7,290,878.000 Person in 2015. This records a decrease from the previous number of 8,341,497.000 Person for 2010. Japan Population Census: Age 30 to 34 Years data is updated yearly, averaging 7,681,689.000 Person from Dec 1920 (Median) to 2015, with 20 observations. The data reached an all-time high of 10,771,731.000 Person in 1980 and a record low of 3,609,450.000 Person in 1920. Japan Population Census: Age 30 to 34 Years data remains active status in CEIC and is reported by Statistical Bureau. The data is categorized under Global Database’s Japan – Table JP.G002: Population: Annual.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset is based on two separate censuses of the Netherlands of 1920, population census (7 vols) and occupational census (3 vols).Content: images of the publication, pdf files of the text sections and excel files with data entered from the published tables.
This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.
This crosswalk consists of individuals matched between the 1900 and 1920 complete-count US Censuses. Within the crosswalk, users have the option to select the linking method with which these matches were created. This version of the crosswalk contains links made by the ABE-exact (conservative and standard) method, the ABE-NYSIIS (conservative and standard) method and the ABE-NYSIIS (conservative and standard) method where race is used as a matching variable. This crosswalk also includes Census Tree Links created by Joseph Price, Kasey Buckles and Mark Clement at the Brigham Young University (BYU) Record Linking Lab. More detail on these links can be found in the census_tree_links_BYU_readme. For any chosen method, users can merge into this crosswalk a wide set of individual- and household-level variables provided publicly by IPUMS, thereby creating a historical longitudinal dataset for analysis.
These data on 19th- and early 20th-century police department and arrest behavior were collected between 1975 and 1978 for a study of police and crime in the United States. Raw and aggregated time-series data are presented in Parts 1 and 3 on 23 American cities for most years during the period 1860-1920. The data were drawn from annual reports of police departments found in the Library of Congress or in newspapers and legislative reports located elsewhere. Variables in Part 1, for which the city is the unit of analysis, include arrests for drunkenness, conditional offenses and homicides, persons dismissed or held, police personnel, and population. Part 3 aggregates the data by year and reports some of these variables on a per capita basis, using a linear interpolation from the last decennial census to estimate population. Part 2 contains data for 267 United States cities for the period 1880-1890 and was generated from the 1880 federal census volume, REPORT ON THE DEFECTIVE, DEPENDENT, AND DELINQUENT CLASSES, published in 1888, and from the 1890 federal census volume, SOCIAL STATISTICS OF CITIES. Information includes police personnel and expenditures, arrests, persons held overnight, trains entering town, and population.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundAddressing contemporary anti-Asian racism and its impacts on health requires understanding its historical roots, including discriminatory restrictions on immigration, citizenship, and land ownership. Archival secondary data such as historical census records provide opportunities to quantitatively analyze structural dynamics that affect the health of Asian immigrants and Asian Americans. Census data overcome weaknesses of other data sources, such as small sample size and aggregation of Asian subgroups. This article explores the strengths and limitations of early twentieth-century census data for understanding Asian Americans and structural racism.MethodsWe used California census data from three decennial census spanning 1920–1940 to compare two criteria for identifying Asian Americans: census racial categories and Asian surname lists (Chinese, Indian, Japanese, Korean, and Filipino) that have been validated in contemporary population data. This paper examines the sensitivity and specificity of surname classification compared to census-designated “color or race” at the population level.ResultsSurname criteria were found to be highly specific, with each of the five surname lists having a specificity of over 99% for all three census years. The Chinese surname list had the highest sensitivity (ranging from 0.60–0.67 across census years), followed by the Indian (0.54–0.61) and Japanese (0.51–0.62) surname lists. Sensitivity was much lower for Korean (0.40–0.45) and Filipino (0.10–0.21) surnames. With the exception of Indian surnames, the sensitivity values of surname criteria were lower for the 1920–1940 census data than those reported for the 1990 census. The extent of the difference in sensitivity and trends across census years vary by subgroup.DiscussionSurname criteria may have lower sensitivity in detecting Asian subgroups in historical data as opposed to contemporary data as enumeration procedures for Asians have changed across time. We examine how the conflation of race, ethnicity, and nationality in the census could contribute to low sensitivity of surname classification compared to census-designated “color or race.” These results can guide decisions when operationalizing race in the context of specific research questions, thus promoting historical quantitative study of Asian American experiences. Furthermore, these results stress the need to situate measures of race and racism in their specific historical context.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Japan Population Census: Age 25 to 29 Years data was reported at 6,409,612.000 Person in 2015. This records a decrease from the previous number of 7,293,701.000 Person for 2010. Japan Population Census: Age 25 to 29 Years data is updated yearly, averaging 7,743,240.500 Person from Dec 1920 (Median) to 2015, with 20 observations. The data reached an all-time high of 10,794,583.000 Person in 1975 and a record low of 3,923,949.000 Person in 1920. Japan Population Census: Age 25 to 29 Years data remains active status in CEIC and is reported by Statistical Bureau. The data is categorized under Global Database’s Japan – Table JP.G002: Population: Annual.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Japan Population Census: Female: Age 65 Years & Over data was reported at 18,979,972.000 Person in 2015. This records an increase from the previous number of 16,775,273.000 Person for 2010. Japan Population Census: Female: Age 65 Years & Over data is updated yearly, averaging 3,820,838.000 Person from Dec 1920 (Median) to 2015, with 20 observations. The data reached an all-time high of 18,979,972.000 Person in 2015 and a record low of 1,638,915.000 Person in 1920. Japan Population Census: Female: Age 65 Years & Over data remains active status in CEIC and is reported by Statistical Bureau. The data is categorized under Global Database’s Japan – Table JP.G002: Population: Annual.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Japan Population Census: Age 20 to 24 Years data was reported at 5,968,127.000 Person in 2015. This records a decrease from the previous number of 6,426,433.000 Person for 2010. Japan Population Census: Age 20 to 24 Years data is updated yearly, averaging 7,822,781.000 Person from Dec 1920 (Median) to 2015, with 20 observations. The data reached an all-time high of 10,727,636.000 Person in 1970 and a record low of 4,609,310.000 Person in 1920. Japan Population Census: Age 20 to 24 Years data remains active status in CEIC and is reported by Statistical Bureau. The data is categorized under Global Database’s Japan – Table JP.G002: Population: Annual.
Historical record of Arlington population as captured by the 1920 census record.