This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1900 datasets.
The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—and provides the largest and richest source of individual level and household data.
All manuscripts (and other items you'd like to publish) must be submitted to
phsdatacore@stanford.edu for approval prior to journal submission.
We will check your cell sizes and citations.
For more information about how to cite PHS and PHS datasets, please visit:
https:/phsdocs.developerhub.io/need-help/citing-phs-data-core
This dataset was created on 2020-01-10 22:52:11.461
by merging multiple datasets together. The source datasets for this version were:
IPUMS 1930 households: This dataset includes all households from the 1930 US census.
IPUMS 1930 persons: This dataset includes all individuals from the 1930 US census.
IPUMS 1930 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1930 datasets.
Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.
In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier. In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.
The historic US 1930 census data was collected in April 1930. Enumerators collected data traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data was collected. Household members absent on the day of data collected were either listed to the household with the help of other household members or were scheduled for the last census subdivision.
Notes
We provide IPUMS household and person data separately so that it is convenient to explore the descriptive statistics on each level. In order to obtain a full dataset, merge the household and person on the variables SERIAL and SERIALP. In order to create a longitudinal dataset, merge datasets on the variable HISTID.
Households with more than 60 people in the original data were broken up for processing purposes. Every person in the large households are considered to be in their own household. The original large households can be identified using the variable SPLIT, reconstructed using the variable SPLITHID, and the original count is found in the variable SPLITNUM.
Coded variables derived from string variables are still in progress. These variables include: occupation and industry.
Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGEMARR, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, FARM, EMPSTAT, OCC1950, IND1950, MTONGUE, MARST, RACE, SEX, RELATE, CLASSWKR. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.
Most inconsistent information was not edite
https://www.icpsr.umich.edu/web/ICPSR/studies/8236/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/8236/terms
The 1940 Census Public Use Microdata Sample Project was assembled through a collaborative effort between the United States Bureau of the Census and the Center for Demography and Ecology at the University of Wisconsin. The collection contains a stratified 1-percent sample of households, with separate records for each household, for each "sample line" respondent, and for each person in the household. These records were encoded from microfilm copies of original handwritten enumeration schedules from the 1940 Census of Population. Geographic identification of the location of the sampled households includes Census regions and divisions, states (except Alaska and Hawaii), standard metropolitan areas (SMAs), and state economic areas (SEAs). Accompanying the data collection is a codebook that includes an abstract, descriptions of sample design, processing procedures and file structure, a data dictionary (record layout), category code lists, and a glossary. Also included is a procedural history of the 1940 Census. Each of the 20 subsamples contains three record types: household, sample line, and person. Household variables describe the location and condition of the household. The sample line records contain variables describing demographic characteristics such as nativity, marital status, number of children, veteran status, wage deductions for Social Security, and occupation. Person records also contain variables describing demographic characteristics including nativity, marital status, family membership, education, employment status, income, and occupation.
Website alows the public full access to the 1940 Census images, census maps and descriptions.
https://www.icpsr.umich.edu/web/ICPSR/studies/2877/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/2877/terms
This data collection, Aging of Veterans of the Union Army: Surgeons' Certificates, United States, 1862-1940, constitutes a portion of the historical data collected by the project "Early Indicators of Later Work Levels, Disease, and Death." With the goal of constructing datasets suitable for longitudinal analyses of factors affecting the aging process, the project collects military, medical, and socioeconomic data on a sample of white males mustered into the Union Army during the Civil War. The surgeons' certificates contain information from examining physicians to determine eligibility for pension benefits. Also included are questions regarding the age, occupation, residence, and military experience of the veterans. These data can be linked to "Aging of Veterans of the Union Army: Military, Pension, and Medical Records, 1820-1940" (ICPSR 6837) and "Aging of Veterans of the Union Army: United States Federal Census Records, 1850, 1860, 1900, 1910" (ICPSR 6836) using the variable "recidnum."
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
U.S. Census Bureau Index of Economic Activity - IDEA data was reported at -0.700 % in Apr 2025. This records a decrease from the previous number of 0.600 % for Mar 2025. U.S. Census Bureau Index of Economic Activity - IDEA data is updated monthly, averaging 0.070 % from Aug 2004 (Median) to Apr 2025, with 249 observations. The data reached an all-time high of 2.540 % in Mar 2022 and a record low of -7.710 % in Apr 2020. U.S. Census Bureau Index of Economic Activity - IDEA data remains active status in CEIC and is reported by U.S. Census Bureau. The data is categorized under Global Database’s United States – Table US.A: U.S. Census Bureau Index of Economic Activity.
The Bureau of the Census has released Census 2000 Summary File 1 (SF1) 100-Percent data. The file includes the following population items: sex, age, race, Hispanic or Latino origin, household relationship, and household and family characteristics. Housing items include occupancy status and tenure (whether the unit is owner or renter occupied). SF1 does not include information on incomes, poverty status, overcrowded housing or age of housing. These topics will be covered in Summary File 3. Data are available for states, counties, county subdivisions, places, census tracts, block groups, and, where applicable, American Indian and Alaskan Native Areas and Hawaiian Home Lands. The SF1 data are available on the Bureau's web site and may be retrieved from American FactFinder as tables, lists, or maps. Users may also download a set of compressed ASCII files for each state via the Bureau's FTP server. There are over 8000 data items available for each geographic area. The full listing of these data items is available here as a downloadable compressed data base file named TABLES.ZIP. The uncompressed is in FoxPro data base file (dbf) format and may be imported to ACCESS, EXCEL, and other software formats. While all of this information is useful, the Office of Community Planning and Development has downloaded selected information for all states and areas and is making this information available on the CPD web pages. The tables and data items selected are those items used in the CDBG and HOME allocation formulas plus topics most pertinent to the Comprehensive Housing Affordability Strategy (CHAS), the Consolidated Plan, and similar overall economic and community development plans. The information is contained in five compressed (zipped) dbf tables for each state. When uncompressed the tables are ready for use with FoxPro and they can be imported into ACCESS, EXCEL, and other spreadsheet, GIS and database software. The data are at the block group summary level. The first two characters of the file name are the state abbreviation. The next two letters are BG for block group. Each record is labeled with the code and name of the city and county in which it is located so that the data can be summarized to higher-level geography. The last part of the file name describes the contents . The GEO file contains standard Census Bureau geographic identifiers for each block group, such as the metropolitan area code and congressional district code. The only data included in this table is total population and total housing units. POP1 and POP2 contain selected population variables and selected housing items are in the HU file. The MA05 table data is only for use by State CDBG grantees for the reporting of the racial composition of beneficiaries of Area Benefit activities. The complete package for a state consists of the dictionary file named TABLES, and the five data files for the state. The logical record number (LOGRECNO) links the records across tables.
https://www.icpsr.umich.edu/web/ICPSR/studies/7923/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/7923/terms
This data collection consists of modified records from CENSUS OF POPULATION AND HOUSING, 1970 [UNITED STATES]: PUBLIC USE SAMPLES (ICPSR 0018). The original records consisted of 120-character household records and 120-character person records, whereas the new modified records are rectangular (each person record is combined with the corresponding household record) with a length of 188, after the deletion of some items. Additional information was added to the data records, including typical educational requirement for current occupation, occupational prestige score, and group identification code. This version also differs from the original public use census samples in other ways: persons aged 15-75 were included, no majority males were included, but the majority males from CENSUS OF POPULATION AND HOUSING [UNITED STATES], 1970 PUBLIC USE SAMPLE: MODIFIED 1/1000 5% STATE SAMPLES (ICPSR 7922) were included for convenience, 10 percent of the Black population from each file was included, and Mexican Americans (identified by a Spanish surname) from outside the five southwestern states of Arizona, California, Colorado, New Mexico, and Texas were not included in this file. Variables provide information on the housing unit, such as occupancy and vacancy status of house, value of property, commercial use, ratio of rent and property value to family income, availability of plumbing facilities, sewage disposal, complete kitchen facilities, heating facilities, flush toilet, water, television, and telephone. Data are also provided on household characteristics such as household size, family size, and household relationships. Other demographic variables specify age, sex, place of birth, state of residence, Spanish descent, marital status, race, veteran status, income, and ratio of family income to poverty cutoff level. This collection was made available by the National Chicano Research Network of the Institute for Social Research, University of Michigan. See the related collection, CENSUS OF POPULATION AND HOUSING [UNITED STATES], 1970 PUBLIC USE SAMPLE: MODIFIED 1/1000 5% STATE SAMPLES (ICPSR 7922).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. This project contains the files necessary to closely replicate the links between the 1900 and 1910 censuses. For more information, consult the included Read Me file, and visit https://censustree.org.
https://www.icpsr.umich.edu/web/ICPSR/studies/7756/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/7756/terms
This collection contains individual-level and 1-percent national sample data from the 1960 Census of Population and Housing conducted by the Census Bureau. It consists of a representative sample of the records from the 1960 sample questionnaires. The data are stored in 30 separate files, containing in total over two million records, organized by state. Some files contain the sampled records of several states while other files contain all or part of the sample for a single state. There are two types of records stored in the data files: one for households and one for persons. Each household record is followed by a variable number of person records, one for each of the household members. Data items in this collection include the individual responses to the basic social, demographic, and economic questions asked of the population in the 1960 Census of Population and Housing. Data are provided on household characteristics and features such as the number of persons in household, number of rooms and bedrooms, and the availability of hot and cold piped water, flush toilet, bathtub or shower, sewage disposal, and plumbing facilities. Additional information is provided on tenure, gross rent, year the housing structure was built, and value and location of the structure, as well as the presence of air conditioners, radio, telephone, and television in the house, and ownership of an automobile. Other demographic variables provide information on age, sex, marital status, race, place of birth, nationality, education, occupation, employment status, income, and veteran status. The data files were obtained by ICPSR from the Center for Social Analysis, Columbia University.
1930 United States Federal Census contains records from Swanzey, Cheshire, New Hampshire, USA by Census Place: Swanzey, Cheshire, New Hampshire; Page: 7B; Enumeration District: 0029; FHL microfilm: 2341034 - .
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Starting in July, data.census.gov will be the primary way to access Census Bureau data, including upcoming releases from the 2018 American Community Survey, 2017 Economic Census, 2020 Census and more. After July 1, 2019, all new data (previously released on American FactFinder) will be released on this new data platform. (https://factfinder.census.gov/faces/nav/jsf/pages/index.xhtml)
1920 United States Federal Census contains records from Philadelphia, Pennsylvania, USA by Fourteenth Census of the United States, 1920. (NARA microfilm publication T625, 2076 rolls). Records of the Bureau of the Census, Record Group 29. National Archives, Washington, D.C. Year: 1920; Census Place: Philadelphia Ward 42, Philadelphia, Pennsylvania; Roll: T625_1643; Page: 13A; Enumeration District: 1564 - .
1940 United States Federal Census contains records from Philadelphia, Pennsylvania, USA by United States of America, Bureau of the Census. Sixteenth Census of the United States, 1940. Washington, D.C.: National Archives and Records Administration, 1940. T627, 4,643 rolls. Year: 1940; Census Place: Upper Dublin, Montgomery, Pennsylvania; Roll: m-t0627-03585; Page: 20B; Enumeration District: 46-208 - .
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
From website:
The 1940 census records were released by the US National Archives April 2, 2012, and brought online through a partnership with Archives.com. This website allows you full access to the 1940 census images, in addition to 1940 census maps and descriptions.
https://creativecommons.org/share-your-work/public-domain/pdmhttps://creativecommons.org/share-your-work/public-domain/pdm
The U.S. Census Bureau TIGER/Line® files in this data collection were originally distributed by the Inter-university Consortium for Political and Social Research (ICPSR) through its TIGER/Line file web site, which was decommissioned in 2018 (archived version: https://web.archive.org/web/20090924181858/http://www.icpsr.umich.edu/TIGER/index.html). There, users could download various versions of the U.S. Census Bureau's TIGER (Topologically Integrated Geographic Encoding and Referencing) database. The TIGER/Line files do not include demographic data, but they do contain geographic information that can be linked to the Census Bureau’s demographic data. Due to file number limitations in openICPSR, the original data collections have been bundled into single zip packages. A single TIGER_directory.txt file listing the original files and the original directory structure is included with the root directory. Documentation files are also included as standalone subdirectories in each collection so users do not need to download entire zip bundles to view documentation. The TIGER/Line data are stored in compressed format in subdirectories by state name. There is one TIGER/Line file (in a compressed format) for each county or county equivalent. The file names consist of TGR + the 2-digit state FIPS (Federal Information Processing Standards) code + the 3-digit county FIPS code (i.e. TGR01031.ZIP for Coffee County, Alabama). Each state folder contains individual county files.The individual county files include one file for each record type included for that county with the following name convention: tgr01031.rt1. The convention follows the order described above with each file having a suffix which includes 'rt' (record type) followed by its designation (in this case record type 1). Each county file also contains its own metadata record.If present, documentation files for the TIGER/Line data are stored in a directory named '0docs' which is located in the 'Parent Directory'. This directory appears at the top of the index of state subdirectories for each edition of the TIGER/Line files. The documentation includes a complete list of FIPS state and county codes.
1940 US Census contains records from Montpelier, Washington, Vermont, USA by Ancestry.com. 1940 United States Federal Census [database on-line]. Provo, UT, USA: Ancestry.com Operations, Inc., 2012.; Year: 1940; Census Place: Montpelier, Washington, Vermont; Roll: m-t0627-04238; Page: 3B; Enumeration District: 12-31; Original data: United States of America, Bureau of the Census. Sixteenth Census of the United States, 1940. Washington, D.C.: National Archives and Records Administration, 1940. T627, 4,643 rolls. - .
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
https://www.icpsr.umich.edu/web/ICPSR/studies/7924/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/7924/terms
This collection consists of modified records from CENSUS OF POPULATION AND HOUSING, 1960 PUBLIC USE SAMPLE [UNITED STATES]: ONE-IN-ONE HUNDRED SAMPE (ICPSR 7756). The original records consisted of 120-character household records and 120-character person records, whereas the new modified records are rectangular (each person record is combined with the corresponding household record) with a length of 188, after the deletion of some items. Additional information was added to the data records including typical educational requirement for current occupation, occupational prestige score, and group identification code. This version differs from the original public-use sample in the following ways: ages of persons 15-74 are included, 10 percent of the Black population from each file is included, and Mexican Americans (identified by a Spanish surname) from outside Arizona, California, Colorado, New Mexico, and Texas are not included. This dataset uses the 1970 equivalent occupational codes. The Census Bureau originally used two separate codes for the 1970 and 1960 files, but these have been modified and are now identical.
https://www.icpsr.umich.edu/web/ICPSR/studies/2863/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/2863/terms
The objective of this data collection was to examine inequalities of wealth and the geographic distribution of wealthy individuals in late 18th- and early 19th-century New York and to investigate wealth in relationship to occupation and location. For this study, the entire set of tax assessment records and United States Census records for New York City were computerized and occupational status was added for all entries. The collection addresses topics such as social class structure, demographic factors, occupational status and geographic distribution, property values and geographic distribution, and the relationship of these factors to the political system. Units of analysis were individual property owners and renters for the tax assessment data and heads of households for the census data. Data collected included the individual's name, address, occupation, sex, and race, the type, quantity, and value of real and personal property, and the type and occupancy of the structure at the address. Occupational data from city directories were used to supplement the tax and census data.
This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1900 datasets.