Website allows the public full access to the 1940 Census images, census maps and descriptions.
The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations, Ancestry.com and FamilySearch, and provide the largest and richest source of individual-level and household-level data.
All manuscripts (and other items you'd like to publish) must be submitted to
phsdatacore@stanford.edu for approval prior to journal submission.
We will check your cell sizes and citations.
For more information about how to cite PHS and PHS datasets, please visit:
https://phsdocs.developerhub.io/need-help/citing-phs-data-core
Historic data are scarce and often exist only in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefit their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.
In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.
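The mother's age at birth mentioned above can be sketched as follows. This is a minimal illustration, not the documented procedure: it assumes each person record carries a within-household person number (PERNUM) and a pointer to the mother's person number (MOMLOC, with 0 meaning no mother in the household). Neither variable is named in this description, so treat both as assumptions. Because AGE is recorded in whole years, the result is approximate.

```python
def mothers_age_at_birth(persons):
    """Return {(SERIALP, child PERNUM): mother's approximate age at the child's birth}."""
    # Index every person by household serial and person number.
    by_key = {(p["SERIALP"], p["PERNUM"]): p for p in persons}
    result = {}
    for child in persons:
        if child["MOMLOC"] == 0:  # assumed coding: no mother present in the household
            continue
        mother = by_key.get((child["SERIALP"], child["MOMLOC"]))
        if mother is not None:
            result[(child["SERIALP"], child["PERNUM"])] = mother["AGE"] - child["AGE"]
    return result


# Toy records, invented for illustration only.
toy_persons = [
    {"SERIALP": 1, "PERNUM": 1, "MOMLOC": 0, "AGE": 30},
    {"SERIALP": 1, "PERNUM": 2, "MOMLOC": 1, "AGE": 4},
]
ages_at_birth = mothers_age_at_birth(toy_persons)
```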
The historic US 1920 census data were collected in January 1920. Enumerators collected data by traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data were collected. Household members absent on the day of data collection were either listed with the household with the help of other household members or were scheduled for the last census subdivision.
Notes
We provide household and person data separately so that it is convenient to explore the descriptive statistics at each level. To obtain a full dataset, merge the household and person datasets on the variables SERIAL and SERIALP. To create a longitudinal dataset, merge datasets on the variable HISTID.
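The two merges described above might look like the sketch below. It assumes the household file carries SERIAL and the person file carries SERIALP as the matching key, and it uses plain Python dictionaries rather than any particular data library. The function names and toy values are hypothetical.

```python
def merge_households_persons(households, persons):
    """Attach household fields to each person record (many persons to one household)."""
    by_serial = {h["SERIAL"]: h for h in households}
    merged = []
    for p in persons:
        h = by_serial.get(p["SERIALP"])
        if h is None:
            continue  # person without a matching household record is skipped
        merged.append({**h, **p})
    return merged


def link_over_time(persons_a, persons_b):
    """Pair person records from two census years that share the same HISTID."""
    by_histid = {p["HISTID"]: p for p in persons_b}
    return [(p, by_histid[p["HISTID"]]) for p in persons_a if p["HISTID"] in by_histid]


# Toy records, invented for illustration only.
households = [{"SERIAL": 1, "FARM": 1}, {"SERIAL": 2, "FARM": 2}]
persons = [
    {"SERIALP": 1, "HISTID": "A", "AGE": 34},
    {"SERIALP": 1, "HISTID": "B", "AGE": 6},
    {"SERIALP": 2, "HISTID": "C", "AGE": 51},
]
full = merge_households_persons(households, persons)
linked = link_over_time(persons, [{"HISTID": "B", "AGE": 16}])
```

On real extracts the same joins would normally be done with a data-frame or database tool, since the complete-count files are far too large for in-memory lists.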
Households with more than 60 people in the original data were broken up for processing purposes. Every person in these large households is considered to be in their own household. The original large households can be identified using the variable SPLIT and reconstructed using the variable SPLITHID; the original count is found in the variable SPLITNUM.
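A rough sketch of reconstructing the split households follows. It assumes SPLIT equals 1 for persons who belonged to a split household and that SPLITHID holds the identifier of the original household. The exact coding should be checked against the lookup dataset. The toy household has only three members to keep the example short.

```python
from collections import defaultdict


def rebuild_split_households(persons):
    """Group persons from split households back under their original household id."""
    groups = defaultdict(list)
    for p in persons:
        if p["SPLIT"] == 1:  # assumed coding: 1 = household was split
            groups[p["SPLITHID"]].append(p)
    return dict(groups)


def count_mismatches(groups):
    """Return {SPLITHID: (members found, SPLITNUM)} where the two counts disagree."""
    return {
        hid: (len(members), members[0]["SPLITNUM"])
        for hid, members in groups.items()
        if len(members) != members[0]["SPLITNUM"]
    }


# Toy records, invented for illustration only.
toy = [
    {"SPLIT": 1, "SPLITHID": 900, "SPLITNUM": 3},
    {"SPLIT": 1, "SPLITHID": 900, "SPLITNUM": 3},
    {"SPLIT": 1, "SPLITHID": 900, "SPLITNUM": 3},
    {"SPLIT": 0, "SPLITHID": 0, "SPLITNUM": 0},
]
rebuilt = rebuild_split_households(toy)
mismatches = count_mismatches(rebuilt)
```

Comparing the regrouped size with SPLITNUM is a cheap consistency check after the merge.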
Coded variables derived from string variables are still in progress. These variables include: occupation and industry.
Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, MORTGAGE, FARM, CLASSWKR, OCC1950, IND1950, MARST, RACE, SEX, RELATE, MTONGUE. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.
Most inconsistent information was not edited for this release; thus, there are observations outside of the universe for some variables. In particular, the variables GQ and GQTYPE have known inconsistencies and will be improved with the next release.
This dataset was created on 2020-01-10 18:46:34.647 by merging multiple datasets together. The source datasets for this version were:
IPUMS 1920 households: This dataset includes all households from the 1920 US census.
IPUMS 1920 persons: This dataset includes all individuals from the 1920 US census.
IPUMS 1920 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.
The Bureau of the Census has released Census 2000 Summary File 1 (SF1) 100-Percent data. The file includes the following population items: sex, age, race, Hispanic or Latino origin, household relationship, and household and family characteristics. Housing items include occupancy status and tenure (whether the unit is owner or renter occupied). SF1 does not include information on incomes, poverty status, overcrowded housing or age of housing. These topics will be covered in Summary File 3. Data are available for states, counties, county subdivisions, places, census tracts, block groups, and, where applicable, American Indian and Alaska Native Areas and Hawaiian Home Lands. The SF1 data are available on the Bureau's web site and may be retrieved from American FactFinder as tables, lists, or maps. Users may also download a set of compressed ASCII files for each state via the Bureau's FTP server. There are over 8000 data items available for each geographic area. The full listing of these data items is available here as a downloadable compressed database file named TABLES.ZIP. The uncompressed file is in FoxPro database file (dbf) format and may be imported to ACCESS, EXCEL, and other software formats.
While all of this information is useful, the Office of Community Planning and Development has downloaded selected information for all states and areas and is making this information available on the CPD web pages. The tables and data items selected are those items used in the CDBG and HOME allocation formulas plus topics most pertinent to the Comprehensive Housing Affordability Strategy (CHAS), the Consolidated Plan, and similar overall economic and community development plans. The information is contained in five compressed (zipped) dbf tables for each state. When uncompressed, the tables are ready for use with FoxPro and they can be imported into ACCESS, EXCEL, and other spreadsheet, GIS and database software. The data are at the block group summary level.
The first two characters of the file name are the state abbreviation. The next two letters are BG for block group. The last part of the file name describes the contents. Each record is labeled with the code and name of the city and county in which it is located so that the data can be summarized to higher-level geography. The GEO file contains standard Census Bureau geographic identifiers for each block group, such as the metropolitan area code and congressional district code. The only data included in this table are total population and total housing units. POP1 and POP2 contain selected population variables, and selected housing items are in the HU file. The MA05 table data is only for use by State CDBG grantees for the reporting of the racial composition of beneficiaries of Area Benefit activities. The complete package for a state consists of the dictionary file named TABLES and the five data files for the state. The logical record number (LOGRECNO) links the records across tables.
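The naming convention and the LOGRECNO link can be illustrated with a short sketch. The file name "ALBGPOP1.dbf" and the fields TOTPOP and MALES are hypothetical examples chosen to fit the description. The exact file names and field names in the CPD package are not given here, and reading the dbf files themselves is left out.

```python
def parse_table_name(file_name):
    """Split a name such as 'ALBGPOP1.dbf' into state, summary level and contents."""
    stem = file_name.rsplit(".", 1)[0].upper()  # drop an extension if one is present
    if len(stem) < 5 or stem[2:4] != "BG":
        raise ValueError(f"unexpected table name: {file_name!r}")
    return {"state": stem[:2], "level": stem[2:4], "contents": stem[4:]}


def link_on_logrecno(*tables):
    """Combine records from several tables that share the same LOGRECNO."""
    linked = {}
    for table in tables:
        for record in table:
            linked.setdefault(record["LOGRECNO"], {}).update(record)
    return linked


# Toy records, invented for illustration only.
geo = [{"LOGRECNO": 101, "TOTPOP": 1250}]
pop1 = [{"LOGRECNO": 101, "MALES": 610}]
parsed = parse_table_name("ALBGPOP1.dbf")
block_groups = link_on_logrecno(geo, pop1)
```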
The 1950 Census population schedules were created by the Bureau of the Census in an attempt to enumerate every person living in the United States on April 1, 1950, although some persons were missed. The 1950 census population schedules were digitized by the National Archives and Records Administration (NARA) and released publicly on April 1, 2022. The 1950 Census enumeration district maps contain maps of counties, cities, and other minor civil divisions that show enumeration districts, census tracts, and related boundaries and numbers used for each census. The coverage is nationwide and includes territorial areas. The 1950 Census enumeration district descriptions contain written descriptions of census districts, subdivisions, and enumeration districts.
This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.
Official statistics are produced impartially and free from political influence.
This dataset includes all households from the 1920 US census.
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Archive of 1971 census aggregate data for England, Wales and Scotland, as made available originally on the Casweb (https://casweb.ukdataservice.ac.uk) platform.
Summary data of fixed broadband coverage by geographic area. License and Attribution: Broadband data from FCC Form 477, and data from the U.S. Census Bureau that are presented on this site are offered free and not subject to copyright restriction. Data and content created by government employees within the scope of their employment are not subject to domestic copyright protection under 17 U.S.C. § 105. See, e.g., U.S. Government Works.
While not required, when using content, data, documentation, code and related materials from fcc.gov or broadbandmap.fcc.gov in your own work, we ask that proper credit be given. Examples include:
• Source data: FCC Form 477
• Map layer based on FCC Form 477
• Code data based on broadbandmap.fcc.gov
The geography lookups are created from the US census shapefiles, which are in Geographic Coordinate System North American Datum of 1983 (GCS NAD83). The coordinates are not reprojected during processing. The "centroid_lng" and "centroid_lat" columns in the lookup table are the exact values from the US census shapefile (INTPTLON, INTPTLAT). The "bbox_arr" column is calculated from the bounding box/extent of the original geometry in the shapefile; no reprojection or transformations are done to the geometry.
https://www.ons.gov.uk/methodology/geography/licences
This file contains the National Statistics Postcode Lookup (NSPL) for the United Kingdom as at August 2022 in Comma Separated Variable (CSV) and ASCII text (TXT) formats. To download the zip file click the Download button. The NSPL relates both current and terminated postcodes to a range of current statutory geographies via ‘best-fit’ allocation from the 2021 Census Output Areas (national parks and Workplace Zones are exempt from ‘best-fit’ and use ‘exact-fit’ allocations) for England and Wales. Scotland and Northern Ireland have the 2011 Census Output Areas. It supports the production of area-based statistics from postcoded data. The NSPL is produced by ONS Geography, who provide geographic support to the Office for National Statistics (ONS) and geographic services used by other organisations. The NSPL is issued quarterly. (File size - 184 MB).
This dataset includes all individuals from the 1920 US census.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset shows census data for Nigeria from government data sources and the World Bank data portal.
https://www.ons.gov.uk/methodology/geography/licences
This file contains the National Statistics Postcode Lookup (NSPL) for the United Kingdom as at February 2024 in Comma Separated Variable (CSV) and ASCII text (TXT) formats. To download the zip file click the Download button. The NSPL relates both current and terminated postcodes to a range of current statutory geographies via ‘best-fit’ allocation from the 2021 Census Output Areas (national parks and Workplace Zones are exempt from ‘best-fit’ and use ‘exact-fit’ allocations) for England, Wales and Northern Ireland. Scotland has the 2011 Census Output Areas.
It supports the production of area-based statistics from postcoded data. The NSPL is produced by ONS Geography, who provide geographic support to the Office for National Statistics (ONS) and geographic services used by other organisations. The NSPL is issued quarterly. (File size - 176 MB). Updated 26/02/2024 to remove the BUASD11 field included in error.
The TIGER/Line shapefiles and related database files (.dbf) are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) Database (MTDB). The MTDB represents a seamless national file with no overlaps or gaps between parts; however, each TIGER/Line shapefile is designed to stand alone as an independent data set, or they can be combined to cover the entire nation.
ZIP Code Tabulation Areas (ZCTAs) are approximate area representations of U.S. Postal Service (USPS) ZIP Code service areas that the Census Bureau creates to present statistical data for each decennial census. The Census Bureau delineates ZCTA boundaries for the United States, Puerto Rico, American Samoa, Guam, the Commonwealth of the Northern Mariana Islands, and the U.S. Virgin Islands once each decade following the decennial census. Data users should not use ZCTAs to identify the official USPS ZIP Code for mail delivery. The USPS makes periodic changes to ZIP Codes to support more efficient mail delivery. The Census Bureau uses tabulation blocks as the basis for defining each ZCTA. Tabulation blocks are assigned to a ZCTA based on the most frequently occurring ZIP Code for the addresses contained within that block. The most frequently occurring ZIP Code also becomes the five-digit numeric code of the ZCTA. These codes may contain leading zeros. Blocks that do not contain addresses but are surrounded by a single ZCTA (enclaves) are assigned to the surrounding ZCTA. Because the Census Bureau only uses the most frequently occurring ZIP Code to assign blocks, a ZCTA may not exist for every USPS ZIP Code. Some ZIP Codes may not have a matching ZCTA because too few addresses were associated with the specific ZIP Code or the ZIP Code was not the most frequently occurring ZIP Code within any of the blocks where it exists.
The ZCTA boundaries in this release are those delineated following the 2020 Census.
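The block-to-ZCTA rule described above (most frequently occurring ZIP Code per tabulation block) can be illustrated with a simplified sketch. It is not the Census Bureau's implementation: the enclave rule for blocks without addresses is not handled, ties are broken by first occurrence, and the block ids and ZIP Codes are invented. ZIP Codes are kept as strings so that leading zeros survive.

```python
from collections import Counter


def assign_blocks_to_zcta(block_zips):
    """Map each block id to the most frequent ZIP Code among its addresses."""
    assignment = {}
    for block_id, zip_codes in block_zips.items():
        if not zip_codes:
            continue  # blocks without addresses need the enclave rule, not shown here
        assignment[block_id] = Counter(zip_codes).most_common(1)[0][0]
    return assignment


# Toy address lists per block, invented for illustration only.
toy_blocks = {
    "block_1": ["02139", "02139", "02138"],
    "block_2": ["02138"],
    "block_3": [],
}
zcta_by_block = assign_blocks_to_zcta(toy_blocks)
```

The same string handling applies when reading the shapefile attribute tables: loading the ZCTA code as an integer would turn "02139" into 2139.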
This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1940 datasets.
For 156 years (1840 - 1996), the U.S. Department of Commerce, Bureau of the Census was responsible for collecting census of agriculture data. The 1997 Appropriations Act contained a provision that transferred the responsibility for the census of agriculture from the Bureau of the Census to the U.S. Department of Agriculture (USDA), National Agricultural Statistics Service (NASS). The 2007 Census of Agriculture is the 27th Federal census of agriculture and the third conducted by NASS. The first agriculture census was taken in 1840 as part of the sixth decennial census of population. The agriculture census continued to be taken as part of the decennial census through 1950. A separate mid-decade census of agriculture was conducted in 1925, 1935, and 1945. From 1954 to 1974, the census was taken for the years ending in 4 and 9. In 1976, Congress authorized the census of agriculture to be taken for 1978 and 1982 to adjust the data reference year so that it coincided with other economic censuses. This adjustment in timing established the agriculture census on a 5-year cycle collecting data for years ending in 2 and 7. Agriculture census data are used to:
• Evaluate, change, promote, and formulate farm and rural policies and programs that help agricultural producers;
• Study historical trends, assess current conditions, and plan for the future;
• Formulate market strategies, provide more efficient production and distribution systems, and locate facilities for agricultural communities;
• Make energy projections and forecast needs for agricultural producers and their communities;
• Develop new and improved methods to increase agricultural production and profitability;
• Allocate local and national funds for farm programs, e.g. extension service projects, agricultural research, soil conservation programs, and land-grant colleges and universities;
• Plan for operations during drought and emergency outbreaks of diseases or infestations of pests;
• Analyze and report on the current state of food, fuel, feed, and fiber production in the United States.
American Samoa is one of the territories collectively referred to as the "US Outlying areas". The 2008 American Samoa Census of Agriculture was conducted by personal interviews of all farm operations on the list of commercial farms, and supplemented by an area sample of the remaining households. The purpose of the area sample was to efficiently account for farms not on the commercial farm list and provide an accurate measure of the agricultural activity in American Samoa.
National coverage
Households
The statistical unit for the CA 2008 was the farm, an operating unit defined as any place from which USD 1 000 or more of agricultural products were produced and sold, or normally would have been sold, during the census year.
Census/enumeration data [cen]
i. Methodological modality for conducting the census: The classical approach was used in the CA 2008.
ii. Sample design: The design of the sample for the 2008 Census of Agriculture made use of materials and information available from the American Samoa Department of Commerce. These included detailed maps of all the islands in the territory, up-to-date map-spotting (location on a map) of all households in the territory, a system of numbering each household to provide it a unique identifier, and identification of households which were on the list of commercial farms. The households that were on the list of commercial farms were excluded from the universe used to select the area sample. A random sample of the remaining households was selected, using the available maps with the household identification information. It was determined that a 20 percent sample would be optimal. A serpentine selection methodology, starting at a point determined by the generation of a random number, was used to select the area sample.
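One possible reading of the serpentine selection is sketched below: households are ordered along a snake-like path through the map, and every fifth household is taken from a random starting point, which gives a 20 percent sample. The exact procedure used in the census is not described here, so the grid layout, the function names and the fixed interval are assumptions made for illustration.

```python
import random


def serpentine_order(rows):
    """Walk a grid of household ids row by row, reversing every second row."""
    ordered = []
    for index, row in enumerate(rows):
        ordered.extend(row if index % 2 == 0 else reversed(row))
    return ordered


def systematic_sample(ordered_ids, fraction=0.2, rng=None):
    """Take every k-th id (k = 1 / fraction) starting at a random offset."""
    rng = rng or random.Random()
    interval = round(1 / fraction)
    start = rng.randrange(interval)  # random starting point within the first interval
    return ordered_ids[start::interval]


# Toy 10 x 10 grid of household ids, invented for illustration only.
grid = [[row * 10 + col for col in range(10)] for row in range(10)]
path = serpentine_order(grid)
sample = systematic_sample(path, fraction=0.2, rng=random.Random(2008))
```

Ordering households along a continuous path before sampling at a fixed interval spreads the selected households across the whole territory.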
Face-to-face paper [f2f]
One questionnaire was used which collected information on:
DATA PROCESSING AND ARCHIVING: The completed forms were scanned and Optical Mark Recognition (OMR) was used to retrieve categorical responses and to identify the other answer zones in which some type of mark was present. The edit system determined the best value to impute for reported responses that were deemed unreasonable and for required responses that were absent. The complex edit ensured the full internal consistency of the record. After tabulation and review of the aggregates, a comprehensive disclosure review was conducted. Cell suppression was used to protect the cells that were determined to be sensitive to a disclosure of information.
CENSUS DATA QUALITY: NASS conducted an extensive program to follow up all non-response. NASS also used capture-recapture methodology to adjust for under-coverage, non-response, and misclassification. To implement capture-recapture methods, two independent surveys were required: the 2012 Census of Agriculture (based on the Census Mail List) and the 2012 June Agricultural Survey (based on the area frame). Historically, NASS has been careful to maintain the independence of these two surveys.
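The idea behind capture-recapture can be shown with the textbook two-sample (Lincoln-Petersen) estimator. This is only an illustration of the principle with invented counts. The adjustment NASS actually applied is not detailed here and is likely more elaborate.

```python
def lincoln_petersen(n_first, n_second, n_both):
    """Estimate the total population from two independent captures.

    n_first  -- units found by the first survey
    n_second -- units found by the second survey
    n_both   -- units found by both surveys
    """
    if n_both <= 0:
        raise ValueError("at least one unit must appear in both surveys")
    return n_first * n_second / n_both


# Invented counts: 200 farms in one survey, 150 in the other, 100 in both.
estimated_farms = lincoln_petersen(200, 150, 100)
```

The estimator relies on the two surveys being independent, which is why the independence of the list-based and area-based surveys matters.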
The complete data series from the 2008 Census of Agriculture is available from the NASS website free of charge in multiple formats, including Quick Stats 2.0 - an online database to retrieve customized tables with Census data at the national, state and county levels. The 2012 Census of Agriculture provides information on a range of topics, including agricultural practices, conservation, organic production, as well as traditional and specialty crops.
Registration information on interstate, intrastate non-hazmat, and intrastate truck and bus companies that operate in the United States and have registered with FMCSA. Contains contact information and demographic information (number of drivers, vehicles, commodities carried, etc.).
Census tracts are small, relatively permanent geographic entities within counties (or the statistical equivalents of counties) delineated by a committee of local data users. Generally, census tracts have between 2,500 and 8,000 residents and boundaries that follow visible features. When first established, census tracts are to be as homogeneous as possible with respect to population characteristics, economic status, and living conditions. (www.census.gov)
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Geography Lookup Table’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/801e1bd7-e19b-4cb9-8959-b34b8fc61ab7 on 12 February 2022.
--- Dataset description provided by original source is as follows ---
Summary data of fixed broadband coverage by geographic area. License and Attribution: Broadband data from FCC Form 477, and data from the U.S. Census Bureau that are presented on this site are offered free and not subject to copyright restriction. Data and content created by government employees within the scope of their employment are not subject to domestic copyright protection under 17 U.S.C. § 105. See, e.g., U.S. Government Works.
While not required, when using content, data, documentation, code and related materials from fcc.gov or broadbandmap.fcc.gov in your own work, we ask that proper credit be given. Examples include:
• Source data: FCC Form 477
• Map layer based on FCC Form 477
• Code data based on broadbandmap.fcc.gov
The geography lookups are created from the US census shapefiles, which are in Geographic Coordinate System North American Datum of 1983 (GCS NAD83). The coordinates are not reprojected during processing. The "centroid_lng" and "centroid_lat" columns in the lookup table are the exact values from the US census shapefile (INTPTLON, INTPTLAT). The "bbox_arr" column is calculated from the bounding box/extent of the original geometry in the shapefile; no reprojection or transformations are done to the geometry.
--- Original source retains full ownership of the source dataset ---
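As an illustration of how a bounding box such as "bbox_arr" can be derived from a geometry without any reprojection, the sketch below takes the minimum and maximum of the raw longitude and latitude values. The element order [min_lng, min_lat, max_lng, max_lat] and the sample coordinates are assumptions, not taken from the lookup table itself.

```python
def bounding_box(ring):
    """Return [min_lng, min_lat, max_lng, max_lat] for a list of (lng, lat) pairs."""
    if not ring:
        raise ValueError("geometry has no coordinates")
    lngs = [lng for lng, _ in ring]
    lats = [lat for _, lat in ring]
    return [min(lngs), min(lats), max(lngs), max(lats)]


# Invented polygon ring in NAD83 longitude/latitude, for illustration only.
toy_ring = [(-77.12, 38.80), (-76.91, 38.80), (-76.91, 39.00), (-77.12, 39.00)]
toy_bbox = bounding_box(toy_ring)
```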