The 1950 Census population schedules were created by the Bureau of the Census in an attempt to enumerate every person living in the United States on April 1, 1950, although some persons were missed. The 1950 census population schedules were digitized by the National Archives and Records Administration (NARA) and released publicly on April 1, 2022. The 1950 Census enumeration district maps contain maps of counties, cities, and other minor civil divisions that show enumeration districts, census tracts, and related boundaries and numbers used for each census. The coverage is nation wide and includes territorial areas. The 1950 Census enumeration district descriptions contain written descriptions of census districts, subdivisions, and enumeration districts.
This data collection contains a stratified 1-percent sample of households, with separate records for each household, each "sample line" respondent, and each person in the household. These records were encoded from microfilm copies of original handwritten enumeration schedules from the 1950 Census of Population. Geographic identification of the location of the sampled households includes Census regions and divisions, states (except Alaska and Hawaii), Standard Metropolitan Areas (SMAs), and State Economic Areas (SEAs). The data collection was constructed from and consists of 20 independently-drawn subsamples stored in 20 discrete physical files. The 1950 Census had both a complete-count and a sample component. Individuals selected for the sample component were asked a set of additional questions. Only households with a sample line person were included in the 1950 Public Use Microdata Sample. The collection also contains records of group quarters members who were also on the Census sample line. Each household record contains variables describing the location and composition of the household. The sample line records contain variables describing demographic characteristics such as nativity, marital status, number of children, veteran status, education, income, and occupation. The person records contain demographic variables such as nativity, marital status, family membership, and occupation. (Source: downloaded from ICPSR 7/13/10)
Please Note: This dataset is part of the historical CISER Data Archive Collection and is also available at ICPSR at https://doi.org/10.3886/ICPSR08251.v1. We highly recommend using the ICPSR version as they may make this dataset available in multiple data formats in the future.
Census Year 1950 Census Tracts. The dataset contains polygons representing CY 1950 census tracts, created as part of the D.C. Geographic Information System (DC GIS) for the D.C. Office of the Chief Technology Officer (OCTO) and participating D.C. government agencies. Census tracts were identified from maps provided by the U.S. Census Bureau and the D.C. Office of Planning. The tract polygons were created by selecting street arcs from the WGIS planimetric street centerlines. Where necessary, polygons were also heads-up digitized from 1995/1999 orthophotographs.
Two novel datasets—French 19th-century and U.S. 1950 Census records—to demonstrate our approach.
Aggregate data from documents from the state bureaus of the census
This data collection and its 1940 counterpart were assembled through a collaborative effort between the United States Bureau of the Census and the Center for Demography and Ecology of the University of Wisconsin. The 1940 and 1950 Census Public Use Sample Project was supported by The National Science Foundation under Grant SES-7704135. The collections contain a stratified 1-percent sample of households, with separate records for each household, for each \'sample line\' respondent, and for each person in the household. These records were encoded from microfilm copies of original handwritten enumeration schedules from the 1940 and 1950 Censuses of Population. The universe for the sample included all persons and households within the United States. Geographic identification of the location of the sampled households includes Census regions and divisions, States (except Alaska and Hawaii), Standard Metropolitan Areas (SMA\'s), and State Economic Areas (SEA\'s). The SMA\'s and SEA\'s are comparable for both the 1940 and 1950 Public Use Microdata Samples (PUMS). The data collections were constructed from and consist of 20 independently-drawn subsamples stored in 20 discrete physical files. Each of the 20 subsamples contains three record types (household, \'sample line\', and person). Both collections had both a complete-count and a sample component. Individuals selected for the sample component were asked a set of additional questions. Only households with a \'sample line\' person were included in the public use microdata sample. The collections also contain records of group quarters members who were also on the Census \'sample line\'. For the 1940 and 1950 collections, each household record contains variables describing the location and composition of the household. The \'sample line\' records for 1950 contain variables describing demographic characteristics such as nativity, marital status, number of children, veteran status, education, income, and occupation. The person records for 1950 contain such demographic variables as nativity, marital status, family membership, and occupation. Accompanying the data collections are code books which include an abstract, descriptions of sample design, processing procedures and file structure, a data dictionary (record layout), category code lists, and a glossary. The data collections are arranged by subsample with each subsample stored as a separate physical file of information. The 20 subsamples were selected randomly. Within each of the 20 subsamples, records are sequenced by State. Extracting all of the records for one State entails reading through all of the 20 physical files and selecting that State\'s records from each of the 20 subsamples. Record types are ordered within household (household characteristics first, \'sample line\' next, and person records last). The 1950 collection consists of a total of 2,844,458 data records: 461,130 household records, 461,130 \'sample line\' records, and 1,922,198 person records. Each record type has a logical record length of 133.;
https://www.icpsr.umich.edu/web/ICPSR/studies/2931/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/2931/terms
The 1950 Census Tract files were originally created by keypunching the data from the printed publications prepared by the Bureau of the Census. The work was done under the direction of Dr. Donald Bogue, whose wife, Elizabeth Mullen Bogue, completed much of the data work. Subsequently, the punchcards were converted to data files and transferred to the National Archive and Records Administration (NARA). ICPSR received copies of these files from NARA and converted the binary block-length records to ASCII format.
1950 United States Census contains records from Montpelier, Washington, Vermont, USA by United States of America, Bureau of the Census; Washington, D.C.; Seventeenth Census of the United States, 1950; Record Group: Records of the Bureau of the Census, 1790-2007; Record Group Number: 29; Residence Date: 1950; Home in 1950: Montpelier, Washington, Vermont; Roll: 471; Sheet Number: 11; Enumeration District: 12-40 - .
"Website allows the public full access to the 1950 Census images, census maps and descriptions.
1950 Dwellings Census Data for Baltimore, Maryland. Refer to the 1950 codebook (codebook_1950.pdf) for more information. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.
1950 Education Census Data for Baltimore, Maryland. Refer to the 1950 codebook (codebook_1950.pdf) for more information. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This is the shapefile of the mapped 1950 census data for Austin, Texas.
1950 Age Census Data for Baltimore, Maryland. Refer to the 1950 codebook (codebook_1950.pdf) for more information. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.
The United States Census Bureau’s international dataset provides estimates of country populations since 1950 and projections through 2050. Specifically, the dataset includes midyear population figures broken down by age and gender assignment at birth. Additionally, time-series data is provided for attributes including fertility rates, birth rates, death rates, and migration rates.
You can use the BigQuery Python client library to query tables in this dataset in Kernels. Note that methods available in Kernels are limited to querying data. Tables are at bigquery-public-data.census_bureau_international.
What countries have the longest life expectancy? In this query, 2016 census information is retrieved by joining the mortality_life_expectancy and country_names_area tables for countries larger than 25,000 km2. Without the size constraint, Monaco is the top result with an average life expectancy of over 89 years!
SELECT
age.country_name,
age.life_expectancy,
size.country_area
FROM (
SELECT
country_name,
life_expectancy
FROM
bigquery-public-data.census_bureau_international.mortality_life_expectancy
WHERE
year = 2016) age
INNER JOIN (
SELECT
country_name,
country_area
FROM
bigquery-public-data.census_bureau_international.country_names_area
where country_area > 25000) size
ON
age.country_name = size.country_name
ORDER BY
2 DESC
/* Limit removed for Data Studio Visualization */
LIMIT
10
Which countries have the largest proportion of their population under 25? Over 40% of the world’s population is under 25 and greater than 50% of the world’s population is under 30! This query retrieves the countries with the largest proportion of young people by joining the age-specific population table with the midyear (total) population table.
SELECT
age.country_name,
SUM(age.population) AS under_25,
pop.midyear_population AS total,
ROUND((SUM(age.population) / pop.midyear_population) * 100,2) AS pct_under_25
FROM (
SELECT
country_name,
population,
country_code
FROM
bigquery-public-data.census_bureau_international.midyear_population_agespecific
WHERE
year =2017
AND age < 25) age
INNER JOIN (
SELECT
midyear_population,
country_code
FROM
bigquery-public-data.census_bureau_international.midyear_population
WHERE
year = 2017) pop
ON
age.country_code = pop.country_code
GROUP BY
1,
3
ORDER BY
4 DESC /* Remove limit for visualization*/
LIMIT
10
The International Census dataset contains growth information in the form of birth rates, death rates, and migration rates. Net migration is the net number of migrants per 1,000 population, an important component of total population and one that often drives the work of the United Nations Refugee Agency. This query joins the growth rate table with the area table to retrieve 2017 data for countries greater than 500 km2.
SELECT
growth.country_name,
growth.net_migration,
CAST(area.country_area AS INT64) AS country_area
FROM (
SELECT
country_name,
net_migration,
country_code
FROM
bigquery-public-data.census_bureau_international.birth_death_growth_rates
WHERE
year = 2017) growth
INNER JOIN (
SELECT
country_area,
country_code
FROM
bigquery-public-data.census_bureau_international.country_names_area
Historic (none)
United States Census Bureau
Terms of use: This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
See the GCP Marketplace listing for more details and sample queries: https://console.cloud.google.com/marketplace/details/united-states-census-bureau/international-census-data
The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—and provides the largest and richest source of individual level and household data.
All manuscripts (and other items you'd like to publish) must be submitted to
phsdatacore@stanford.edu for approval prior to journal submission.
We will check your cell sizes and citations.
For more information about how to cite PHS and PHS datasets, please visit:
https:/phsdocs.developerhub.io/need-help/citing-phs-data-core
Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.
In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.
The historic US 1920 census data was collected in January 1920. Enumerators collected data traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data was collected. Household members absent on the day of data collected were either listed to the household with the help of other household members or were scheduled for the last census subdivision.
Notes
We provide household and person data separately so that it is convenient to explore the descriptive statistics on each level. In order to obtain a full dataset, merge the household and person on the variables SERIAL and SERIALP. In order to create a longitudinal dataset, merge datasets on the variable HISTID.
Households with more than 60 people in the original data were broken up for processing purposes. Every person in the large households are considered to be in their own household. The original large households can be identified using the variable SPLIT, reconstructed using the variable SPLITHID, and the original count is found in the variable SPLITNUM.
Coded variables derived from string variables are still in progress. These variables include: occupation and industry.
Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, MORTGAGE, FARM, CLASSWKR, OCC1950, IND1950, MARST, RACE, SEX, RELATE, MTONGUE. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.
Most inconsistent information was not edited for this release, thus there are observations outside of the universe for some variables. In particular, the variables GQ, and GQTYPE have known inconsistencies and will be improved with the next release.
%3C!-- --%3E
This dataset was created on 2020-01-10 18:46:34.647
by merging multiple datasets together. The source datasets for this version were:
IPUMS 1920 households: This dataset includes all households from the 1920 US census.
IPUMS 1920 persons: This dataset includes all individuals from the 1920 US census.
IPUMS 1920 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.
1950 Employment Census Data for Baltimore, Maryland. Refer to the 1950 codebook (codebook_1950.pdf) for more information. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.
This metadata report documents tabular data sets consisting of items from the Census of Agriculture. These data are a subset of items from county-level data (including state totals) for the conterminous United States covering the census reporting years (every five years, with adjustments for 1978 and 1982) beginning with the 1950 Census of Agriculture and ending with the 2012 Census of Agriculture. Historical (1950-1997) data were extracted from digital files obtained through the Intra-university Consortium on Political and Social Research (ICPSR). More current (1997-2012) data were extracted from the National Agriculture Statistical Service (NASS) Census Query Tool for the census years of 1997, 2002, 2007, and 2012. Most census reports contain item values from the prior census for comparison. At times these values are updated or reweighted by the reporting agency; the Census Bureau prior to 1997 or NASS from 1997 on. Where available, the updated or reweighted data were used; otherwise, the original reported values were used. Changes in census item definitions and reporting as well as changes to county areas and names over the time span required a degree of manipulation on the data and county codes to make the data as comparable as possible over time. Not all of the census items are present for the entire 1950-2012 time span as certain items have been added since 1950 and when possible the items were derived from other items by subtracting or combining sub items. Specific changes and calculations are documented in the processing steps sections of this report. Other missing data occurs at the state and (or) county level due to census non-disclosure rules where small numbers of farms reporting an item have acres and (or) production values withheld to prevent identification of individual farms. In general, caution should be exercised when comparing current (2012) data with values reported in earlier censuses. While the 1974-2012 data are comparable, data prior to 1974 will have inflated farm counts and slightly inflated production amounts due to the differences in collection methods, primarily, the definition of a farm. Further discussion on comparability can be found the comparability section of the Supplemental Information element of this metadata report. Excluded from the tabular data are the District of Columbia, Menominee County, Wisconsin, and the independent cities of Virginia with the exception of the three county-equivalent cities of Chesapeake City, Suffolk, and Virginia Beach. Data for independent cities of Virginia prior to 1959 have been included with their surrounding or adjacent county. Please refer to the Supplemental Information element for information on terminology, the Census of Agriculture, the Inter-university Consortium for Political and Social Research (ICPSR), table and variable structure, data comparability, all farms and economic class 1-5 farms, item calculations, increase of farms from 1974 to 1978, missing data and exclusion explanations, 1978 crop irregularities, pastureland irregularities, county alignment, definitions, and references. In addition to the metadata is an excel workbook (VariableKey.xlsx) with spreadsheets containing key spreadsheets for items and variables by category and a spreadsheet noting the presence or absence of entire variable data by year. Note: this dataset was updated on 2016-02-10 to populate omitted irrigation values for Miami-Dade County, Florida in 1997.
This polygon shapefile provides county or county-equivalent boundaries for the conterminous United States and was created specifically for use with the data tables published as Selected Items from the Census of Agriculture for the Conterminous United States, 1950-2012 (LaMotte, 2015). This data layer is a modified version of Historic Counties for the 2000 Census of Population and Housing produced by the National Historical Geographic Information System (NHGIS) project, which is identical to the U.S. Census Bureau TIGER/Line Census 2000 file, with the exception of added shorelines. Excluded from the CAO_STCOFIPS boundary layer are Broomfield County, Colorado, Menominee County, Wisconsin, and the independent cities of Virginia with the exception of the 3 county-equivalent cities of Chesapeake City, Suffolk, and Virginia Beach. The census of agriculture was not taken in the District of Columbia for 1959, but available data indicate few if any farms in that area, the polygon was left in place to preserve the areas of the surrounding counties. Baltimore City, Maryland was combined with Baltimore County and the St. Louis City, Missouri, was combined with St. Louis County. La Paz County, Arizona was combined with Yuma County, Arizona and Cibola County, New Mexico was combined with Valencia County, New Mexico. Minor county border changes were at a level of precision beyond the scope of the data collection. A major objective of the census data tabulation is to maintain a reasonable degree of comparability of agricultural data from census to census. The tabular data collection is from 14 different censuses where definitions and data collection techniques may change over time and while the data are mostly comparable, a degree of caution should be exercised when using the data in analysis procedures. While the data are at a county-level resolution, a regional approach is more appropriate than a county-by-county analysis. The main purpose of this layer is to provide a base to generate a county raster for the allocation of agricultural census values to specific (agricultural) pixels. Vector format is provided so the raster pixel size can be user designated. References cited: LaMotte, A.E., 2015, Selected items from the Census of Agriculture at the county level for the conterminous United States, 1950-2012: U.S. Geological Survey data release, http://dx.doi.org/10.5066/F7H13016. National Historical Geographic Information System, Minnesota Population Center, 2004, Historic counties for the 2000 census of population and housing: Minneapolis, MN, University of Minnesota, accessed 03/18/2013 at http://nhgis.org
Link to the ScienceBase Item Summary page for the item described by this metadata record. Service Protocol: Link to the ScienceBase Item Summary page for the item described by this metadata record. Application Profile: Web Browser. Link Function: information
Unadjusted decennial census data from 1950-2000 and projected figures from 2010-2040: summary table of New York City population numbers and percentage share by Borough, including school-age (5 to 17), 65 and Over, and total population.
The 1950 Census population schedules were created by the Bureau of the Census in an attempt to enumerate every person living in the United States on April 1, 1950, although some persons were missed. The 1950 census population schedules were digitized by the National Archives and Records Administration (NARA) and released publicly on April 1, 2022. The 1950 Census enumeration district maps contain maps of counties, cities, and other minor civil divisions that show enumeration districts, census tracts, and related boundaries and numbers used for each census. The coverage is nation wide and includes territorial areas. The 1950 Census enumeration district descriptions contain written descriptions of census districts, subdivisions, and enumeration districts.