The 1950 Census population schedules were created by the Bureau of the Census in an attempt to enumerate every person living in the United States on April 1, 1950, although some persons were missed. The 1950 census population schedules were digitized by the National Archives and Records Administration (NARA) and released publicly on April 1, 2022. The 1950 Census enumeration district maps contain maps of counties, cities, and other minor civil divisions that show enumeration districts, census tracts, and related boundaries and numbers used for each census. The coverage is nation wide and includes territorial areas. The 1950 Census enumeration district descriptions contain written descriptions of census districts, subdivisions, and enumeration districts.
1950 Age Census Data for Baltimore, Maryland. Refer to the 1950 codebook (codebook_1950.pdf) for more information. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.
1950 Dwellings Census Data for Baltimore, Maryland. Refer to the 1950 codebook (codebook_1950.pdf) for more information. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.
1950 Employment Census Data for Baltimore, Maryland. Refer to the 1950 codebook (codebook_1950.pdf) for more information. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.
This data collection and its 1940 counterpart were assembled through a collaborative effort between the United States Bureau of the Census and the Center for Demography and Ecology of the University of Wisconsin. The 1940 and 1950 Census Public Use Sample Project was supported by The National Science Foundation under Grant SES-7704135. The collections contain a stratified 1-percent sample of households, with separate records for each household, for each \'sample line\' respondent, and for each person in the household. These records were encoded from microfilm copies of original handwritten enumeration schedules from the 1940 and 1950 Censuses of Population. The universe for the sample included all persons and households within the United States. Geographic identification of the location of the sampled households includes Census regions and divisions, States (except Alaska and Hawaii), Standard Metropolitan Areas (SMA\'s), and State Economic Areas (SEA\'s). The SMA\'s and SEA\'s are comparable for both the 1940 and 1950 Public Use Microdata Samples (PUMS). The data collections were constructed from and consist of 20 independently-drawn subsamples stored in 20 discrete physical files. Each of the 20 subsamples contains three record types (household, \'sample line\', and person). Both collections had both a complete-count and a sample component. Individuals selected for the sample component were asked a set of additional questions. Only households with a \'sample line\' person were included in the public use microdata sample. The collections also contain records of group quarters members who were also on the Census \'sample line\'. For the 1940 and 1950 collections, each household record contains variables describing the location and composition of the household. The \'sample line\' records for 1950 contain variables describing demographic characteristics such as nativity, marital status, number of children, veteran status, education, income, and occupation. The person records for 1950 contain such demographic variables as nativity, marital status, family membership, and occupation. Accompanying the data collections are code books which include an abstract, descriptions of sample design, processing procedures and file structure, a data dictionary (record layout), category code lists, and a glossary. The data collections are arranged by subsample with each subsample stored as a separate physical file of information. The 20 subsamples were selected randomly. Within each of the 20 subsamples, records are sequenced by State. Extracting all of the records for one State entails reading through all of the 20 physical files and selecting that State\'s records from each of the 20 subsamples. Record types are ordered within household (household characteristics first, \'sample line\' next, and person records last). The 1950 collection consists of a total of 2,844,458 data records: 461,130 household records, 461,130 \'sample line\' records, and 1,922,198 person records. Each record type has a logical record length of 133.;
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This is the census data collected for Austin, Texas in 1950.
This metadata report documents tabular data sets consisting of items from the Census of Agriculture. These data are a subset of items from county-level data (including state totals) for the conterminous United States covering the census reporting years (every five years, with adjustments for 1978 and 1982) beginning with the 1950 Census of Agriculture and ending with the 2012 Census of Agriculture. Historical (1950-1997) data were extracted from digital files obtained through the Intra-university Consortium on Political and Social Research (ICPSR). More current (1997-2012) data were extracted from the National Agriculture Statistical Service (NASS) Census Query Tool for the census years of 1997, 2002, 2007, and 2012. Most census reports contain item values from the prior census for comparison. At times these values are updated or reweighted by the reporting agency; the Census Bureau prior to 1997 or NASS from 1997 on. Where available, the updated or reweighted data were used; otherwise, the original reported values were used. Changes in census item definitions and reporting as well as changes to county areas and names over the time span required a degree of manipulation on the data and county codes to make the data as comparable as possible over time. Not all of the census items are present for the entire 1950-2012 time span as certain items have been added since 1950 and when possible the items were derived from other items by subtracting or combining sub items. Specific changes and calculations are documented in the processing steps sections of this report. Other missing data occurs at the state and (or) county level due to census non-disclosure rules where small numbers of farms reporting an item have acres and (or) production values withheld to prevent identification of individual farms. In general, caution should be exercised when comparing current (2012) data with values reported in earlier censuses. While the 1974-2012 data are comparable, data prior to 1974 will have inflated farm counts and slightly inflated production amounts due to the differences in collection methods, primarily, the definition of a farm. Further discussion on comparability can be found the comparability section of the Supplemental Information element of this metadata report. Excluded from the tabular data are the District of Columbia, Menominee County, Wisconsin, and the independent cities of Virginia with the exception of the three county-equivalent cities of Chesapeake City, Suffolk, and Virginia Beach. Data for independent cities of Virginia prior to 1959 have been included with their surrounding or adjacent county. Please refer to the Supplemental Information element for information on terminology, the Census of Agriculture, the Inter-university Consortium for Political and Social Research (ICPSR), table and variable structure, data comparability, all farms and economic class 1-5 farms, item calculations, increase of farms from 1974 to 1978, missing data and exclusion explanations, 1978 crop irregularities, pastureland irregularities, county alignment, definitions, and references. In addition to the metadata is an excel workbook (VariableKey.xlsx) with spreadsheets containing key spreadsheets for items and variables by category and a spreadsheet noting the presence or absence of entire variable data by year. Note: this dataset was updated on 2016-02-10 to populate omitted irrigation values for Miami-Dade County, Florida in 1997.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This is the shapefile of the mapped 1950 census data for Austin, Texas.
This dataset is made up of images containing handwritten 3-digit occupation codes from the Norwegian population census of 1950. The occupation codes were added to the census sheets by Statistics Norway after the census was concluded for the purpose of creating aggregated occupational statistics for the entire population. The coding standard used in the 1950 census is, according to Statistics Norway’s official publications (https://www.ssb.no/historisk-statistikk/folketellinger/folketellingen-1950, booklet 4, page 81), very similar to the standards used in the census for 1920. Cf. the 13th booklet published for the 1920 census (https://www.ssb.no/historisk-statistikk/folketellinger/folketellingen-1920, note that this booklet is only available in Norwegian). In short, an occupation code is a 3-digit number that corresponds to a given occupation or type of occupation. According to the official list of occupation codes provided by Statistics Norway there are 339 unique codes. These are not all necessarily sequential or hierarchical in general, but some subgroupings are. This list can be found under Files. It is also worth noting that these images were extracted from the original census sheet images algorithmically. This process was not flawless and lead to additional images being extracted, these can contain written occupation titles or be left entirely blank. The dataset consists of 90,000 unique images, and 9,000 images that were randomly selected and copied from the unique images. These were all used for a research project (link to preprint article: https://doi.org/10.48550/arXiv.2306.16126) where we (author list can be found in preprint) tried to find a more efficient way of reviewing and correcting classification results from a Machine Learning model, where the results did not pass a pre-set confidence threshold. This was a follow-up to our previous article where we describe the initial project and creating of our model in more detail, if it is of interest (“Lessons Learned Developing and Using a Machine Learning Model to Automatically Transcribe 2.3 Million Handwritten Occupation Codes”, https://doi.org/10.51964/hlcs11331).
This polygon shapefile provides county or county-equivalent boundaries for the conterminous United States and was created specifically for use with the data tables published as Selected Items from the Census of Agriculture for the Conterminous United States, 1950-2012 (LaMotte, 2015). This data layer is a modified version of Historic Counties for the 2000 Census of Population and Housing produced by the National Historical Geographic Information System (NHGIS) project, which is identical to the U.S. Census Bureau TIGER/Line Census 2000 file, with the exception of added shorelines. Excluded from the CAO_STCOFIPS boundary layer are Broomfield County, Colorado, Menominee County, Wisconsin, and the independent cities of Virginia with the exception of the 3 county-equivalent cities of Chesapeake City, Suffolk, and Virginia Beach. The census of agriculture was not taken in the District of Columbia for 1959, but available data indicate few if any farms in that area, the polygon was left in place to preserve the areas of the surrounding counties. Baltimore City, Maryland was combined with Baltimore County and the St. Louis City, Missouri, was combined with St. Louis County. La Paz County, Arizona was combined with Yuma County, Arizona and Cibola County, New Mexico was combined with Valencia County, New Mexico. Minor county border changes were at a level of precision beyond the scope of the data collection. A major objective of the census data tabulation is to maintain a reasonable degree of comparability of agricultural data from census to census. The tabular data collection is from 14 different censuses where definitions and data collection techniques may change over time and while the data are mostly comparable, a degree of caution should be exercised when using the data in analysis procedures. While the data are at a county-level resolution, a regional approach is more appropriate than a county-by-county analysis. The main purpose of this layer is to provide a base to generate a county raster for the allocation of agricultural census values to specific (agricultural) pixels. Vector format is provided so the raster pixel size can be user designated. References cited: LaMotte, A.E., 2015, Selected items from the Census of Agriculture at the county level for the conterminous United States, 1950-2012: U.S. Geological Survey data release, http://dx.doi.org/10.5066/F7H13016. National Historical Geographic Information System, Minnesota Population Center, 2004, Historic counties for the 2000 census of population and housing: Minneapolis, MN, University of Minnesota, accessed 03/18/2013 at http://nhgis.org
The United States Census Bureau’s international dataset provides estimates of country populations since 1950 and projections through 2050. Specifically, the dataset includes midyear population figures broken down by age and gender assignment at birth. Additionally, time-series data is provided for attributes including fertility rates, birth rates, death rates, and migration rates.
You can use the BigQuery Python client library to query tables in this dataset in Kernels. Note that methods available in Kernels are limited to querying data. Tables are at bigquery-public-data.census_bureau_international.
What countries have the longest life expectancy? In this query, 2016 census information is retrieved by joining the mortality_life_expectancy and country_names_area tables for countries larger than 25,000 km2. Without the size constraint, Monaco is the top result with an average life expectancy of over 89 years!
SELECT
age.country_name,
age.life_expectancy,
size.country_area
FROM (
SELECT
country_name,
life_expectancy
FROM
bigquery-public-data.census_bureau_international.mortality_life_expectancy
WHERE
year = 2016) age
INNER JOIN (
SELECT
country_name,
country_area
FROM
bigquery-public-data.census_bureau_international.country_names_area
where country_area > 25000) size
ON
age.country_name = size.country_name
ORDER BY
2 DESC
/* Limit removed for Data Studio Visualization */
LIMIT
10
Which countries have the largest proportion of their population under 25? Over 40% of the world’s population is under 25 and greater than 50% of the world’s population is under 30! This query retrieves the countries with the largest proportion of young people by joining the age-specific population table with the midyear (total) population table.
SELECT
age.country_name,
SUM(age.population) AS under_25,
pop.midyear_population AS total,
ROUND((SUM(age.population) / pop.midyear_population) * 100,2) AS pct_under_25
FROM (
SELECT
country_name,
population,
country_code
FROM
bigquery-public-data.census_bureau_international.midyear_population_agespecific
WHERE
year =2017
AND age < 25) age
INNER JOIN (
SELECT
midyear_population,
country_code
FROM
bigquery-public-data.census_bureau_international.midyear_population
WHERE
year = 2017) pop
ON
age.country_code = pop.country_code
GROUP BY
1,
3
ORDER BY
4 DESC /* Remove limit for visualization*/
LIMIT
10
The International Census dataset contains growth information in the form of birth rates, death rates, and migration rates. Net migration is the net number of migrants per 1,000 population, an important component of total population and one that often drives the work of the United Nations Refugee Agency. This query joins the growth rate table with the area table to retrieve 2017 data for countries greater than 500 km2.
SELECT
growth.country_name,
growth.net_migration,
CAST(area.country_area AS INT64) AS country_area
FROM (
SELECT
country_name,
net_migration,
country_code
FROM
bigquery-public-data.census_bureau_international.birth_death_growth_rates
WHERE
year = 2017) growth
INNER JOIN (
SELECT
country_area,
country_code
FROM
bigquery-public-data.census_bureau_international.country_names_area
Historic (none)
United States Census Bureau
Terms of use: This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
See the GCP Marketplace listing for more details and sample queries: https://console.cloud.google.com/marketplace/details/united-states-census-bureau/international-census-data
Link to the ScienceBase Item Summary page for the item described by this metadata record. Service Protocol: Link to the ScienceBase Item Summary page for the item described by this metadata record. Application Profile: Web Browser. Link Function: information
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data set provides the annual population of counties and states calculated from decennial U.S. censuses conducted from 1890-1950 and the Census Bureau’s annual projections of state population growth. The primary sources are “Population of States and Counties of the United States: 1790-1990,” published by the U.S. Bureau of the Census (1966); “Census U.S. Decennial County Population Data, 1900-1990” published by the NBER (2007); “Historical Statistics of Hawaii,” published by University Press of Hawaii (1977); and “Annual Estimates of the Population for the U.S. and States,” published by the U.S. Bureau of the Census from 1890 to 1950. The digitized, transparent, and consistent nature of this data and provides numerous benefits, including ease of access and greater potential for analysis.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mexico Population: Census: 50 to 54 Yrs Old data was reported at 6,155.306 Person th in 2015. This records an increase from the previous number of 5,064.291 Person th for 2010. Mexico Population: Census: 50 to 54 Yrs Old data is updated yearly, averaging 2,896.049 Person th from Dec 1950 (Median) to 2015, with 9 observations. The data reached an all-time high of 6,155.306 Person th in 2015 and a record low of 828.126 Person th in 1950. Mexico Population: Census: 50 to 54 Yrs Old data remains active status in CEIC and is reported by National Institute of Statistics and Geography. The data is categorized under Global Database’s Mexico – Table MX.G002: Population: Census.
Midyear population estimates and projections for all countries and areas of the world with a population of 5,000 or more // Source: U.S. Census Bureau, Population Division, International Programs Center // Note: Total population available from 1950 to 2100 for 227 countries and areas. Other demographic variables available from base year to 2100. Base year varies by country and therefore data are not available for all years for all countries. For the United States, total population available from 1950-2060, and other demographic variables available from 1980-2060. See methodology at https://www.census.gov/programs-surveys/international-programs/about/idb.html
Midyear population estimates and projections for all countries and areas of the world with a population of 5,000 or more // Source: U.S. Census Bureau, Population Division, International Programs Center// Note: Total population available from 1950 to 2100 for 227 countries and areas. Other demographic variables available from base year to 2100. Base year varies by country and therefore data are not available for all years for all countries. For the United States, total population available from 1950-2060, and other demographic variables available from 1980-2060. See methodology at https://www.census.gov/programs-surveys/international-programs/about/idb.html
The study of social class and corresponding measurement schemes has evolved separately in Europe and the US. On both continents a standardized occupational coding system exists that can be transferred into a wide scala of measures of socioeconomic status. This dataset contains a crosswalk between the two standardized historical occupational coding schemes: HISCO and Occ1950.
The Historical International Standardized Classification of Occupations (HISCO) is the European standard for occupational coding and can be used to generate social class schemes, such as HISCLASS, SOCPO, and HISCAM. The U.S. Bureau of the Census' 1950 standard (Occ1950) is the U.S. standard for occupational coding and can be used to generate social class schemes, like NPBOSS, OCCSCORE, PRESGL, and SEI. With the crosswalk, HISCO can be converted to the American class coding schemes and Occ1950 into the European class coding schemes.
Occupational categories were linked between HISCO and Occ1950 on the underlying occupations. Both HISCO and Occ1950 consist of multiple layers of occupational groups. HISCO is divided in 7 major, 76 minor, 296 unit, and 1,675 micro groups, which roughly correspond with: social classes, sectors, occupational groups, and occupational subgroups. Occ1950 on the other hand is divided in 10 social classes and 269 occupational groups. HISCO’s micro groups and Occ1950’s occupational subgroups are based on a well-documented number of occupations, which can easily be compared and matched between both occupational coding schemes.
In the translation from HISCO to Occ1950 1,675 occupational categories were collapsed into 229 Occ1950 unique occupational groups. Although 40 occupational groups in Occ1950 could not be retrieved from HISCO, all occupations were successfully attributed to the right social class. Vice versa, 269 occupational groups in Occ1950 were recoded into 227 HISCO micro groups. Together these 227 unique codes are well-spread over the different branches of the HISCO tree, as they cover most of the unit groups.
#Please note that this is not the crosswalk from Occ1950 to the intermediate HISCO used by the NAPP project, also known as OCCHISCO or NAPPHISCO. This crosswalk can be retrieved from: https://github.com/rlzijdeman/o-clack/tree/master/crosswalks/occhisco_to_hisco
#HISCO is the European standard for occupational coding and can be used to generate HISCLASS, SOCPO and HISCAM classifications. The necessary conversion table has been made available by Mandemakers et al. and is available on: https://socialhistory.org/en/hsn/hsn-standardized-hisco-coded-and-classified-occupational-titles-release-201301?language=en
#Occ1950 is the US standard for occupational coding. The occupational coding system is based on the US Census of 1950 and can be transferred into OCCSCORE, PRESGL, SEI, and Nam-Powers-Boyd. Crosswalks are available on request: https://usa.ipums.org/usa/vols_4_5_index.shtml
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These are the files, results, model and source code for the OCCODE project. They include:The results from running the occupational column from the full county 1950 Norwegian census through the OCCODE pipeline.An overview of the coding system used by Statistics Norway to encode the occupations of the 1950 Norwegian full count census.This overview was created by Statistics Norway, and the original can be found here: https://www.ssb.no/a/folketellinger/The model trained on the Random sample dataset.Previous results from running the occupational column from the full county 1950 Norwegian census through the OCCODE pipeline.Note that these results are worse than the current ones, found in RandomSample_total_confidence_scores.csvThe source code for the OCCODE pipeline as of 2021.05.11
This product provides tabular data from the U.S. Department of Agriculture (USDA) Census of Agriculture for selected items for the period 1950-2017 for counties in the conterminous United States. Data from 1950-2012 are taken from LaMotte (2015) and 2017 data are retrieved from the USDA QuickStats online tool. Data which are withheld in the Census of Agriculture are filled with estimates. The data include crop production values for 12 commodities (for example, corn in bushels), land use values for 7 land use types (for example, acres of total cropland), and 9 values for livestock types (for example, number of hogs and pigs). The data are largely intended as a 2017 update to the LaMotte dataset for items of research interest. LaMotte, A.E., 2015, Selected items from the Census of Agriculture at the county level for the conterminous United States, 1950-2012: U.S. Geological Survey data release, http://dx.doi.org/10.5066/F7H13016.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The United States Census Bureau’s International Dataset provides estimates of country populations since 1950 and projections through 2050.
The U.S. Census Bureau provides estimates and projections for countries and areas that are recognized by the U.S. Department of State that have a population of at least 5,000. Specifically, the data set includes midyear population figures broken down by age and gender assignment at birth. Additionally, they provide time-series data for attributes including fertility rates, birth rates, death rates, and migration rates.
Fork this kernel to get started.
https://bigquery.cloud.google.com/dataset/bigquery-public-data:census_bureau_international
https://cloud.google.com/bigquery/public-data/international-census
Dataset Source: www.census.gov
This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source -http://www.data.gov/privacy-policy#data_policy - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Banner Photo by Steve Richey from Unsplash.
What countries have the longest life expectancy?
Which countries have the largest proportion of their population under 25?
Which countries are seeing the largest net migration?
The 1950 Census population schedules were created by the Bureau of the Census in an attempt to enumerate every person living in the United States on April 1, 1950, although some persons were missed. The 1950 census population schedules were digitized by the National Archives and Records Administration (NARA) and released publicly on April 1, 2022. The 1950 Census enumeration district maps contain maps of counties, cities, and other minor civil divisions that show enumeration districts, census tracts, and related boundaries and numbers used for each census. The coverage is nation wide and includes territorial areas. The 1950 Census enumeration district descriptions contain written descriptions of census districts, subdivisions, and enumeration districts.