9 datasets found
  1. IPUMS CPS download

    • figshare.com
    application/gzip
    Updated Jun 2, 2023
    Ansel Schiavone (2023). IPUMS CPS download [Dataset]. http://doi.org/10.6084/m9.figshare.13833281.v1
    Available download formats: application/gzip
    Dataset updated: Jun 2, 2023
    Dataset provided by: figshare
    Authors: Ansel Schiavone
    License: Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Citation: Sarah Flood, Miriam King, Renae Rodgers, Steven Ruggles and J. Robert Warren. Integrated Public Use Microdata Series, Current Population Survey: Version 8.0 [dataset]. Minneapolis, MN: IPUMS, 2020. https://doi.org/10.18128/D030.V8.0
    Data may be found at: https://cps.ipums.org/cps/citation.shtml

  2. CenSoc-Numident

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Dec 2, 2024
    Joshua R. Goldstein; Monica Alexander; Casey Breen; Andrea Miranda González; Felipe Menares; Maria Osborne; Mallika Snyder; Ugur Yildirim (2024). CenSoc-Numident [Dataset]. http://doi.org/10.7910/DVN/I0TLPI
    Dataset updated: Dec 2, 2024
    Dataset provided by: Harvard Dataverse
    Authors: Joshua R. Goldstein; Monica Alexander; Casey Breen; Andrea Miranda González; Felipe Menares; Maria Osborne; Mallika Snyder; Ugur Yildirim
    License: https://dataverse.harvard.edu/api/datasets/:persistentId/versions/7.0/customlicense?persistentId=doi:10.7910/DVN/I0TLPI

    Description

    The CenSoc-Numident dataset links the 1940 census to the National Archives’ public release of the Social Security Numident file (“NARA Numident”). Our linking strategy relies on first name, last name, year of birth, and place of birth. To link unmarried women, we use father’s last name as a proxy for women’s maiden name. We use the ABE fully automated linking approach developed by Abramitzky, Boustan, and Eriksson (2012, 2014, 2017). To work with this dataset, researchers must download and link the 1940 full-count Census sample from IPUMS-USA on the HISTID variable. Please adhere to the citation and usage guidelines of both CenSoc and IPUMS-USA when using this dataset. The CenSoc-Numident supplemental geography file contains additional variables with place of birth and/or place of death information, such as county of birth and death, for a subset of the CenSoc-Numident dataset. The CenSoc-Numident sibling files identify sibling groups in the CenSoc-Numident dataset.
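The HISTID link described above can be sketched as a simple keyed join. This is only an illustration, assuming both files have been exported as CSV with a `HISTID` column; the other column names (`death_age`, `AGE`) and all values are hypothetical toy data, not taken from CenSoc or IPUMS-USA.

```python
import csv
import io


def link_on_histid(censoc_rows, census_rows):
    """Inner-join two lists of dict records on the HISTID key."""
    census_by_id = {row["HISTID"]: row for row in census_rows}
    linked = []
    for row in censoc_rows:
        match = census_by_id.get(row["HISTID"])
        if match is not None:
            # Keep census fields and add the mortality fields alongside them.
            linked.append({**match, **row})
    return linked


# Hypothetical toy extracts standing in for the real downloads.
censoc_csv = "HISTID,death_age\nA1,78\nB2,81\n"
census_csv = "HISTID,AGE\nA1,25\nC3,40\n"

censoc = list(csv.DictReader(io.StringIO(censoc_csv)))
census = list(csv.DictReader(io.StringIO(census_csv)))
linked = link_on_histid(censoc, census)
```

Only records whose HISTID appears in both files survive the join, so unmatched census records are dropped rather than padded with missing values.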

  3. Historic US census - 1930

    • redivis.com
    application/jsonl +7
    Updated Jan 10, 2020
    Stanford Center for Population Health Sciences (2020). Historic US census - 1930 [Dataset]. http://doi.org/10.57761/6e5q-rh85
    Available download formats: application/jsonl, parquet, spss, csv, arrow, stata, avro, sas
    Dataset updated: Jan 10, 2020
    Dataset provided by: Redivis Inc.
    Authors: Stanford Center for Population Health Sciences
    Time period covered: Jan 1, 1930 - Dec 31, 1930
    Area covered: United States
    Description

    Abstract

    The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations, Ancestry.com and FamilySearch, and provide the largest and richest source of individual-level and household-level data.

    Before Manuscript Submission

    All manuscripts (and other items you'd like to publish) must be submitted to phsdatacore@stanford.edu for approval prior to journal submission.

    We will check your cell sizes and citations.

    For more information about how to cite PHS and PHS datasets, please visit: https://phsdocs.developerhub.io/need-help/citing-phs-data-core

    Documentation

    This dataset was created on 2020-01-10 22:52:11.461 by merging multiple datasets together. The source datasets for this version were:

    IPUMS 1930 households: This dataset includes all households from the 1930 US census.

    IPUMS 1930 persons: This dataset includes all individuals from the 1930 US census.

    IPUMS 1930 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1930 datasets.

    Section 2

    Historic data are scarce and often exist only in aggregate tables. The key advantage of historic US census data is the availability of individual- and household-level characteristics that researchers can tabulate in ways that benefit their specific research questions. The data contain demographic, economic, migration, and family variables. Within households, it is possible to create relational data because all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility of following individuals over time using a historical identifier.

    In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.

    The historic US 1930 census data were collected in April 1930. Enumerators collected data by traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data were collected. Household members absent on the day of data collection were either listed with the household with the help of other household members or were scheduled for the last census subdivision.

    Notes

    • We provide IPUMS household and person data separately so that it is convenient to explore the descriptive statistics on each level. To obtain a full dataset, merge the household and person datasets on the variables SERIAL and SERIALP. To create a longitudinal dataset, merge datasets on the variable HISTID.

    • Households with more than 60 people in the original data were broken up for processing purposes. Every person in a large household is considered to be in their own household. The original large households can be identified using the variable SPLIT and reconstructed using the variable SPLITHID; the original count is found in the variable SPLITNUM.

    • Coded variables derived from string variables are still in progress. These variables include: occupation and industry.

    • Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGEMARR, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, FARM, EMPSTAT, OCC1950, IND1950, MTONGUE, MARST, RACE, SEX, RELATE, CLASSWKR. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.

    • Most inconsistent information was not edite
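The household-person merge mentioned in the notes above might look like the following sketch. It assumes that household records carry the key SERIAL and person records carry the matching key SERIALP, which is one reading of the note rather than documented behavior; the records and their values are hypothetical.

```python
def merge_households_persons(households, persons):
    """Attach household fields to each person record (one output row per person)."""
    hh_by_serial = {hh["SERIAL"]: hh for hh in households}
    merged = []
    for person in persons:
        hh = hh_by_serial.get(person["SERIALP"])
        if hh is None:
            continue  # skip persons with no matching household record
        merged.append({**hh, **person})
    return merged


# Hypothetical toy records; real extracts contain many more variables.
households = [{"SERIAL": 1, "FARM": 1}, {"SERIAL": 2, "FARM": 2}]
persons = [
    {"SERIALP": 1, "HISTID": "A1", "AGE": 34},
    {"SERIALP": 1, "HISTID": "A2", "AGE": 6},
    {"SERIALP": 2, "HISTID": "B1", "AGE": 51},
]
full = merge_households_persons(households, persons)
```

The result keeps one row per person, with the household variables repeated on every member of the same household.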

  4. Data on Alaskan Population demographics ranging from 1940 to 2015

    • knb.ecoinformatics.org
    • dataone.org
    Updated Feb 7, 2019
    United States Census Bureau; Juliet Bachtel; John Randazzo (2019). Data on Alaskan Population demographics ranging from 1940 to 2015 [Dataset]. http://doi.org/10.5063/F1CV4FZX
    Dataset updated: Feb 7, 2019
    Dataset provided by: Knowledge Network for Biocomplexity
    Authors: United States Census Bureau; Juliet Bachtel; John Randazzo
    Time period covered: Jan 1, 1940 - Dec 31, 2015
    Variables measured: lat, lng, Year, city, ANVSA, Negro, Other, Place, White, Aleut., and 138 more
    Description

    These data comprise Census records relating to the Alaskan people's population demographics for the State of Alaskan Salmon and People (SASAP) Project. Decennial census data were originally extracted from IPUMS National Historic Geographic Information Systems website: https://data2.nhgis.org/main (Citation: Steven Manson, Jonathan Schroeder, David Van Riper, and Steven Ruggles. IPUMS National Historical Geographic Information System: Version 12.0 [Database]. Minneapolis: University of Minnesota. 2017. http://doi.org/10.18128/D050.V12.0). A number of relevant tables of basic demographics on age and race, household income and poverty levels, and labor force participation were extracted.

    These particular variables were selected as part of an effort to understand and potentially quantify various dimensions of well-being in Alaskan communities. The file "censusdata_master.csv" is a consolidation of all 21 other data files in the package. For detailed information on how the datasets vary over different years, view the file "readme.docx" available in this data package.

    The included .Rmd file is a script which combines the 21 files by year into a single file (censusdata_master.csv). It also cleans up place names (including typographical errors) and uses the USGS place names dataset and the SASAP regions dataset to assign latitude and longitude values and region values to each place in the dataset. Note that some places were not assigned a region or location because they do not fit well into the regional framework.

    Considerable heterogeneity exists between census surveys each year. While we have attempted to combine these datasets in a way that makes sense, there may be some discrepancies or unexpected values. Please send a description of any unusual values to the dataset contact.
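To illustrate what combining yearly tables with differing columns involves, here is a rough Python sketch. It is not the package's .Rmd script; the function names, the `total_pop` and `median_income` columns, and all values are hypothetical. Columns missing in a given year are left empty in the combined output.

```python
import csv
import io


def combine_yearly_tables(tables):
    """tables maps year -> list of dict rows; returns (fieldnames, rows)."""
    fieldnames = ["Year"]
    rows = []
    for year in sorted(tables):
        for row in tables[year]:
            # Grow the header as new columns appear in later years.
            for key in row:
                if key not in fieldnames:
                    fieldnames.append(key)
            rows.append({"Year": year, **row})
    return fieldnames, rows


def to_csv(fieldnames, rows):
    """Serialize rows to CSV text, leaving absent columns blank."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fieldnames, restval="", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Hypothetical toy tables: the later year has an extra column.
tables = {
    1940: [{"Place": "ExamplePlace", "total_pop": "10"}],
    2010: [{"Place": "ExamplePlace", "total_pop": "20", "median_income": "30"}],
}
fieldnames, rows = combine_yearly_tables(tables)
master = to_csv(fieldnames, rows)
```

Building the header as a union of all columns is one way to cope with surveys that report different variables in different years; blank cells then mark variables that a given year did not collect.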
    
  5. Alaskan Population Demographic Information from Decennial and American...

    • search.dataone.org
    • dataone.org
    Updated Apr 10, 2019
    United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus (2019). Alaskan Population Demographic Information from Decennial and American Community Survey Census Data, 1940-2016 [Dataset]. http://doi.org/10.5063/F1B856FH
    Dataset updated: Apr 10, 2019
    Dataset provided by: Knowledge Network for Biocomplexity
    Authors: United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus
    Time period covered: Jan 1, 1940 - Dec 31, 2015
    Variables measured: lat, lng, Name, Year, city, ANVSA, Negro, Other, Place, White, and 147 more
    Description

    These data comprise Census records relating to the Alaskan people's population demographics for the State of Alaskan Salmon and People (SASAP) Project. Decennial census data were originally extracted from IPUMS National Historic Geographic Information Systems website: https://data2.nhgis.org/main (Citation: Steven Manson, Jonathan Schroeder, David Van Riper, and Steven Ruggles. IPUMS National Historical Geographic Information System: Version 12.0 [Database]. Minneapolis: University of Minnesota. 2017. http://doi.org/10.18128/D050.V12.0). A number of relevant tables of basic demographics on age and race, household income and poverty levels, and labor force participation were extracted. These particular variables were selected as part of an effort to understand and potentially quantify various dimensions of well-being in Alaskan communities. The file "censusdata_master.csv" is a consolidation of all 21 other data files in the package. For detailed information on how the datasets vary over different years, view the file "readme.docx" available in this data package. The included .Rmd file is a script which combines the 21 files by year into a single file (censusdata_master.csv). It also cleans up place names (including typographical errors) and uses the USGS place names dataset and the SASAP regions dataset to assign latitude and longitude values and region values to each place in the dataset. Note that some places were not assigned a region or location because they do not fit well into the regional framework. Considerable heterogeneity exists between census surveys each year. While we have attempted to combine these datasets in a way that makes sense, there may be some discrepancies or unexpected values. The RMarkdown document SASAPWebsiteGraphicsCensus.Rmd is used to generate a variety of figures using these data, including the additional file Chignik_population.png

  6. Languages used in Alaskan households, 1990-2015

    • knb.ecoinformatics.org
    • dataone.org
    Updated Mar 15, 2019
    Jeanette Clark; Sharis Ochs; Derek Strong; National Historic Geographic Information System (2019). Languages used in Alaskan households, 1990-2015 [Dataset]. http://doi.org/10.5063/F11G0JHX
    Dataset updated: Mar 15, 2019
    Dataset provided by: Knowledge Network for Biocomplexity
    Authors: Jeanette Clark; Sharis Ochs; Derek Strong; National Historic Geographic Information System
    Time period covered: Jan 1, 1990 - Jan 1, 2015
    Variables measured: lat, lng, Year, city, thai, urdu, greek, hindi, hmong, indic, and 40 more
    Description

    These data show languages spoken in the household for people over the age of 5 in Alaska, in addition to the total population, by community. These data come from census surveys, both from the American Community Survey and the decennial census. Population and language use data were originally extracted from IPUMS National Historic Geographic Information Systems website: https://data2.nhgis.org/main (Citation: Steven Manson, Jonathan Schroeder, David Van Riper, and Steven Ruggles. IPUMS National Historical Geographic Information System: Version 12.0 [Database]. Minneapolis: University of Minnesota. 2017. http://doi.org/10.18128/D050.V12.0). The file "household_language.csv" is a consolidation of a number of tables downloaded from this system (see methods for more information). The "language.Rmd" file is a script which combines the files by year into a single file. It also cleans up place names (including typographical errors) and uses the USGS place names dataset and the SASAP regions dataset to assign latitude and longitude values and region values to each place in the dataset. Additionally, the "language_vis.Rmd" file is a script that uses these data to visualize Native language use by community, displayed in the "language_vis.html" file.

  7. Alaskan Population Demographic Information from Decennial and American...

    • knb.ecoinformatics.org
    Updated Mar 25, 2019
    United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus (2019). Alaskan Population Demographic Information from Decennial and American Community Survey Census Data, 1940-2016 [Dataset]. http://doi.org/10.5063/F1M906XQ
    Dataset updated: Mar 25, 2019
    Dataset provided by: Knowledge Network for Biocomplexity
    Authors: United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus
    Time period covered: Jan 1, 1940 - Dec 31, 2015
    Variables measured: lat, lng, Name, Year, city, ANVSA, Negro, Other, Place, White, and 147 more
    Description

    These data comprise Census records relating to the Alaskan people's population demographics for the State of Alaskan Salmon and People (SASAP) Project. Decennial census data were originally extracted from IPUMS National Historic Geographic Information Systems website: https://data2.nhgis.org/main (Citation: Steven Manson, Jonathan Schroeder, David Van Riper, and Steven Ruggles. IPUMS National Historical Geographic Information System: Version 12.0 [Database]. Minneapolis: University of Minnesota. 2017. http://doi.org/10.18128/D050.V12.0). A number of relevant tables of basic demographics on age and race, household income and poverty levels, and labor force participation were extracted. These particular variables were selected as part of an effort to understand and potentially quantify various dimensions of well-being in Alaskan communities. The file "censusdata_master.csv" is a consolidation of all 21 other data files in the package. For detailed information on how the datasets vary over different years, view the file "readme.docx" available in this data package. The included .Rmd file is a script which combines the 21 files by year into a single file (censusdata_master.csv). It also cleans up place names (including typographical errors) and uses the USGS place names dataset and the SASAP regions dataset to assign latitude and longitude values and region values to each place in the dataset. Note that some places were not assigned a region or location because they do not fit well into the regional framework. Considerable heterogeneity exists between census surveys each year. While we have attempted to combine these datasets in a way that makes sense, there may be some discrepancies or unexpected values. The RMarkdown document SASAPWebsiteGraphicsCensus.Rmd is used to generate a variety of figures using these data, including the additional file Chignik_population.png

  8. Mapping Segregation in the Twin Cities DGAH 210 Sample map

    • dgah-210-carleton.hub.arcgis.com
    Updated Feb 12, 2024
    Carleton College (2024). Mapping Segregation in the Twin Cities DGAH 210 Sample map [Dataset]. https://dgah-210-carleton.hub.arcgis.com/maps/375c984f074b4493b756240de682e8b2
    Dataset updated: Feb 12, 2024
    Dataset authored and provided by: Carleton College
    Description

    This map provides a spatial illustration of different means by which racial segregation was historically reinforced across the cities of Minneapolis and Saint Paul. The map focuses largely on data from the 1940s, and includes the following data layers:

    • Population by Race - Data based on the 1940 US Census that shows the percentage of the non-white population at the census tract level. This data was downloaded from NHGIS, with a spatial join performed to combine the census table and historic tracts (Citation: Steven Manson, Jonathan Schroeder, David Van Riper, Katherine Knowles, Tracy Kugler, Finn Roberts, and Steven Ruggles, IPUMS National Historical Geographic Information System: Version 18.0. Minneapolis, MN: IPUMS. 2023).

    • HOLC Map Zones by Number of Covenants - This layer displays a summary of the number of racially exclusive covenants within the area of zones designated by grade on HOLC redlining maps. The polygons of each grade zone were digitized by the Mapping Inequality Project (University of Richmond Digital Scholarship Lab) and are symbolized by the grade colors on the original maps. The data on racially exclusive covenants in Twin Cities neighborhoods was downloaded from the Mapping Prejudice Project (University of Minnesota) and is symbolized by the size of each feature.

    • Greenbook Locations - This layer displays locations included on Greenbook travel guides from the 1940s, which indicate safe businesses for African American travelers to American cities. This data comes from a service layer created by Shana Crosson (University of Minnesota).

    The spatial extent of this map is limited to the cities of Minneapolis and Saint Paul. It was created as part of an in-class exercise in February of 2024.

  9. CenSoc Demo Files

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Jan 25, 2024
    Joshua R. Goldstein; Monica Alexander; Casey Breen; Andrea Miranda González; Felipe Menares; Maria Osborne; Mallika Snyder; Ugur Yildirim (2024). CenSoc Demo Files [Dataset]. http://doi.org/10.7910/DVN/QVDPM9
    Dataset updated: Jan 25, 2024
    Dataset provided by: Harvard Dataverse
    Authors: Joshua R. Goldstein; Monica Alexander; Casey Breen; Andrea Miranda González; Felipe Menares; Maria Osborne; Mallika Snyder; Ugur Yildirim
    License: https://dataverse.harvard.edu/api/datasets/:persistentId/versions/2.0/customlicense?persistentId=doi:10.7910/DVN/QVDPM9

    Description

    A prelinked “demo” version of the CenSoc-DMF and CenSoc-Numident datasets with approximately 15 mortality covariates from the 1940 Census and ~1% of records in the complete CenSoc datasets. Please adhere to the citation and usage guidelines of both CenSoc and IPUMS-USA when using this dataset.



IPUMS CPS download

4 scholarly articles cite this dataset (View in Google Scholar)