48 datasets found

Historic US census - 1930
redivis.com
application/jsonl +7
Updated Jan 10, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stanford Center for Population Health Sciences (2020). Historic US census - 1930 [Dataset]. http://doi.org/10.57761/6e5q-rh85
Explore at:
application/jsonl, parquet, spss, csv, arrow, stata, avro, sasAvailable download formats
Unique identifier
https://doi.org/10.57761/6e5q-rh85
Dataset updated
Jan 10, 2020
Dataset provided by
Redivis Inc.
Authors
Stanford Center for Population Health Sciences
Time period covered
Jan 1, 1930 - Dec 31, 1930
Area covered
United States
Description
Abstract

The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—and provides the largest and richest source of individual level and household data.

Before Manuscript Submission

All manuscripts (and other items you'd like to publish) must be submitted to

phsdatacore@stanford.edu for approval prior to journal submission.

We will check your cell sizes and citations.

For more information about how to cite PHS and PHS datasets, please visit:

https:/phsdocs.developerhub.io/need-help/citing-phs-data-core

Documentation

This dataset was created on 2020-01-10 22:52:11.461 by merging multiple datasets together. The source datasets for this version were:

IPUMS 1930 households: This dataset includes all households from the 1930 US census.

IPUMS 1930 persons: This dataset includes all individuals from the 1930 US census.

IPUMS 1930 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1930 datasets.

Section 2

Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.

In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier. In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.

The historic US 1930 census data was collected in April 1930. Enumerators collected data traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data was collected. Household members absent on the day of data collected were either listed to the household with the help of other household members or were scheduled for the last census subdivision.

Notes

We provide IPUMS household and person data separately so that it is convenient to explore the descriptive statistics on each level. In order to obtain a full dataset, merge the household and person on the variables SERIAL and SERIALP. In order to create a longitudinal dataset, merge datasets on the variable HISTID.

Households with more than 60 people in the original data were broken up for processing purposes. Every person in the large households are considered to be in their own household. The original large households can be identified using the variable SPLIT, reconstructed using the variable SPLITHID, and the original count is found in the variable SPLITNUM.

Coded variables derived from string variables are still in progress. These variables include: occupation and industry.

Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGEMARR, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, FARM, EMPSTAT, OCC1950, IND1950, MTONGUE, MARST, RACE, SEX, RELATE, CLASSWKR. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.

Most inconsistent information was not edite
Ireland Census
ebroy.org
Updated 1911
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Class: RG14; Census of Ireland 1901/1911. The National Archives of Ireland. http://www.census.nationalarchives.ie/search/: accessed 31 May 2013; Ancestry.com. Web: Ireland, Census, 1911 [database on-line]. Provo, UT, USA: Ancestry.com Operations, Inc., 2013. (1911). Ireland Census [Dataset]. https://ebroy.org/profile/?person=P31
Explore at:
Dataset updated
1911
Dataset provided by
Ancestryhttp://ancestry.com/
Authors
Class: RG14; Census of Ireland 1901/1911. The National Archives of Ireland. http://www.census.nationalarchives.ie/search/: accessed 31 May 2013; Ancestry.com. Web: Ireland, Census, 1911 [database on-line]. Provo, UT, USA: Ancestry.com Operations, Inc., 2013.
Area covered
Ireland
Description
Ireland Census contains records from Scalp, Peterswell, County Galway, Ireland by Class: RG14; Census of Ireland 1901/1911. The National Archives of Ireland. http://www.census.nationalarchives.ie/search/: accessed 31 May 2013; Ancestry.com. Web: Ireland, Census, 1911 [database on-line]. Provo, UT, USA: Ancestry.com Operations, Inc., 2013. - .
r
Lookup
redivis.com
Updated Jan 10, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stanford Center for Population Health Sciences (2020). Lookup [Dataset]. https://redivis.com/datasets/gsmz-24068kvny
Explore at:
Dataset updated
Jan 10, 2020
Dataset authored and provided by
Stanford Center for Population Health Sciences
Description
This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1920 datasets.
d
POPMAPS: An R package to estimate ancestry probability surfaces
catalog.data.gov
data.usgs.gov
Updated Jul 6, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). POPMAPS: An R package to estimate ancestry probability surfaces [Dataset]. https://catalog.data.gov/dataset/popmaps-an-r-package-to-estimate-ancestry-probability-surfaces
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Description
This software code was developed to estimate the probability that individuals found at a geographic location will belong to the same genetic cluster as individuals at the nearest empirical sampling location for which ancestry is known. POPMAPS includes 5 main functions to calculate and visualize these results (see Table 1 for functions and arguments). Population assignment coefficients and a raster surface must be estimated prior to using POPMAPS functions (see Fig. 1a and b). With these data in hand, users can run a jackknife function to choose an optimal parameter combination that reconstructs empirical data best (Figs. 2 and S2). Pertinent parameters include 1) how many empirical sampling localities should be used to estimate ancestry coefficients and 2) what is the influence of empirical sites on ancestry coefficient estimation as distance increases (Fig. 2). After choosing these parameters, a user can estimate the entire ancestry probability surface (Fig. 1c and d, Fig. 3). This package can be used to estimate ancestry coefficients from empirical genetic data across a user-defined geospatial layer. Estimated ancestry coefficients are used to calculate ancestry probabilities, which together with 'hard population boundaries,' compose an ancestry probability surface. Within a hard boundary, the ancestry probability informs a user of the confidence that they can have of genetic identity matching the principal population if they were to find individuals of the focal organism at a location. Confidence can be modified across the ancestry probability surface by changing parameters influencing the contribution of empirical data to the estimation of ancestry coefficients. This information may be valuable to inform decision-making for organisms having management needs. See 'Related External Resources, Type: Source Code' below for direct access to the POPMAPS R software package.
Census Data
catalog.data.gov
datadiscoverystudio.org
+3more
Updated Mar 1, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Bureau of the Census (2024). Census Data [Dataset]. https://catalog.data.gov/dataset/census-data
Explore at:
Dataset updated
Mar 1, 2024
Dataset provided by
United States Census Bureauhttp://census.gov/
Description
The Bureau of the Census has released Census 2000 Summary File 1 (SF1) 100-Percent data. The file includes the following population items: sex, age, race, Hispanic or Latino origin, household relationship, and household and family characteristics. Housing items include occupancy status and tenure (whether the unit is owner or renter occupied). SF1 does not include information on incomes, poverty status, overcrowded housing or age of housing. These topics will be covered in Summary File 3. Data are available for states, counties, county subdivisions, places, census tracts, block groups, and, where applicable, American Indian and Alaskan Native Areas and Hawaiian Home Lands. The SF1 data are available on the Bureau's web site and may be retrieved from American FactFinder as tables, lists, or maps. Users may also download a set of compressed ASCII files for each state via the Bureau's FTP server. There are over 8000 data items available for each geographic area. The full listing of these data items is available here as a downloadable compressed data base file named TABLES.ZIP. The uncompressed is in FoxPro data base file (dbf) format and may be imported to ACCESS, EXCEL, and other software formats. While all of this information is useful, the Office of Community Planning and Development has downloaded selected information for all states and areas and is making this information available on the CPD web pages. The tables and data items selected are those items used in the CDBG and HOME allocation formulas plus topics most pertinent to the Comprehensive Housing Affordability Strategy (CHAS), the Consolidated Plan, and similar overall economic and community development plans. The information is contained in five compressed (zipped) dbf tables for each state. When uncompressed the tables are ready for use with FoxPro and they can be imported into ACCESS, EXCEL, and other spreadsheet, GIS and database software. The data are at the block group summary level. The first two characters of the file name are the state abbreviation. The next two letters are BG for block group. Each record is labeled with the code and name of the city and county in which it is located so that the data can be summarized to higher-level geography. The last part of the file name describes the contents . The GEO file contains standard Census Bureau geographic identifiers for each block group, such as the metropolitan area code and congressional district code. The only data included in this table is total population and total housing units. POP1 and POP2 contain selected population variables and selected housing items are in the HU file. The MA05 table data is only for use by State CDBG grantees for the reporting of the racial composition of beneficiaries of Area Benefit activities. The complete package for a state consists of the dictionary file named TABLES, and the five data files for the state. The logical record number (LOGRECNO) links the records across tables.
d
Data from: A geographic history of human genetic ancestry
search.dataone.org
datadryad.org
Updated Mar 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michael Grundler; Jonathan Terhorst; Gideon Bradburd (2025). A geographic history of human genetic ancestry [Dataset]. http://doi.org/10.5061/dryad.p5hqbzkwz
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.p5hqbzkwz
Dataset updated
Mar 13, 2025
Dataset provided by
Dryad Digital Repository
Authors
Michael Grundler; Jonathan Terhorst; Gideon Bradburd
Description
Describing the distribution of genetic variation across individuals is a fundamental goal of population genetics. We present a method that capitalizes on the rich genealogical information encoded in genomic tree sequences to infer the geographic locations of the shared ancestors of a sample of sequenced individuals. We use this method to infer the geographic history of genetic ancestry of a set of human genomes sampled from Europe, Asia, and Africa, accurately recovering major population movements on those continents. Our findings demonstrate the importance of defining the spatial-temporal context of genetic ancestry to describing human genetic variation and caution against the oversimplified interpretations of genetic data prevalent in contemporary discussions of race and ancestry., , , # A geographic history of human genetic ancestry

https://doi.org/10.5061/dryad.p5hqbzkwz

Description of the data and file structure

Data and codes for testing and fitting GAIA

Files and variables

File: data.tar.gz

Description:Â A gzip compressed archive with the following contents:

The slimÂ directory contains simulation scripts and outputs for testing GAIA with the SLiM software in continuous and discrete geographic space. Each subdirectory corresponds to a different geographic space representation. The following subdirectories and files include:

continuous-space/Â : Simulations on an abstract continuous plane

uniform-landscape/Â : SLiM simulations with spatially homogeneous carrying capacity

gaussian-dispersal/Â : simulations with a Gaussian dispersal kernel

analysis/

performance.R, performance.shÂ : R script and shell script used to run GAIA on each of the simulation tree sequences ...,
g
Census of Population and Housing, 2000 [United States]: Summary File 4, West...
search.gesis.org
Updated Feb 8, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States Department of Commerce. Bureau of the Census (2021). Census of Population and Housing, 2000 [United States]: Summary File 4, West Virginia - Archival Version [Dataset]. http://doi.org/10.3886/ICPSR13560
Explore at:
Unique identifier
https://doi.org/10.3886/ICPSR13560
Dataset updated
Feb 8, 2021
Dataset provided by
ICPSR - Interuniversity Consortium for Political and Social Research
GESIS search
Authors
United States Department of Commerce. Bureau of the Census
License
https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de446517https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de446517
Area covered
West Virginia, United States
Description
Abstract (en): Summary File 4 (SF 4) from the United States 2000 Census contains the sample data, which is the information compiled from the questions asked of a sample of all people and housing units. Population items include basic population totals: urban and rural, households and families, marital status, grandparents as caregivers, language and ability to speak English, ancestry, place of birth, citizenship status, year of entry, migration, place of work, journey to work (commuting), school enrollment and educational attainment, veteran status, disability, employment status, industry, occupation, class of worker, income, and poverty status. Housing items include basic housing totals: urban and rural, number of rooms, number of bedrooms, year moved into unit, household size and occupants per room, units in structure, year structure built, heating fuel, telephone service, plumbing and kitchen facilities, vehicles available, value of home, monthly rent, and shelter costs. In Summary File 4, the sample data are presented in 213 population tables (matrices) and 110 housing tables, identified with "PCT" and "HCT" respectively. Each table is iterated for 336 population groups: the total population, 132 race groups, 78 American Indian and Alaska Native tribe categories (reflecting 39 individual tribes), 39 Hispanic or Latino groups, and 86 ancestry groups. The presentation of SF4 tables for any of the 336 population groups is subject to a population threshold. That is, if there are fewer than 100 people (100-percent count) in a specific population group in a specific geographic area, and there are fewer than 50 unweighted cases, their population and housing characteristics data are not available for that geographic area in SF4. For the ancestry iterations, only the 50 unweighted cases test can be performed. See Appendix H: Characteristic Iterations, for a complete list of characteristic iterations. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Created variable labels and/or value labels.. All persons in housing units in West Virginia in 2000. 2013-05-25 Multiple Census data file segments were repackaged for distribution into a single zip archive per dataset. No changes were made to the data or documentation.2006-01-12 All files were removed from dataset 342 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 341 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 340 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 339 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 338 and flagged as study-level files, so that they will accompany all downloads. Because of the number of files per state in Summary File 4, ICPSR has given each state its own ICPSR study number in the range ICPSR 13512-13563. The study number for the national file is 13570. Data for each state are being released as they become available.The data are provided in 38 segments (files) per iteration. These segments are PCT1-PCT4, PCT5-PCT16, PCT17-PCT34, PCT35-PCT37, PCT38-PCT45, PCT46-PCT49, PCT50-PCT61, PCT62-PCT67, PCT68-PCT71, PCT72-PCT76, PCT77-PCT78, PCT79-PCT81, PCT82-PCT84, PCT85-PCT86 (partial), PCT86 (partial), PCT87-PCT103, PCT104-PCT120, PCT121-PCT131, PCT132-PCT137, PCT138-PCT143, PCT144, PCT145-PCT150, PCT151-PCT156, PCT157-PCT162, PCT163-PCT208, PCT209-PCT213, HCT1-HCT9, HCT10-HCT18, HCT19-HCT22, HCT23-HCT25, HCT26-HCT29, HCT30-HCT39, HCT40-HCT55, HCT56-HCT61, HCT62-HCT70, HCT71-HCT81, HCT82-HCT86, and HCT87-HCT110. The iterations are Parts 1-336, the Geographic Header File is Part 337. The Geographic Header File is in fixed-format ASCII and the table files are in comma-delimited ASCII format. A merged iteration will have 7,963 variables.For Parts 251-336, the part names contain numbers within parentheses that refer to the Ancestry Code List (page G1 of the codebook).
f
Additional file 2: of Selecting SNPs informative for African, American...
springernature.figshare.com
xlsx
Updated Jun 4, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Robert Williams; Robert Elston; Pankaj Kumar; William Knowler; Hanna Abboud; Sharon Adler; Donald Bowden; Jasmin Divers; Barry Freedman; Robert Igo; Eli Ipp; Sudha Iyengar; Paul Kimmel; Michael Klag; Orly Kohn; Carl Langefeld; David Leehey; Robert Nelson; Susanne Nicholas; Madeleine Pahl; Rulan Parekh; Jerome Rotter; Jeffrey Schelling; John Sedor; Vallabh Shah; Michael Smith; Kent Taylor; Farook Thameem; Denyse Thornley-Brown; Cheryl Winkler; Xiuqing Guo; Phillip Zager; Robert Hanson (2023). Additional file 2: of Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND) [Dataset]. http://doi.org/10.6084/m9.figshare.c.3608633_D3.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.c.3608633_D3.v1
Dataset updated
Jun 4, 2023
Dataset provided by
figshare
Authors
Robert Williams; Robert Elston; Pankaj Kumar; William Knowler; Hanna Abboud; Sharon Adler; Donald Bowden; Jasmin Divers; Barry Freedman; Robert Igo; Eli Ipp; Sudha Iyengar; Paul Kimmel; Michael Klag; Orly Kohn; Carl Langefeld; David Leehey; Robert Nelson; Susanne Nicholas; Madeleine Pahl; Rulan Parekh; Jerome Rotter; Jeffrey Schelling; John Sedor; Vallabh Shah; Michael Smith; Kent Taylor; Farook Thameem; Denyse Thornley-Brown; Cheryl Winkler; Xiuqing Guo; Phillip Zager; Robert Hanson
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
RCWilliamsContrastEU_AF450AIMs. (XLSX 60 kb)
r
Persons
redivis.com
Updated Jan 10, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Persons [Dataset]. https://redivis.com/datasets/gsmz-24068kvny
Explore at:
Dataset updated
Jan 10, 2020
Dataset authored and provided by
Stanford Center for Population Health Sciences
Time period covered
1920
Description
This dataset includes all individuals from the 1920 US census.
r
Households
redivis.com
Updated Jan 10, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stanford Center for Population Health Sciences (2020). Households [Dataset]. https://redivis.com/datasets/gsmz-24068kvny
Explore at:
Dataset updated
Jan 10, 2020
Dataset authored and provided by
Stanford Center for Population Health Sciences
Time period covered
1920
Description
This dataset includes all households from the 1920 US census.
c
Genealogy Products and Services Market size will be USD 5,093.64 Million by...
cognitivemarketresearch.com
pdf,excel,csv,ppt
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cognitive Market Research (2025). Genealogy Products and Services Market size will be USD 5,093.64 Million by 2028! [Dataset]. https://www.cognitivemarketresearch.com/genealogy-products-and-services-market-report
Explore at:
pdf,excel,csv,pptAvailable download formats
Dataset updated
May 15, 2025
Dataset authored and provided by
Cognitive Market Research
License
https://www.cognitivemarketresearch.com/privacy-policyhttps://www.cognitivemarketresearch.com/privacy-policy
Time period covered
2021 - 2033
Area covered
Global
Description
As per Cognitive Market Research's latest published report, the Global Genealogy Products and Services Market size will be USD 5,093.64 Million by 2028. Genealogy Products and Services Industry's Compound Annual Growth Rate will be 7.97% from 2023 to 2030.

The North America Genealogy Products and Services market size will be USD 2,008.93 Million by 2028.

Market Dynamics of Genealogy Products and Services

Key Drivers for Genealogy Products and Services

Growing Interest in Ancestry and Family History: Rising consumer interest in personal heritage, cultural origins, and ethnic backgrounds is driving the demand for genealogy kits, online family tree services, and archival data platforms.

Advancements in DNA Testing Technologies: The development of cost-effective and precise DNA testing technologies has transformed genealogy, facilitating easier access for consumers to genetic information that enhances traditional family research.

Increased Digitalization of Historical Records: Governments, religious institutions, and private companies are digitizing essential records (birth, marriage, death, census), broadening access for genealogists and boosting subscriptions to genealogy services.

Key Restraints for Genealogy Products and Services

Concerns Regarding Privacy and Data Security: The act of sharing genetic and personal information on the internet presents significant privacy challenges, which may deter potential users due to fears of misuse, data breaches, or insufficient control over their personal data.

Limited Access to Records in Specific Regions: The presence of historical conflicts, inadequate recordkeeping, and disjointed archives in certain nations complicates the process of tracing lineage, thereby diminishing the effectiveness and attractiveness of services on a global scale.

Costs Associated with Subscriptions and Testing: Despite a reduction in prices, the comprehensive DNA kits and premium family history subscriptions continue to pose a financial obstacle for numerous users, particularly in developing economies.

Key Trends for Genealogy Products and Services

Integration of Artificial Intelligence for Record Matching: Companies are leveraging AI and machine learning technologies to identify patterns, propose familial connections, and automatically construct family trees, thereby improving user experience and the precision of research.

Collaborations with Health and Wellness Providers: Genealogy services are progressively forming partnerships with health platforms, providing users with insights into genetic predispositions, nutrition based on ancestry, and wellness recommendations.

Mobile Applications and Research Tools for On-the-Go: There is an increasing trend towards mobile-optimized platforms, allowing users to investigate family trees, upload documents, and engage with relatives directly from their smartphones. Introduction of Genealogy Products and Services

Genealogy is study of family and their history, tracing lineages, obtaining information about family, ancestors and it comprises DNA testing cemetery records, family tree creation, newspapers, online records, blogs, links that provides access to database for obtaining information about family members.

There are various institutions, advanced applications that are mobile based used for finding information about ancestors. The market is growing rapidly with adoption of emerging technologies that boost its growth in the market.

There is increasing technological advancement in the genealogical studies and its benefits in effectively find out information about ancestors has gained popularity across globe that drives the growth of genealogy products and service market.

For instance, there are various technological incorporation and ensure cost effective research that helps in tracing lineages, information about ancestors. The major companies are adopting DNA testing services and they merged genealogical research with genetic testing that helps in obtaining information about families. They have database, online records that has detailed information about ancestors. They use modern applications such as Ancestry, electronic database, blogs, that provide accurate database and genetic representation of family tree used in genetic services.

There are various benefits such as genealogical data provides medical history of...
g
Census of Population and Housing, 2000 [United States]: Summary File 4,...
search.gesis.org
Updated Feb 26, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States Department of Commerce. Bureau of the Census (2021). Census of Population and Housing, 2000 [United States]: Summary File 4, District of Columbia - Version 1 [Dataset]. http://doi.org/10.3886/ICPSR13520.v1
Explore at:
Unique identifier
https://doi.org/10.3886/ICPSR13520.v1
Dataset updated
Feb 26, 2021
Dataset provided by
ICPSR - Interuniversity Consortium for Political and Social Research
GESIS search
Authors
United States Department of Commerce. Bureau of the Census
License
https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de457436https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de457436
Area covered
Washington, United States
Description
Abstract (en): Summary File 4 (SF 4) from the United States 2000 Census contains the sample data, which is the information compiled from the questions asked of a sample of all people and housing units. Population items include basic population totals: urban and rural, households and families, marital status, grandparents as caregivers, language and ability to speak English, ancestry, place of birth, citizenship status, year of entry, migration, place of work, journey to work (commuting), school enrollment and educational attainment, veteran status, disability, employment status, industry, occupation, class of worker, income, and poverty status. Housing items include basic housing totals: urban and rural, number of rooms, number of bedrooms, year moved into unit, household size and occupants per room, units in structure, year structure built, heating fuel, telephone service, plumbing and kitchen facilities, vehicles available, value of home, monthly rent, and shelter costs. In Summary File 4, the sample data are presented in 213 population tables (matrices) and 110 housing tables, identified with "PCT" and "HCT" respectively. Each table is iterated for 336 population groups: the total population, 132 race groups, 78 American Indian and Alaska Native tribe categories (reflecting 39 individual tribes), 39 Hispanic or Latino groups, and 86 ancestry groups. The presentation of SF4 tables for any of the 336 population groups is subject to a population threshold. That is, if there are fewer than 100 people (100-percent count) in a specific population group in a specific geographic area, and there are fewer than 50 unweighted cases, their population and housing characteristics data are not available for that geographic area in SF4. For the ancestry iterations, only the 50 unweighted cases test can be performed. See Appendix H: Characteristic Iterations, for a complete list of characteristic iterations. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Created variable labels and/or value labels.. All persons in housing units in the District of Columbia in 2000. 2013-05-25 Multiple Census data file segments were repackaged for distribution into a single zip archive per dataset. No changes were made to the data or documentation.2006-01-12 All files were removed from dataset 342 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 341 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 340 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 339 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 338 and flagged as study-level files, so that they will accompany all downloads. Because of the number of files per state in Summary File 4, ICPSR has given each state its own ICPSR study number in the range ICPSR 13512-13563. The study number for the national file is 13570. Data for each state are being released as they become available.The data are provided in 38 segments (files) per iteration. These segments are PCT1-PCT4, PCT5-PCT16, PCT17-PCT34, PCT35-PCT37, PCT38-PCT45, PCT46-PCT49, PCT50-PCT61, PCT62-PCT67, PCT68-PCT71, PCT72-PCT76, PCT77-PCT78, PCT79-PCT81, PCT82-PCT84, PCT85-PCT86 (partial), PCT86 (partial), PCT87-PCT103, PCT104-PCT120, PCT121-PCT131, PCT132-PCT137, PCT138-PCT143, PCT144, PCT145-PCT150, PCT151-PCT156, PCT157-PCT162, PCT163-PCT208, PCT209-PCT213, HCT1-HCT9, HCT10-HCT18, HCT19-HCT22, HCT23-HCT25, HCT26-HCT29, HCT30-HCT39, HCT40-HCT55, HCT56-HCT61, HCT62-HCT70, HCT71-HCT81, HCT82-HCT86, and HCT87-HCT110. The iterations are Parts 1-336, the Geographic Header File is Part 337. The Geographic Header File is in fixed-format ASCII and the table files are in comma-delimited ASCII format. A merged iteration will have 7,963 variables.For Parts 251-336, the part names contain numbers within parentheses that refer to the Ancestry Code List (page G1 of the codebook).
M
Profile of Selected Social Characteristics for Census Tracts: 2000
gisdata.mn.gov
fgdb, html, shp
Updated Jul 9, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Metropolitan Council (2020). Profile of Selected Social Characteristics for Census Tracts: 2000 [Dataset]. https://gisdata.mn.gov/dataset/us-mn-state-metc-society-census-soclchar-trct2000
Explore at:
html, shp, fgdbAvailable download formats
Dataset updated
Jul 9, 2020
Dataset provided by
Metropolitan Council
Description
Summary File 3 Data Profile 2 (SF3 Table DP-2) for Census Tracts in Minneapolis-St. Paul 7 County metropolitan area is a subset of the profile of selected social characteristics for 2000 prepared by the U.S. Census Bureau.

This table (DP-2) includes: School Enrollment, Educational Attainment, Marital Status, Grandparents as Caregivers, Veteran Status, Disability Status of the Civilian Noninstitutionalized Population, Residence in 1995, Nativity and Place of Birth, Region of Birth of Foreign Born, Language Spoken At Home, Ancestry

US Census 2000 Demographic Profiles: 100-percent and Sample Data

The profile includes four tables (DP-1 thru DP-4) that provide various demographic, social, economic, and housing characteristics for the United States, states, counties, minor civil divisions in selected states, places, metropolitan areas, American Indian and Alaska Native areas, Hawaiian home lands and congressional districts (106th Congress). It includes 100-percent and sample data from Census 2000. The DP-1 table is available as part of the Summary File 1 (SF 1) dataset, and the other three tables are available as part of the Summary File 3 (SF 3) dataset.

The US Census provides DP-1 thru DP-4 data at the Census tract level through their DataFinder search engine. However, since the Metropolitan Council and MetroGIS participants are interested in all Census tracts within the seven county metropolitan area, it was quicker to take the raw Census SF-1 and SF-3 data at tract levels and recreate the DP1-4 variables using the appropriate formula for each DP variable. This file lists the formulas used to create the DP variables.
N
Greensboro, NC Hispanic or Latino Population Distribution by Ancestries...
neilsberg.com
csv, json
Updated Feb 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Greensboro, NC Hispanic or Latino Population Distribution by Ancestries Dataset : Detailed Breakdown of Hispanic or Latino Origins // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/greensboro-nc-population-by-race/
Explore at:
json, csvAvailable download formats
Dataset updated
Feb 21, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
North Carolina, Greensboro
Variables measured
Hispanic or Latino population with Cuban ancestry, Hispanic or Latino population with Mexican ancestry, Hispanic or Latino population with Puerto Rican ancestry, Hispanic or Latino population with Other Hispanic or Latino ancestry, Hispanic or Latino population with Cuban ancestry as Percent of Total Hispanic Population, Hispanic or Latino population with Mexican ancestry as Percent of Total Hispanic Population, Hispanic or Latino population with Puerto Rican ancestry as Percent of Total Hispanic Population, Hispanic or Latino population with Other Hispanic or Latino ancestry as Percent of Total Hispanic Population
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) Origin / Ancestry for Hispanic population and (b) respective population as a percentage of the total Hispanic population, we initially analyzed and categorized the data for each of the ancestries across the Hispanic or Latino population. It is ensured that the population estimates used in this dataset pertain exclusively to ancestries for the Hispanic or Latino population. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Greensboro Hispanic or Latino population. It includes the distribution of the Hispanic or Latino population, of Greensboro, by their ancestries, as identified by the Census Bureau. The dataset can be utilized to understand the origin of the Hispanic or Latino population of Greensboro.

Key observations

Among the Hispanic population in Greensboro, regardless of the race, the largest group is of Mexican origin, with a population of 13,381 (42.88% of the total Hispanic population).

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Origin for Hispanic or Latino population include:

Mexican

Puerto Rican

Cuban

Other Hispanic or Latino

Variables / Data Columns

Origin: This column displays the origin for Hispanic or Latino population for the Greensboro

Population: The population of the specific origin for Hispanic or Latino population in the Greensboro is shown in this column.

% of Total Hispanic Population: This column displays the percentage distribution of each Hispanic origin as a proportion of Greensboro total Hispanic or Latino population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Greensboro Population by Race & Ethnicity. You can refer the same here
d
Data from: Genome-wide local ancestry and the functional consequences of...
search.dataone.org
datadryad.org
Updated Nov 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gillian McHugo; James Ward; Said Ismael Ngâ€™angâ€™a; Laurent Frantz; Daniel Bradley; Michael Salter-Townshend; Emmeline Hill; Grace O'Gorman; Kieran Meade; Thomas Hall; David E. MacHugh (2024). Genome-wide local ancestry and the functional consequences of admixture in African and European cattle populations [Dataset]. http://doi.org/10.5061/dryad.w3r22810n
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.w3r22810n
Dataset updated
Nov 27, 2024
Dataset provided by
Dryad Digital Repository
Authors
Gillian McHugo; James Ward; Said Ismael Ngâ€™angâ€™a; Laurent Frantz; Daniel Bradley; Michael Salter-Townshend; Emmeline Hill; Grace O'Gorman; Kieran Meade; Thomas Hall; David E. MacHugh
Description
Bos taurus (taurine) and Bos indicus (indicine) cattle diverged at least 150,000 years ago and, since that time, substantial genomic differences have evolved between the two lineages. During the last two millennia, genetic exchange in Africa has resulted in a complex mosaic of taurine-indicine ancestry, with most cattle populations exhibiting varying levels of admixture. Similarly, there are several Southern European cattle populations that also show evidence for historical gene flow from indicine cattle, the highest levels of which are found in the Central Italian White cattle populations. Here we use two different software tools (MOSAIC and ELAI) for local ancestry inference (LAI) with genome-wide high- and low-density SNP array data sets in hybrid African and Italian cattle populations and obtained broadly similar results despite critical differences in the two LAI methodologies used. Our analyses identified genomic regions with elevated levels of retained or introgressed ancestry fr..., IlluminaÂ® BovineHD 777K BeadChip SNP data sets generated for 39 African cattle: 8 Boran, 8 Nâ€™Dama, and 23 Somba.The Boran and Nâ€™Dama data were obtained using cattle DNA samples from a trypanosome challenge time-course experiment (O'Gorman et al., 2009) and were generated by Neogen Europe (Ayr, Scotland) using standard procedures for IlluminaÂ® SNP array genotyping.The Somba data were obtained using DNA samples previously published as part of a microsatellite-based survey of cattle genetic diversity (Freeman et al., 2004) and were generated by Weatherbys Scientific (Naas, Ireland) also using standard procedures. The samples were prepared using PLINK (v. 1.90 beta 6.25). The data are provided in the following two files:Â - bora-ndam-somb.pedÂ - bora-ndam-somb.map The data are in PLINK ped and map format, see:Â - https://www.cog-genomics.org/plink/1.9/formats#pedÂ - [https://www.cog-genomics.org/plink/1.9/formats#map](https://www.cog-genomic..., , # IlluminaÂ® BovineHD 777K BeadChip SNP data sets generated for 39 African cattle: 8 Boran, 8 Nâ€™Dama, and 23 Somba.

https://doi.org/10.5061/dryad.w3r22810n

IlluminaÂ® BovineHD 777K BeadChip SNP data sets generated for 39 African cattle: 8 Boran, 8 Nâ€™Dama, and 23 Somba.

The Boran and Nâ€™Dama data were obtained using cattle DNA samples from a trypanosome challenge time-course experiment (O'Gorman et al., 2009) and were generated by Neogen Europe (Ayr, Scotland) using standard procedures for IlluminaÂ® SNP array genotyping.

The Somba data were obtained using DNA samples previously published as part of a microsatellite-based survey of cattle genetic diversity (Freeman et al., 2004) and were generated by Weatherbys Scientific (Naas, Ireland) also using standard procedures.

Description of the data and file structure

The data are provided in the following two files:

- bora-ndam-somb.ped

- bora-ndam-somb.map

The data are in PLINK ped and ma...
S
2023 Census totals by topic for families and extended families by...
datafinder.stats.govt.nz
csv, dwg, geodatabase +6
Updated Nov 24, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stats NZ (2024). 2023 Census totals by topic for families and extended families by statistical area 2 [Dataset]. https://datafinder.stats.govt.nz/layer/120891-2023-census-totals-by-topic-for-families-and-extended-families-by-statistical-area-2/
Explore at:
mapinfo tab, geopackage / sqlite, shapefile, kml, csv, geodatabase, pdf, mapinfo mif, dwgAvailable download formats
Dataset updated
Nov 24, 2024
Dataset provided by
Statistics New Zealandhttp://www.stats.govt.nz/
Authors
Stats NZ
License
https://datafinder.stats.govt.nz/license/attribution-4-0-international/https://datafinder.stats.govt.nz/license/attribution-4-0-international/
Area covered
Description
Dataset contains counts and measures for families and extended families from the 2013, 2018, and 2023 Censuses. Data is available by statistical area 2.

The variables included in this dataset are for families and extended families in households in occupied private dwellings:

Count of families

Family type

Number of people in family

Average number of people in family

Total family income

Median ($) total family income

Count of extended families

Extended family type

Total extended family income

Median ($) total extended family income.

Download lookup file from Stats NZ ArcGIS Online or embedded attachment in Stats NZ geographic data service. Download data table (excluding the geometry column for CSV files) using the instructions in the Koordinates help guide.

Footnotes

Geographical boundaries

Statistical standard for geographic areas 2023 (updated December 2023) has information about geographic boundaries as of 1 January 2023. Address data from 2013 and 2018 Censuses was updated to be consistent with the 2023 areas. Due to the changes in area boundaries and coding methodologies, 2013 and 2018 counts published in 2023 may be slightly different to those published in 2013 or 2018.

Caution using time series

Time series data should be interpreted with care due to changes in census methodology and differences in response rates between censuses. The 2023 and 2018 Censuses used a combined census methodology (using census responses and administrative data), while the 2013 Census used a full-field enumeration methodology (with no use of administrative data).

About the 2023 Census dataset

For information on the 2023 dataset see Using a combined census model for the 2023 Census. We combined data from the census forms with administrative data to create the 2023 Census dataset, which meets Stats NZ's quality criteria for population structure information. We added real data about real people to the dataset where we were confident the people who hadn’t completed a census form (which is known as admin enumeration) will be counted. We also used data from the 2018 and 2013 Censuses, administrative data sources, and statistical imputation methods to fill in some missing characteristics of people and dwellings.

Data quality

The quality of data in the 2023 Census is assessed using the quality rating scale and the quality assurance framework to determine whether data is fit for purpose and suitable for release. Data quality assurance in the 2023 Census has more information.

Concept descriptions and quality ratings

Data quality ratings for 2023 Census variables has additional details about variables found within totals by topic, for example, definitions and data quality.

Using data for good

Stats NZ expects that, when working with census data, it is done so with a positive purpose, as outlined in the Māori Data Governance Model (Data Iwi Leaders Group, 2023). This model states that "data should support transformative outcomes and should uplift and strengthen our relationships with each other and with our environments. The avoidance of harm is the minimum expectation for data use. Māori data should also contribute to iwi and hapū tino rangatiratanga”.

Confidentiality

The 2023 Census confidentiality rules have been applied to 2013, 2018, and 2023 data. These rules protect the confidentiality of individuals, families, households, dwellings, and undertakings in 2023 Census data. Counts are calculated using fixed random rounding to base 3 (FRR3) and suppression of ‘sensitive’ counts less than six, where tables report multiple geographic variables and/or small populations. Individual figures may not always sum to stated totals. Applying confidentiality rules to 2023 Census data and summary of changes since 2018 and 2013 Censuses has more information about 2023 Census confidentiality rules.

Measures

Measures like averages, medians, and other quantiles are calculated from unrounded counts, with input noise added to or subtracted from each contributing value during measures calculations. Averages and medians based on less than six units (e.g. individuals, dwellings, households, families, or extended families) are suppressed. This suppression threshold changes for other quantiles. Where the cells have been suppressed, a placeholder value has been used.

Percentages

To calculate percentages, divide the figure for the category of interest by the figure for 'Total stated' where this applies.

Symbol

-997 Not available

-999 Confidential

Inconsistencies in definitions

Please note that there may be differences in definitions between census classifications and those used for other data collections.
f
A Genome-Wide Search for Greek and Jewish Admixture in the Kashmiri...
plos.figshare.com
omicsdi.org
docx
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jonathan M. Downie; Tsewang Tashi; Felipe Ramos Lorenzo; Julie Ellen Feusier; Hyder Mir; Josef T. Prchal; Lynn B. Jorde; Parvaiz A. Koul (2023). A Genome-Wide Search for Greek and Jewish Admixture in the Kashmiri Population [Dataset]. http://doi.org/10.1371/journal.pone.0160614
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0160614
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS ONE
Authors
Jonathan M. Downie; Tsewang Tashi; Felipe Ramos Lorenzo; Julie Ellen Feusier; Hyder Mir; Josef T. Prchal; Lynn B. Jorde; Parvaiz A. Koul
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Kashmiri population is an ethno-linguistic group that resides in the Kashmir Valley in northern India. A longstanding hypothesis is that this population derives ancestry from Jewish and/or Greek sources. There is historical and archaeological evidence of ancient Greek presence in India and Kashmir. Further, some historical accounts suggest ancient Hebrew ancestry as well. To date, it has not been determined whether signatures of Greek or Jewish admixture can be detected in the Kashmiri population. Using genome-wide genotyping and admixture detection methods, we determined there are no significant or substantial signs of Greek or Jewish admixture in modern-day Kashmiris. The ancestry of Kashmiri Tibetans was also determined, which showed signs of admixture with populations from northern India and west Eurasia. These results contribute to our understanding of the existing population structure in northern India and its surrounding geographical areas.
g
Census of Population, 1880 [United States]: Public Use Sample (1 in 1000...
search.gesis.org
Updated Feb 1, 2001
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GESIS search (2001). Census of Population, 1880 [United States]: Public Use Sample (1 in 1000 Preliminary Subsample) - Archival Version [Dataset]. http://doi.org/10.3886/ICPSR09474
Explore at:
Unique identifier
https://doi.org/10.3886/ICPSR09474
Dataset updated
Feb 1, 2001
Dataset provided by
ICPSR - Interuniversity Consortium for Political and Social Research
GESIS search
License
https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de445119https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de445119
Area covered
United States
Description
Abstract (en): This data collection provides a preliminary subsample of the 1880 Public Use Sample drawn from census enumeration forms. The file contains two types of records: family and person. Each household record is followed by a record for each person in the family. This collection contains information about size of family, number of persons and families in dwelling, and geographic location of each household. Information on individuals includes demographic characteristics, civil condition, occupation, health, education, and nativity. Manuscript census records from 1880 for the 38 United States, the District of Columbia, and the Dakota Territory. This collection is a nationally representative--although clustered--1 in 1000 preliminary subsample of the United States population in 1880. The subsample is based on every tenth microfilm reel of enumeration forms (there are a total of 1,454 reels) and, within each reel, on the census page itself. In terms of the Public Use Sample as a whole, a sample density of 1 person per 100 was chosen so that a single sample point was randomly generated for every two census pages. Sample points were chosen for inclusion in the collection only if the individual selected was the first person listed in the dwelling. Under this procedure each dwelling, family, and individual in the population had a 1 in 100 probability of inclusion in the Public Use Sample. The complete sample, which will be released by the principal investigators in December 1993, will contain approximately 500,000 individuals living in 100,000 families, or 1 percent of the United States population in 1880. Funding insitution(s): United States Department of Health and Human Services. National Institutes of Health (HD25839). (1) This dataset has two levels. The first level ("F" Record Type) contains 29 variables for each of 10,126 families. The second level ("P" Record Type) contains 45 variables for each of 48,786 individuals residing in those families. (2) The data contain blanks and alphabetic characters. (3) Users will note some differences in code frequencies between certain variables in this collection and the totals listed in the documentation. (4) This collection is superseded by CENSUS OF POPULATION, 1880 [UNITED STATES]: PUBLIC USE SAMPLE (ICPSR 6460).
Census Data for 2000 from Geolytics
search.dataone.org
Updated Oct 14, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne (2013). Census Data for 2000 from Geolytics [Dataset]. https://search.dataone.org/view/knb-lter-bes.23.570
Explore at:
Dataset updated
Oct 14, 2013
Dataset provided by
Long Term Ecological Research Networkhttp://www.lternet.edu/
Authors
Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne
Time period covered
Jan 1, 2004 - Nov 17, 2011
Area covered

Description
Geolytics Census 2000 Long Form dataset. The Geolytics Census 2000 Long Form is a comprehensive source of detailed information about the people, housing, and economy of the United States. The Census 2000 Long Form offers the entire US Census Bureau's SF3 dataset. This dataset contains variables such as income, housing, employment, language spoken, ancestry, education, poverty, rent, mortgage, commute to work, etc. There are 5,500 variables at the Block Group level. A select portion of the Geolytics Census data was joined to GDT spatial data by block group and some census attributes were aggregated. See the attached txt file for a description of the attributes. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.
Census of Population and Housing, 2000 [United States]: Summary File 4, Iowa...
search.gesis.org
Updated Feb 16, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States Department of Commerce. Bureau of the Census (2021). Census of Population and Housing, 2000 [United States]: Summary File 4, Iowa - Version 1 [Dataset]. http://doi.org/10.3886/ICPSR13527.v1
Explore at:
Unique identifier
https://doi.org/10.3886/ICPSR13527.v1
Dataset updated
Feb 16, 2021
Dataset provided by
Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
GESIS search
Authors
United States Department of Commerce. Bureau of the Census
License
https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de457443https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de457443
Area covered
Iowa, United States
Description
Abstract (en): Summary File 4 (SF 4) from the United States 2000 Census contains the sample data, which is the information compiled from the questions asked of a sample of all people and housing units. Population items include basic population totals: urban and rural, households and families, marital status, grandparents as caregivers, language and ability to speak English, ancestry, place of birth, citizenship status, year of entry, migration, place of work, journey to work (commuting), school enrollment and educational attainment, veteran status, disability, employment status, industry, occupation, class of worker, income, and poverty status. Housing items include basic housing totals: urban and rural, number of rooms, number of bedrooms, year moved into unit, household size and occupants per room, units in structure, year structure built, heating fuel, telephone service, plumbing and kitchen facilities, vehicles available, value of home, monthly rent, and shelter costs. In Summary File 4, the sample data are presented in 213 population tables (matrices) and 110 housing tables, identified with "PCT" and "HCT" respectively. Each table is iterated for 336 population groups: the total population, 132 race groups, 78 American Indian and Alaska Native tribe categories (reflecting 39 individual tribes), 39 Hispanic or Latino groups, and 86 ancestry groups. The presentation of SF4 tables for any of the 336 population groups is subject to a population threshold. That is, if there are fewer than 100 people (100-percent count) in a specific population group in a specific geographic area, and there are fewer than 50 unweighted cases, their population and housing characteristics data are not available for that geographic area in SF4. For the ancestry iterations, only the 50 unweighted cases test can be performed. See Appendix H: Characteristic Iterations, for a complete list of characteristic iterations. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Created variable labels and/or value labels.. All persons in housing units in Iowa in 2000. 2013-05-25 Multiple Census data file segments were repackaged for distribution into a single zip archive per dataset. No changes were made to the data or documentation.2006-01-12 All files were removed from dataset 342 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 341 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 340 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 339 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 338 and flagged as study-level files, so that they will accompany all downloads. Because of the number of files per state in Summary File 4, ICPSR has given each state its own ICPSR study number in the range ICPSR 13512-13563. The study number for the national file is 13570. Data for each state are being released as they become available.The data are provided in 38 segments (files) per iteration. These segments are PCT1-PCT4, PCT5-PCT16, PCT17-PCT34, PCT35-PCT37, PCT38-PCT45, PCT46-PCT49, PCT50-PCT61, PCT62-PCT67, PCT68-PCT71, PCT72-PCT76, PCT77-PCT78, PCT79-PCT81, PCT82-PCT84, PCT85-PCT86 (partial), PCT86 (partial), PCT87-PCT103, PCT104-PCT120, PCT121-PCT131, PCT132-PCT137, PCT138-PCT143, PCT144, PCT145-PCT150, PCT151-PCT156, PCT157-PCT162, PCT163-PCT208, PCT209-PCT213, HCT1-HCT9, HCT10-HCT18, HCT19-HCT22, HCT23-HCT25, HCT26-HCT29, HCT30-HCT39, HCT40-HCT55, HCT56-HCT61, HCT62-HCT70, HCT71-HCT81, HCT82-HCT86, and HCT87-HCT110. The iterations are Parts 1-336, the Geographic Header File is Part 337. The Geographic Header File is in fixed-format ASCII and the table files are in comma-delimited ASCII format. A merged iteration will have 7,963 variables.For Parts 251-336, the part names contain numbers within parentheses that refer to the Ancestry Code List (page G1 of the codebook).

Facebook

Twitter

Click to copy link

Link copied

Cite

Stanford Center for Population Health Sciences (2020). Historic US census - 1930 [Dataset]. http://doi.org/10.57761/6e5q-rh85

Historic US census - 1930

Explore at:

application/jsonl, parquet, spss, csv, arrow, stata, avro, sasAvailable download formats

Unique identifier

https://doi.org/10.57761/6e5q-rh85

Dataset updated

Jan 10, 2020

Dataset provided by

Redivis Inc.

Authors

Stanford Center for Population Health Sciences

Time period covered

Jan 1, 1930 - Dec 31, 1930

Area covered

United States

Description

Abstract

The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—and provides the largest and richest source of individual level and household data.

Before Manuscript Submission

All manuscripts (and other items you'd like to publish) must be submitted to

phsdatacore@stanford.edu for approval prior to journal submission.

We will check your cell sizes and citations.

For more information about how to cite PHS and PHS datasets, please visit:

https:/phsdocs.developerhub.io/need-help/citing-phs-data-core

Documentation

This dataset was created on 2020-01-10 22:52:11.461 by merging multiple datasets together. The source datasets for this version were:

IPUMS 1930 households: This dataset includes all households from the 1930 US census.

IPUMS 1930 persons: This dataset includes all individuals from the 1930 US census.

IPUMS 1930 Lookup: This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1930 datasets.

Section 2

Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.

In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier. In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.

The historic US 1930 census data was collected in April 1930. Enumerators collected data traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data was collected. Household members absent on the day of data collected were either listed to the household with the help of other household members or were scheduled for the last census subdivision.

Notes

We provide IPUMS household and person data separately so that it is convenient to explore the descriptive statistics on each level. In order to obtain a full dataset, merge the household and person on the variables SERIAL and SERIALP. In order to create a longitudinal dataset, merge datasets on the variable HISTID.
Households with more than 60 people in the original data were broken up for processing purposes. Every person in the large households are considered to be in their own household. The original large households can be identified using the variable SPLIT, reconstructed using the variable SPLITHID, and the original count is found in the variable SPLITNUM.
Coded variables derived from string variables are still in progress. These variables include: occupation and industry.
Missing observations have been allocated and some inconsistencies have been edited for the following variables: SPEAKENG, YRIMMIG, CITIZEN, AGEMARR, AGE, BPL, MBPL, FBPL, LIT, SCHOOL, OWNERSHP, FARM, EMPSTAT, OCC1950, IND1950, MTONGUE, MARST, RACE, SEX, RELATE, CLASSWKR. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.
Most inconsistent information was not edite

Clear search

Close search

Google apps

Main menu

Historic US census - 1930

Abstract

Before Manuscript Submission

Documentation

Section 2

Ireland Census

Lookup

POPMAPS: An R package to estimate ancestry probability surfaces

Census Data

Data from: A geographic history of human genetic ancestry

Description of the data and file structure

Files and variables

File: data.tar.gz

Census of Population and Housing, 2000 [United States]: Summary File 4, West...

Additional file 2: of Selecting SNPs informative for African, American...

Persons

Households

Genealogy Products and Services Market size will be USD 5,093.64 Million by...

Census of Population and Housing, 2000 [United States]: Summary File 4,...

Profile of Selected Social Characteristics for Census Tracts: 2000

Greensboro, NC Hispanic or Latino Population Distribution by Ancestries...

About this dataset

Content

Inspiration

Recommended for further research

Data from: Genome-wide local ancestry and the functional consequences of...

Description of the data and file structure

2023 Census totals by topic for families and extended families by...

A Genome-Wide Search for Greek and Jewish Admixture in the Kashmiri...

Census of Population, 1880 [United States]: Public Use Sample (1 in 1000...

Census Data for 2000 from Geolytics

Census of Population and Housing, 2000 [United States]: Summary File 4, Iowa...

Historic US census - 1930See More Versions

Abstract

Before Manuscript Submission

Documentation

Section 2

Historic US census - 1930