100+ datasets found
  1. census-bureau-international

    • kaggle.com
    zip
    Updated May 6, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google BigQuery (2020). census-bureau-international [Dataset]. https://www.kaggle.com/datasets/bigquery/census-bureau-international
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    May 6, 2020
    Dataset provided by
    BigQueryhttps://cloud.google.com/bigquery
    Authors
    Google BigQuery
    Description

    Context

    The United States Census Bureau’s international dataset provides estimates of country populations since 1950 and projections through 2050. Specifically, the dataset includes midyear population figures broken down by age and gender assignment at birth. Additionally, time-series data is provided for attributes including fertility rates, birth rates, death rates, and migration rates.

    Querying BigQuery tables

    You can use the BigQuery Python client library to query tables in this dataset in Kernels. Note that methods available in Kernels are limited to querying data. Tables are at bigquery-public-data.census_bureau_international.

    Sample Query 1

    What countries have the longest life expectancy? In this query, 2016 census information is retrieved by joining the mortality_life_expectancy and country_names_area tables for countries larger than 25,000 km2. Without the size constraint, Monaco is the top result with an average life expectancy of over 89 years!

    standardSQL

    SELECT age.country_name, age.life_expectancy, size.country_area FROM ( SELECT country_name, life_expectancy FROM bigquery-public-data.census_bureau_international.mortality_life_expectancy WHERE year = 2016) age INNER JOIN ( SELECT country_name, country_area FROM bigquery-public-data.census_bureau_international.country_names_area where country_area > 25000) size ON age.country_name = size.country_name ORDER BY 2 DESC /* Limit removed for Data Studio Visualization */ LIMIT 10

    Sample Query 2

    Which countries have the largest proportion of their population under 25? Over 40% of the world’s population is under 25 and greater than 50% of the world’s population is under 30! This query retrieves the countries with the largest proportion of young people by joining the age-specific population table with the midyear (total) population table.

    standardSQL

    SELECT age.country_name, SUM(age.population) AS under_25, pop.midyear_population AS total, ROUND((SUM(age.population) / pop.midyear_population) * 100,2) AS pct_under_25 FROM ( SELECT country_name, population, country_code FROM bigquery-public-data.census_bureau_international.midyear_population_agespecific WHERE year =2017 AND age < 25) age INNER JOIN ( SELECT midyear_population, country_code FROM bigquery-public-data.census_bureau_international.midyear_population WHERE year = 2017) pop ON age.country_code = pop.country_code GROUP BY 1, 3 ORDER BY 4 DESC /* Remove limit for visualization*/ LIMIT 10

    Sample Query 3

    The International Census dataset contains growth information in the form of birth rates, death rates, and migration rates. Net migration is the net number of migrants per 1,000 population, an important component of total population and one that often drives the work of the United Nations Refugee Agency. This query joins the growth rate table with the area table to retrieve 2017 data for countries greater than 500 km2.

    SELECT growth.country_name, growth.net_migration, CAST(area.country_area AS INT64) AS country_area FROM ( SELECT country_name, net_migration, country_code FROM bigquery-public-data.census_bureau_international.birth_death_growth_rates WHERE year = 2017) growth INNER JOIN ( SELECT country_area, country_code FROM bigquery-public-data.census_bureau_international.country_names_area

    Update frequency

    Historic (none)

    Dataset source

    United States Census Bureau

    Terms of use: This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

    See the GCP Marketplace listing for more details and sample queries: https://console.cloud.google.com/marketplace/details/united-states-census-bureau/international-census-data

  2. United States Census Data, 1900: Public Use Sample

    • icpsr.umich.edu
    • archive.ciser.cornell.edu
    ascii
    Updated May 11, 1992
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Preston, Samuel H.; Higgs, Robert L. (1992). United States Census Data, 1900: Public Use Sample [Dataset]. http://doi.org/10.3886/ICPSR07825.v1
    Explore at:
    asciiAvailable download formats
    Dataset updated
    May 11, 1992
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    Preston, Samuel H.; Higgs, Robert L.
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/7825/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/7825/terms

    Time period covered
    1900
    Area covered
    United States
    Description

    This study was conducted under the auspices of the Center for Studies in Demography and Ecology at the University of Washington. It is a nationally representative sample of the population of the United States in 1900, drawn from the manuscript returns of individuals enumerated in the 1900 United States Census. Household variables include region, state and county of household, size of household, and type and ownership of dwelling. Individual variables for each household member include relationship to head of household, race, sex, age, marital status, number of children, and birthplace. Immigration variables include parents' birthplace, year of immigration and number of years in the United States. Occupation variables include occupation, coded by both the 1900 and 1950 systems, and number of months unemployed. Education variables include number of months in school, whether respondents could read or write a language, and whether they spoke English.

  3. A

    2020 Census for Boston

    • data.boston.gov
    csv, pdf
    Updated Sep 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Planning Department (2023). 2020 Census for Boston [Dataset]. https://data.boston.gov/dataset/2020-census-for-boston
    Explore at:
    pdf(713107), csv(34556), csv(4944), csv(94470), csv(34702)Available download formats
    Dataset updated
    Sep 8, 2023
    Dataset authored and provided by
    Planning Department
    Area covered
    Boston
    Description

    2020 Census data for the city of Boston, Boston neighborhoods, census tracts, block groups, and voting districts. In the 2020 Census, the U.S. Census Bureau divided Boston into 207 census tracts (~4,000 residents) made up of 581 smaller block groups. The Boston Planning and Development Agency uses the 2020 tracts to approximate Boston neighborhoods. The 2020 Census Redistricting data also identify Boston’s voting districts.

    For analysis of Boston’s 2020 Census data including graphs and maps by the BPDA Research Division and Office of Digital Cartography and GIS, see 2020 Census Research Publications

    For a complete official data dictionary, please go to 2020 Census State Redistricting Data (Public Law 94-171) Summary File, Chapter 6. Data Dictionary. 2020 Census State Redistricting Data (Public Law 94-171) Summary File

    2020 Census Tracts In Boston

    2020 Census Block Groups In Boston

    Boston Neighborhood Boundaries Approximated By 2020 Census Tracts

    Boston Voting District Boundaries

  4. D

    2020 Public Use Microdata Areas (PUMAs); PA, NJ, DE & MD

    • catalog.dvrpc.org
    • hub.arcgis.com
    api, geojson, html +1
    Updated Aug 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DVRPC (2025). 2020 Public Use Microdata Areas (PUMAs); PA, NJ, DE & MD [Dataset]. https://catalog.dvrpc.org/dataset/2020-public-use-microdata-areas-pumas-pa-nj-de-md
    Explore at:
    geojson, html, xml, apiAvailable download formats
    Dataset updated
    Aug 28, 2025
    Dataset authored and provided by
    DVRPC
    Area covered
    New Jersey, Pennsylvania
    Description

    USE geoid TO JOIN DATA DOWNLOADED FROM DATA.CENSUS.GOV The TIGER/Line Shapefiles are extracts of selected geographic and cartographic information from the Census Bureau's Master Address File (MAF)/Topologically Integrated Geographic Encoding and Referencing (TIGER) System (MTS). The TIGER/Line Shapefiles contain a standard geographic identifier (GEOID) for each entity that links to the GEOID in the data from censuses and surveys. The TIGER/Line Shapefiles do not include demographic data from surveys and censuses (e.g., Decennial Census, Economic Census, American Community Survey, and the Population Estimates Program). Other, non-census, data often have this standard geographic identifier as well. Data from many of the Census Bureau’s surveys and censuses, including the geographic codes needed to join to the TIGER/Line Shapefiles, are available at the Census Bureau’s public data dissemination website (https://data.census.gov/). Public Use Microdata Areas (PUMAs) are statistical geographic areas for the dissemination of decennial census and American Community Survey (ACS) Public Use Microdata Sample files in which the Census Bureau provides selected extracts of raw data from a small sample of census records that are screened to protect confidentiality. The ACS also uses the PUMAs as a tabulation geographic entity. For the 2020 Census, the State Data Centers in each state, the District of Columbia, and Puerto Rico are involved in the delineation of the 2020 PUMAs. Counties and census tracts are used to define PUMAs, and each PUMA must include at least 100,000 people based on the 2020 Census published counts. For the 2020 Census in Guam and the U.S. Virgin Islands, the Census Bureau establishes a single, separate PUMA for each of these two Island Areas. American Samoa and the Commonwealth of the Northern Mariana Islands do not have PUMAs, because the total population of each is under 100,000 people. Downloaded from https://www2.census.gov/geo/tiger/TIGER2022/PUMA/ on June 22, 2023

  5. TIGER/Line Shapefile, Current, State, District of Columbia, 2020 Census...

    • catalog.data.gov
    Updated Aug 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Department of Commerce, U.S. Census Bureau, Geography Division (Point of Contact) (2025). TIGER/Line Shapefile, Current, State, District of Columbia, 2020 Census Public Use Microdata Area (PUMA) [Dataset]. https://catalog.data.gov/dataset/tiger-line-shapefile-current-state-district-of-columbia-2020-census-public-use-microdata-area-p
    Explore at:
    Dataset updated
    Aug 7, 2025
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Area covered
    Washington
    Description

    This resource is a member of a series. The TIGER/Line shapefiles and related database files (.dbf) are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) System (MTS). The MTS represents a seamless national file with no overlaps or gaps between parts, however, each TIGER/Line shapefile is designed to stand alone as an independent data set, or they can be combined to cover the entire nation. Public Use Microdata Areas (PUMAs) are decennial census areas that permit the tabulation and dissemination of Public Use Microdata Sample (PUMS) data, American Community Survey (ACS) data, and data from other censuses and surveys. For the 2020 Census, the State Data Centers (SDCs) in each state, the District of Columbia, and the Commonwealth of Puerto Rico had the opportunity to delineate PUMAS within their state or statistically equivalent entity. All PUMAs must nest within states and have a minimum population threshold of 100,000 persons. 2020 PUMAs consist of census tracts and cover the entirety of the United States, Puerto Rico and Guam. American Samoa, the Commonwealth of the Northern Mariana Islands, and the U.S. Virgin Islands do not contain any 2020 PUMAs because the population is less than the minimum population requirement. Each PUMA is identified by a five-character numeric census code that may contain leading zeros and a descriptive name.

  6. Census of Population and Housing, 2000 [United States]: Public Law (P.L.)...

    • icpsr.umich.edu
    spss, stata
    Updated May 24, 2013
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States. Bureau of the Census (2013). Census of Population and Housing, 2000 [United States]: Public Law (P.L.) 94-171 Adjusted Data [Dataset]. http://doi.org/10.3886/ICPSR13400.v3
    Explore at:
    spss, stataAvailable download formats
    Dataset updated
    May 24, 2013
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    United States. Bureau of the Census
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/13400/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/13400/terms

    Time period covered
    2000
    Area covered
    Minnesota, Illinois, New Jersey, Idaho, Utah, Connecticut, Rhode Island, Pennsylvania, Colorado, District of Columbia
    Description

    The numbers contained in this study are released pursuant to the order of the United States Court of Appeals for the Ninth Circuit in Carter v. Department of Commerce, 307 F.3d 1084. These numbers are not official Census 2000 counts. These numbers are estimates of the population based on a statistical adjustment method, utilizing sampling and modeling, applied to the official Census 2000 figures. The estimates utilized the results of the Accuracy and Coverage Evaluation (A.C.E.), a sample survey intended to measure net over- and undercounts in the census results. The Census Bureau has determined that the A.C.E. estimates dramatically overstate the level of undercoverage in Census 2000, and that the adjusted Census 2000 data are, therefore, not more accurate than the unadjusted data. On March 6, 2001, the Secretary of Commerce decided that unadjusted data from Census 2000 should be used to tabulate population counts reported to states and localities pursuant to 13 U.S.C. 141(c) (see 66 FR 14520, March 13, 2001). The Secretary's decision endorsed the unanimous recommendation of the Executive Steering Committee for A.C.E. Policy (ESCAP), a group of 12 senior career professionals within the Census Bureau. The ESCAP, in its recommendation against the use of the statistically adjusted estimates, had noted serious reservations regarding their accuracy. In order to inform the Census Bureau's planned October 2001 decision regarding the potential use of the adjusted estimates for non-redistricting purposes, the agency conducted extensive analyses throughout the summer of 2001. These extensive analyses confirmed the serious concerns the agency had noted earlier regarding the accuracy of the A.C.E. estimates. Specifically, the adjusted estimates were determined to be so severely flawed that all potential uses of these data would be inappropriate. Accordingly, the Department of Commerce deems that these estimates should not be used for any purpose that legally requires use of data from the decennial census and assumes no responsibility for the accuracy of the data for any purpose whatsoever. The Department, including the U.S. Census Bureau, will provide no assistance in the interpretation or use of these numbers. The collection contains four tables: (1) a count of all persons by race (Table PL1), (2) a count of Hispanic or Latino and a count of not Hispanic or Latino by race of all persons (Table PL2), (3) a count of the population 18 years and older by race (Table PL3), and (4) a count of Hispanic or Latino and a count of not Hispanic or Latino by race for the population 18 years and older (Table PL4).

  7. V

    Norfolk Decennial Census

    • odgavaprod.ogopendata.com
    • data.norfolk.gov
    • +1more
    url
    Updated May 2, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Norfolk (2024). Norfolk Decennial Census [Dataset]. https://odgavaprod.ogopendata.com/dataset/norfolk-decennial-census
    Explore at:
    urlAvailable download formats
    Dataset updated
    May 2, 2024
    Dataset authored and provided by
    City of Norfolk
    Description

    This dataset contains Decennial Census data for Norfolk, Virginia. Every ten years ending in 0, the United States Census Bureau conducts a census to determine the number of people living in the United States. The release of data includes counts for population, race, ethnicity, group quarters, and housing units. This dataset contains data for all of Norfolk, VA.

    Source: U.S. Census Bureau, 2020 Census Redistricting Data (Public Law 94-171) For data about this dataset, please click on the below link: https://data.norfolk.gov/Government/Norfolk-Decennial-Census/dijs-dhze/about_data

  8. Data from: Census of Public Defender Offices: County-Based and Local...

    • icpsr.umich.edu
    • catalog.data.gov
    • +2more
    ascii, delimited, sas +2
    Updated May 13, 2011
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States Department of Justice. Office of Justice Programs. Bureau of Justice Statistics (2011). Census of Public Defender Offices: County-Based and Local Offices, 2007 [Dataset]. http://doi.org/10.3886/ICPSR29502.v1
    Explore at:
    spss, stata, ascii, delimited, sasAvailable download formats
    Dataset updated
    May 13, 2011
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    United States Department of Justice. Office of Justice Programs. Bureau of Justice Statistics
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/29502/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/29502/terms

    Time period covered
    2007
    Area covered
    United States
    Description

    The Bureau of Justice Statistics' (BJS) 2007 Census of Public Defender Offices (CPDO) collected data from public defender offices located across 49 states and the District of Columbia. Public defender offices are one of three methods through which states and localities ensure that indigent defendants are granted the Sixth and Fourteenth Amendment right to counsel. (In addition to defender offices, indigent defense services may also be provided by court-assigned private counsel or by a contract system in which private attorneys contractually agree to take on a specified number of indigent defendants or indigent defense cases.) Public defender offices have a salaried staff of full- or part-time attorneys who represent indigent defendants and are employed as direct government employees or through a public, nonprofit organization. Public defenders play an important role in the United States criminal justice system. Data from prior BJS surveys on indigent defense representation indicate that most criminal defendants rely on some form of publicly provided defense counsel, primarily public defenders. Although the United States Supreme Court has mandated that the states provide counsel for indigent persons accused of crime, documentation on the nature and provision of these services has not been readily available. States have devised various systems, rules of organization, and funding mechanisms for indigent defense programs. While the operation and funding of public defender offices varies across states, public defender offices can be generally classified as being part of either a state program or a county-based system. The 22 state public defender programs functioned entirely under the direction of a central administrative office that funded and administered all the public defender offices in the state. For the 28 states with county-based offices, indigent defense services were administered at the county or local jurisdictional level and funded principally by the county or through a combination of county and state funds. The CPDO collected data from both state- and county-based offices. All public defender offices that were principally funded by state or local governments and provided general criminal defense services, conflict services, or capital case representation were within the scope of the study. Federal public defender offices and offices that provided primarily contract or assigned counsel services with private attorneys were excluded from the data collection. In addition, public defender offices that were principally funded by a tribal government, or provided primarily appellate or juvenile services were outside the scope of the project and were also excluded. The CPDO gathered information on public defender office staffing, expenditures, attorney training, standards and guidelines, and caseloads, including the number and type of cases received by the offices. The data collected by the CPDO can be compared to and analyzed against many of the existing national standards for the provision of indigent defense services.

  9. a

    2020 Census State Redistricting Data (Public Law 94 171) Summary File

    • hub.arcgis.com
    • census.delaware.gov
    Updated Sep 10, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    State of Delaware (2021). 2020 Census State Redistricting Data (Public Law 94 171) Summary File [Dataset]. https://hub.arcgis.com/documents/2c68c47256304d55b4f7d717c86b5d7c
    Explore at:
    Dataset updated
    Sep 10, 2021
    Dataset authored and provided by
    State of Delaware
    Description

    The 2020 Census State Redistrict Data Summary File pages 99 - 113 include the data dictionary reference name and table number and contents that identifies the data dictionary reference name.

  10. Archive of Census Related Products (ACRP): 1990 Public Use Microdata Sample...

    • data.nasa.gov
    Updated Apr 1, 1990
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (1990). Archive of Census Related Products (ACRP): 1990 Public Use Microdata Sample Areas (PUMA) Boundary Files - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/archive-of-census-related-products-acrp-1990-public-use-microdata-sample-areas-puma-bounda
    Explore at:
    Dataset updated
    Apr 1, 1990
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    The 1990 Public Use Microdata Sample Areas (PUMA) Boundary Files portion of the Archive of Census Related Products (ACRP) consists of 5% sample (apuma) and 1% sample (bpuma) areas for the mapping of 1990 PUMS data covering the continental United States, Alaska, and Hawaii. These boundary files are created based on equivalency files generated by the Geographic Correspondence Engine (GeoCorr). A national census tract to PUMA geography correspondence file is used in merging the two files resulting in the PUMA geographies. An additional file is also available consisting of geographic centroids for the PUMA coverages calculated by UIC (Urban Information Center/Office of Computing, University of Missouri). This portion of the ACRP is produced by the Center for International Earth Science Information Network (CIESIN).

  11. n

    United States Census

    • datacatalog.med.nyu.edu
    Updated Jul 17, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). United States Census [Dataset]. https://datacatalog.med.nyu.edu/dataset/10026
    Explore at:
    Dataset updated
    Jul 17, 2018
    Area covered
    United States
    Description

    The Decennial Census provides population estimates and demographic information on residents of the United States.

    The Census Summary Files contain detailed tables on responses to the decennial census. Data tables in Summary File 1 provide information on population and housing characteristics, including cross-tabulations of age, sex, households, families, relationship to householder, housing units, detailed race and Hispanic or Latino origin groups, and group quarters for the total population. Summary File 2 contains data tables on population and housing characteristics as reported by housing unit.

    Researchers at NYU Langone Health can find guidance for the use and analysis of Census Bureau data on the Population Health Data Hub (listed under "Other Resources"), which is accessible only through the intranet portal with a valid Kerberos ID (KID).

  12. 1960 Residence Census Data for Baltimore, MD

    • search.dataone.org
    Updated Oct 14, 2013
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne (2013). 1960 Residence Census Data for Baltimore, MD [Dataset]. https://search.dataone.org/view/knb-lter-bes.150.570
    Explore at:
    Dataset updated
    Oct 14, 2013
    Dataset provided by
    Long Term Ecological Research Networkhttp://www.lternet.edu/
    Authors
    Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne
    Time period covered
    Jan 1, 2004 - Nov 17, 2011
    Area covered
    Description

    1960 Residence Census Data for Baltimore, Maryland. Refer to the 1960 codebook (codebook_1960.pdf) for more information. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.

  13. p

    Population and Housing Census 2005 - Palau

    • microdata.pacificdata.org
    Updated Aug 18, 2013
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of Planning and Statistics (2013). Population and Housing Census 2005 - Palau [Dataset]. https://microdata.pacificdata.org/index.php/catalog/27
    Explore at:
    Dataset updated
    Aug 18, 2013
    Dataset authored and provided by
    Office of Planning and Statistics
    Time period covered
    2005
    Area covered
    Palau
    Description

    Abstract

    The 2005 Republic of Palau Census of Population and Housing will be used to give a snapshot of Republic of Palau's population and housing at the mid-point of the decade. This Census is also important because it measures the population at the beginning of the implementation of the Compact of Free Association. The information collected in the census is needed to plan for the needs of the population. The government uses the census figures to allocate funds for public services in a wide variety of areas, such as education, housing, and job training. The figures also are used by private businesses, academic institutions, local organizations, and the public in general to understand who we are and what our situation is, in order to prepare better for our future needs.

    The fundamental purpose of a census is to provide information on the size, distribution and characteristics of a country's population. The census data are used for policymaking, planning and administration, as well as in management and evaluation of programmes in education, labour force, family planning, housing, health, transportation and rural development. A basic administrative use is in the demarcation of constituencies and allocation of representation to governing bodies. The census is also an invaluable resource for research, providing data for scientific analysis of the composition and distribution of the population and for statistical models to forecast its future growth. The census provides business and industry with the basic data they need to appraise the demand for housing, schools, furnishings, food, clothing, recreational facilities, medical supplies and other goods and services.

    Geographic coverage

    A hierarchical geographic presentation shows the geographic entities in a superior/subordinate structure in census products. This structure is derived from the legal, administrative, or areal relationships of the entities. The hierarchical structure is depicted in report tables by means of indentation. The following structure is used for the 2005 Census of the Republic of Palau:

    Republic of Palau State Hamlet/Village Enumeration District Block

    Analysis unit

    Individuals Families Households General Population

    Universe

    The Census covered all the households and respective residents in the entire country.

    Kind of data

    Census/enumeration data [cen]

    Sampling procedure

    Not applicable to a full enumeration census.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The 2005 Palau Census of Population and Housing comprises three parts: 1. Housing - one form for each household 2. Population - one for for each member of the household 3. People who have left home - one form for each household.

    Cleaning operations

    Full scale processing and editing activiities comprised eight separate sessions either with or separately but with remote guidance of the U.S. Census Bureau experts to finalize all datasets for publishing stage.

    Processing operation was handled with care to produce a set of data that describes the population as clearly and accurately as possible. To meet this objective, questionnaires were reviewed and edited during field data collection operations by crew leaders for consistency, completeness, and acceptability. Questionnaires were also reviewed by census clerks in the census office for omissions, certain inconsistencies, and population coverage. For example, write-in entries such as "Don't know" or "NA" were considered unacceptable in certain quantities and/or in conjunction with other data omissions.

    As a result of this review operation, a telephone or personal visit follow-up was made to obtain missing information. Potential coverage errors were included in the follow-up, as well as questionnaires with omissions or inconsistencies beyond the completeness and quality tolerances specified in the review procedures.

    Subsequent to field operations, remaining incomplete or inconsistent information on the questionnaires was assigned using imputation procedures during the final automated edit of the collected data. Allocations, or computer assignments of acceptable data in place of unacceptable entries or blanks, were needed most often when an entry for a given item was lacking or when the information reported for a person or housing unit on that item was inconsistent with other information for that same person or housing unit. As in previous censuses, the general procedure for changing unacceptable entries was to assign an entry for a person or housing unit that was consistent with entries for persons or housing units with similar characteristics. The assignment of acceptable data in lace of blanks or unacceptable entries enhanced the usefulness of the data.

    Another way to make corrections during the computer editing process is substitution. Substitution is the assignment of a full set of characteristics for a person or housing unit. Because of the detailed field operations, substitution was not needed for the 2005 Census.

    Sampling error estimates

    Sampling Error is not applicable to full enumeration censuses.

    Data appraisal

    In any large-scale statistical operation, such as the 2005 Census of the Republic of Palau, human- and machine-related errors were anticipated. These errors are commonly referred to as nonsampling errors. Such errors include not enumerating every household or every person in the population, not obtaining all required information form the respondents, obtaining incorrect or inconsistent information, and recording information incorrectly. In addition, errors can occur during the field review of the enumerators' work, during clerical handling of the census questionnaires, or during the electronic processing of the questionnaires.

    To reduce various types of nonsampling errors, a number of techniques were implemented during the planning, data collection, and data processing activities. Quality assurance methods were used throughout the data collection and processing phases of the census to improve the quality of the data.

  14. AHRQ and NaNDA Included Variables

    • zenodo.org
    csv
    Updated Apr 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymous Anonymous; Anonymous Anonymous (2024). AHRQ and NaNDA Included Variables [Dataset]. http://doi.org/10.5281/zenodo.10982453
    Explore at:
    csvAvailable download formats
    Dataset updated
    Apr 24, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Anonymous Anonymous; Anonymous Anonymous
    Description

    All credit for variables in AHRQ_included_variables.csv is attributed to

  15. 2010 Census Production Settings Redistricting Data (P.L. 94-171)...

    • registry.opendata.aws
    • icpsr.umich.edu
    Updated Nov 10, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States Census Bureau (2023). 2010 Census Production Settings Redistricting Data (P.L. 94-171) Demonstration Noisy Measurement File [Dataset]. https://registry.opendata.aws/census-2010-pl94-nmf/
    Explore at:
    Dataset updated
    Nov 10, 2023
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The 2010 Census Production Settings Redistricting Data (P.L. 94-171) Demonstration Noisy Measurement File (2023-04-03) is an intermediate output of the 2020 Census Disclosure Avoidance System (DAS) TopDown Algorithm (TDA) (as described in Abowd, J. et al [2022] https://doi.org/10.1162/99608f92.529e3cb9 , and implemented in https://github.com/uscensusbureau/DAS_2020_Redistricting_Production_Code). The NMF was produced using the official “production settings,” the final set of algorithmic parameters and privacy-loss budget allocations, that were used to produce the 2020 Census Redistricting Data (P.L. 94-171) Summary File and the 2020 Census Demographic and Housing Characteristics File.

    The NMF consists of the full set of privacy-protected statistical queries (counts of individuals or housing units with particular combinations of characteristics) of confidential 2010 Census data relating to the redistricting data portion of the 2010 Demonstration Data Products Suite – Redistricting and Demographic and Housing Characteristics File – Production Settings (2023-04-03). These statistical queries, called “noisy measurements” were produced under the zero-Concentrated Differential Privacy framework (Bun, M. and Steinke, T [2016] https://arxiv.org/abs/1605.02065; see also Dwork C. and Roth, A. [2014] https://www.cis.upenn.edu/~aaroth/Papers/privacybook.pdf) implemented via the discrete Gaussian mechanism (Cannone C., et al., [2023] https://arxiv.org/abs/2004.00010), which added positive or negative integer-valued noise to each of the resulting counts. The noisy measurements are an intermediate stage of the TDA prior to the post-processing the TDA then performs to ensure internal and hierarchical consistency within the resulting tables. The Census Bureau has released these 2010 Census demonstration data to enable data users to evaluate the expected impact of disclosure avoidance variability on 2020 Census data. The 2010 Census Production Settings Redistricting Data (P.L.94-171) Demonstration Noisy Measurement File (2023-04-03) has been cleared for public dissemination by the Census Bureau Disclosure Review Board (CBDRB-FY22-DSEP-004).

    The data includes zero-Concentrated Differentially Private (zCDP) (Bun, M. and Steinke, T [2016]) noisy measurements, implemented via the discrete Gaussian mechanism. These are estimated counts of individuals and housing units included in the 2010 Census Edited File (CEF), which includes confidential data initially collected in the 2010 Census of Population and Housing. The noisy measurements included in this file were subsequently post-processed by the TopDown Algorithm (TDA) to produce the 2010 Census Production Settings Privacy-Protected Microdata File - Redistricting (P.L. 94-171) and Demographic and Housing Characteristics File (2023-04-03) (https://www2.census.gov/programs-surveys/decennial/2020/program-management/data-product-planning/2010-demonstration-data-products/04-Demonstration_Data_Products_Suite/2023-04-03/). As these 2010 Census demonstration data are intended to support study of the design and expected impacts of the 2020 Disclosure Avoidance System, the 2010 CEF records were pre-processed before application of the zCDP framework. This pre-processing converted the 2010 CEF records into the input-file format, response codes, and tabulation categories used for the 2020 Census, which differ in substantive ways from the format, response codes, and tabulation categories originally used for the 2010 Census.

    The NMF provides estimates of counts of persons in the CEF by various characteristics and combinations of characteristics including their reported race and ethnicity, whether they were of voting age, whether they resided in a housing unit or one of 7 group quarters types, and their census block of residence after the addition of discrete Gaussian noise (with the scale parameter determined by the privacy-loss budget allocation for that particular query under zCDP). Noisy measurements of the counts of occupied and vacant housing units by census block are also included. Lastly, data on constraints—information into which no noise was infused by the Disclosure Avoidance System (DAS) and used by the TDA to post-process the noisy measurements into the 2010 Census Production Settings Privacy-Protected Microdata File - Redistricting (P.L. 94-171) and Demographic and Housing Characteristics File (2023-04-03) —are provided.

  16. TIGER/Line Shapefile, Current, State, Texas, 2020 Census Public Use...

    • catalog.data.gov
    Updated Aug 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Department of Commerce, U.S. Census Bureau, Geography Division (Point of Contact) (2025). TIGER/Line Shapefile, Current, State, Texas, 2020 Census Public Use Microdata Area (PUMA) [Dataset]. https://catalog.data.gov/dataset/tiger-line-shapefile-current-state-texas-2020-census-public-use-microdata-area-puma
    Explore at:
    Dataset updated
    Aug 8, 2025
    Dataset provided by
    United States Department of Commercehttp://commerce.gov/
    United States Census Bureauhttp://census.gov/
    Area covered
    Texas
    Description

    This resource is a member of a series. The TIGER/Line shapefiles and related database files (.dbf) are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) System (MTS). The MTS represents a seamless national file with no overlaps or gaps between parts, however, each TIGER/Line shapefile is designed to stand alone as an independent data set, or they can be combined to cover the entire nation. Public Use Microdata Areas (PUMAs) are decennial census areas that permit the tabulation and dissemination of Public Use Microdata Sample (PUMS) data, American Community Survey (ACS) data, and data from other censuses and surveys. For the 2020 Census, the State Data Centers (SDCs) in each state, the District of Columbia, and the Commonwealth of Puerto Rico had the opportunity to delineate PUMAS within their state or statistically equivalent entity. All PUMAs must nest within states and have a minimum population threshold of 100,000 persons. 2020 PUMAs consist of census tracts and cover the entirety of the United States, Puerto Rico and Guam. American Samoa, the Commonwealth of the Northern Mariana Islands, and the U.S. Virgin Islands do not contain any 2020 PUMAs because the population is less than the minimum population requirement. Each PUMA is identified by a five-character numeric census code that may contain leading zeros and a descriptive name.

  17. d

    Alaskan Population Demographic Information from Decennial and American...

    • dataone.org
    • search.dataone.org
    • +1more
    Updated Apr 11, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus (2019). Alaskan Population Demographic Information from Decennial and American Community Survey Census Data, 1940-2016 [Dataset]. http://doi.org/10.5063/F10R9MPV
    Explore at:
    Dataset updated
    Apr 11, 2019
    Dataset provided by
    Knowledge Network for Biocomplexity
    Authors
    United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus
    Time period covered
    Jan 1, 1940 - Dec 31, 2015
    Area covered
    Variables measured
    lat, lng, Year, city, ANVSA, Negro, Other, Place, White, Aleut., and 145 more
    Description

    These data comprise Census records relating to the Alaskan people's population demographics for the State of Alaskan Salmon and People (SASAP) Project. Decennial census data were originally extracted from IPUMS National Historic Geographic Information Systems website: https://data2.nhgis.org/main (Citation: Steven Manson, Jonathan Schroeder, David Van Riper, and Steven Ruggles. IPUMS National Historical Geographic Information System: Version 12.0 [Database]. Minneapolis: University of Minnesota. 2017. http://doi.org/10.18128/D050.V12.0). A number of relevant tables of basic demographics on age and race, household income and poverty levels, and labor force participation were extracted. These particular variables were selected as part of an effort to understand and potentially quantify various dimensions of well-being in Alaskan communities. The file "censusdata_master.csv" is a consolidation of all 21 other data files in the package. For detailed information on how the datasets vary over different years, view the file "readme.docx" available in this data package. The included .Rmd file is a script which combines the 21 files by year into a single file (censusdata_master.csv). It also cleans up place names (including typographical errors) and uses the USGS place names dataset and the SASAP regions dataset to assign latitude and longitude values and region values to each place in the dataset. Note that some places were not assigned a region or location because they do not fit well into the regional framework. Considerable heterogeneity exists between census surveys each year. While we have attempted to combine these datasets in a way that makes sense, there may be some discrepancies or unexpected values. The RMarkdown document SASAPWebsiteGraphicsCensus.Rmd is used to generate a variety of figures using these data, including the additional file Chignik_population.png. An additional set of 25 figures showing regional trends in population and income metrics are also included.

  18. O

    Census Block Groups

    • data.austintexas.gov
    • datahub.austintexas.gov
    • +3more
    Updated Feb 28, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Austin, Texas - data.austintexas.gov (2024). Census Block Groups [Dataset]. https://data.austintexas.gov/Public-Safety/Census-Block-Groups/dwa9-qvcr
    Explore at:
    kmz, application/geo+json, tsv, csv, kml, application/rssxml, xml, application/rdfxmlAvailable download formats
    Dataset updated
    Feb 28, 2024
    Dataset authored and provided by
    City of Austin, Texas - data.austintexas.gov
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    DATASET DESCRIPTION: Census Block Group polygons from the United States Census Bureau (2022) of the Austin area, including Hays, Travis and Williamson counties and a portion of western Bastrop County. Block groups are clusters of blocks within the same census tract that have the same first digit of their 4-character census block number (e.g., Blocks 3001, 3002, 3003 to 3999 in census tract 1210.02 belong to block group 3).

    AUSTIN POLICE DEPARTMENT DATA DISCLAIMER 1. The data provided is for informational use only and may differ from official Austin Police Department data.

    1. The Austin Police Department’s databases are continuously updated, and changes can be made due to a variety of investigative factors including but not limited to offense reclassification and dates.

    2. The Austin Police Department does not assume any liability for any decision made or action taken or not taken by the recipient in reliance upon any information or data provided.

    City of Austin Open Data Terms of Use - https://data.austintexas.gov/stories/s/ranj-cccq

  19. u

    Census 1991: Individual Sample of Anonymised Records for Northern Ireland...

    • beta.ukdataservice.ac.uk
    Updated 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Census Division Office For National Statistics; Cathie Marsh Centre For Census University Of Manchester (2023). Census 1991: Individual Sample of Anonymised Records for Northern Ireland (SARs) [Dataset]. http://doi.org/10.5255/ukda-sn-7212-1
    Explore at:
    Dataset updated
    2023
    Dataset provided by
    UK Data Servicehttps://ukdataservice.ac.uk/
    datacite
    Authors
    Census Division Office For National Statistics; Cathie Marsh Centre For Census University Of Manchester
    Area covered
    Northern Ireland, Ireland
    Description

    The UK censuses took place on 21st April 1991. They were run by the Census Office for Northern Ireland, General Register Office for Scotland, and the Office of Population and Surveys for both England and Wales. The UK comprises the countries of England, Wales, Scotland and Northern Ireland.

    Statistics from the UK censuses help paint a picture of the nation and how we live. They provide a detailed snapshot of the population and its characteristics, and underpin funding allocation to provide public services.

    The Northern Ireland Individual SAR is a 2% sample of individuals which was drawn from the full set of 1991 Census records. It was released to the then Census Microdata Unit (now the Centre for Census and Survey Research) in May 1994 who then undertook quality assurance work and produced documentation and additional derived variables.

    The dataset contains 31,967 person records and 53 variables. A number of protections are in place to ensure the anonymity of cases in the data including the low sampling fraction, grouping of some rare categories, limitation of geographical detail and record reordering so that the cases are not ordered geographically.

    Once the Household SAR (held under SN 7213) had been removed to avoid an overlap between the two files, remaining records were stratified into groups of 99 and two individuals were chosen from each group. Individuals in communal establishments were stratified geographically into groups of 50 people and one person was chosen at random from each group. Unlike Great Britain, 100% of Northern Ireland records were coded.

    For many variables, the codes that are used in the Northern Ireland SARs differ from those used in the GB SARs. Where this is the case, the Northern Ireland coding follows on numerically from the GB coding but it will not necessarily start at 1 and have a value for every succeeding integer. For example, 'household family type' uses quite different coding to the nearest GB equivalent 'family type': the GB codes run from 00 through to 08, whilst the Northern Ireland codes pick up from 09 and run through to 23.

    Further information, including guides and other documentation, may be found on the Cathie Marsh Centre for Survey Research Samples of Anonymised Records (SARS) website.

  20. TIGER/Line Shapefile, 2022, State, Minnesota, MN, 2020 Census Public Use...

    • catalog.data.gov
    • res1catalogd-o-tdatad-o-tgov.vcapture.xyz
    Updated Jan 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Department of Commerce, U.S. Census Bureau, Geography Division, Spatial Data Collection and Products Branch (Point of Contact) (2024). TIGER/Line Shapefile, 2022, State, Minnesota, MN, 2020 Census Public Use Microdata Area (PUMA) [Dataset]. https://catalog.data.gov/dataset/tiger-line-shapefile-2022-state-minnesota-mn-2020-census-public-use-microdata-area-puma
    Explore at:
    Dataset updated
    Jan 27, 2024
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Area covered
    Minnesota
    Description

    The TIGER/Line shapefiles and related database files (.dbf) are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File/Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) Database (MTDB). The MTDB represents a seamless national file with no overlaps or gaps between parts, however, each TIGER/Line shapefile is designed to stand alone as an independent data set, or they can be combined to cover the entire nation. Public Use Microdata Areas (PUMAs) are decennial census areas that permit the tabulation and dissemination of Public Use Microdata Sample (PUMS) data, American Community Survey (ACS) data, and data from other census and surveys. For the 2020 Census, the State Data Centers (SDCs) in each state, the District of Columbia, and the Commonwealth of Puerto Rico had the opportunity to delineate PUMAS within their state or statistically equivalent entity. All PUMAs must nest within states and have a minimum population threshold of 100,000 persons. 2020 PUMAs consist of census tracts and cover the entirety of the United States, Puerto Rico and Guam. American Samoa, the Commonwealth of the Northern Mariana Islands, and the U.S. Virgin Islands do not contain any 2020 PUMAs because the population is less than the minimum population requirement. Each PUMA is identified by a 5-character numeric census code that may contain leading zeros and a descriptive name. The 2020 PUMAs will appear in the 2022 TIGER/Line Shapefiles.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Google BigQuery (2020). census-bureau-international [Dataset]. https://www.kaggle.com/datasets/bigquery/census-bureau-international
Organization logo

census-bureau-international

World population estimates 1950 through 2050

Explore at:
zip(0 bytes)Available download formats
Dataset updated
May 6, 2020
Dataset provided by
BigQueryhttps://cloud.google.com/bigquery
Authors
Google BigQuery
Description

Context

The United States Census Bureau’s international dataset provides estimates of country populations since 1950 and projections through 2050. Specifically, the dataset includes midyear population figures broken down by age and gender assignment at birth. Additionally, time-series data is provided for attributes including fertility rates, birth rates, death rates, and migration rates.

Querying BigQuery tables

You can use the BigQuery Python client library to query tables in this dataset in Kernels. Note that methods available in Kernels are limited to querying data. Tables are at bigquery-public-data.census_bureau_international.

Sample Query 1

What countries have the longest life expectancy? In this query, 2016 census information is retrieved by joining the mortality_life_expectancy and country_names_area tables for countries larger than 25,000 km2. Without the size constraint, Monaco is the top result with an average life expectancy of over 89 years!

standardSQL

SELECT age.country_name, age.life_expectancy, size.country_area FROM ( SELECT country_name, life_expectancy FROM bigquery-public-data.census_bureau_international.mortality_life_expectancy WHERE year = 2016) age INNER JOIN ( SELECT country_name, country_area FROM bigquery-public-data.census_bureau_international.country_names_area where country_area > 25000) size ON age.country_name = size.country_name ORDER BY 2 DESC /* Limit removed for Data Studio Visualization */ LIMIT 10

Sample Query 2

Which countries have the largest proportion of their population under 25? Over 40% of the world’s population is under 25 and greater than 50% of the world’s population is under 30! This query retrieves the countries with the largest proportion of young people by joining the age-specific population table with the midyear (total) population table.

standardSQL

SELECT age.country_name, SUM(age.population) AS under_25, pop.midyear_population AS total, ROUND((SUM(age.population) / pop.midyear_population) * 100,2) AS pct_under_25 FROM ( SELECT country_name, population, country_code FROM bigquery-public-data.census_bureau_international.midyear_population_agespecific WHERE year =2017 AND age < 25) age INNER JOIN ( SELECT midyear_population, country_code FROM bigquery-public-data.census_bureau_international.midyear_population WHERE year = 2017) pop ON age.country_code = pop.country_code GROUP BY 1, 3 ORDER BY 4 DESC /* Remove limit for visualization*/ LIMIT 10

Sample Query 3

The International Census dataset contains growth information in the form of birth rates, death rates, and migration rates. Net migration is the net number of migrants per 1,000 population, an important component of total population and one that often drives the work of the United Nations Refugee Agency. This query joins the growth rate table with the area table to retrieve 2017 data for countries greater than 500 km2.

SELECT growth.country_name, growth.net_migration, CAST(area.country_area AS INT64) AS country_area FROM ( SELECT country_name, net_migration, country_code FROM bigquery-public-data.census_bureau_international.birth_death_growth_rates WHERE year = 2017) growth INNER JOIN ( SELECT country_area, country_code FROM bigquery-public-data.census_bureau_international.country_names_area

Update frequency

Historic (none)

Dataset source

United States Census Bureau

Terms of use: This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

See the GCP Marketplace listing for more details and sample queries: https://console.cloud.google.com/marketplace/details/united-states-census-bureau/international-census-data

Search
Clear search
Close search
Google apps
Main menu