44 datasets found

k
Population Projection
datasource.kapsarc.org
Updated Mar 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Population Projection [Dataset]. https://datasource.kapsarc.org/explore/dataset/population-projection/
Explore at:
Dataset updated
Mar 10, 2025
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Explore population projections for China on this dataset webpage. Get valuable insights into the future demographic trends of one of the world's most populous countries.

Population, China, projections ChinaFollow data.kapsarc.org for timely data to advance energy economics research..Total population is based on the de facto definition of population, which counts all residents regardless of legal status or citizenship. The values shown are midyear estimatesSource: (1) United Nations Population Division. World Population Prospects: 2019 Revision. (2) Census reports and other statistical publications from national statistical offices, (3) Eurostat: Demographic Statistics, (4) United Nations Statistical Division. Population and Vital Statistics Reprot (various years), (5) U.S. Census Bureau: International Database, and (6) Secretariat of the Pacific Community: Statistics and Demography Programme.
Retail Sales Dataset
kaggle.com
Updated Aug 22, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mohammad Talib (2023). Retail Sales Dataset [Dataset]. https://www.kaggle.com/datasets/mohammadtalib786/retail-sales-dataset/data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 22, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Mohammad Talib
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Welcome to the Retail Sales and Customer Demographics Dataset! This synthetic dataset has been meticulously crafted to simulate a dynamic retail environment, providing an ideal playground for those eager to sharpen their data analysis skills through exploratory data analysis (EDA). With a focus on retail sales and customer characteristics, this dataset invites you to unravel intricate patterns, draw insights, and gain a deeper understanding of customer behavior.

****Dataset Overview:**

This dataset is a snapshot of a fictional retail landscape, capturing essential attributes that drive retail operations and customer interactions. It includes key details such as Transaction ID, Date, Customer ID, Gender, Age, Product Category, Quantity, Price per Unit, and Total Amount. These attributes enable a multifaceted exploration of sales trends, demographic influences, and purchasing behaviors.

Why Explore This Dataset?

Realistic Representation: Though synthetic, the dataset mirrors real-world retail scenarios, allowing you to practice analysis within a familiar context.

Diverse Insights: From demographic insights to product preferences, the dataset offers a broad spectrum of factors to investigate.

Hypothesis Generation: As you perform EDA, you'll have the chance to formulate hypotheses that can guide further analysis and experimentation.

Applied Learning: Uncover actionable insights that retailers could use to enhance their strategies and customer experiences.

Questions to Explore:

How does customer age and gender influence their purchasing behavior?

Are there discernible patterns in sales across different time periods?

Which product categories hold the highest appeal among customers?

What are the relationships between age, spending, and product preferences?

How do customers adapt their shopping habits during seasonal trends?

Are there distinct purchasing behaviors based on the number of items bought per transaction?

What insights can be gleaned from the distribution of product prices within each category?

Your EDA Journey:

Prepare to immerse yourself in a world of data-driven exploration. Through data visualization, statistical analysis, and correlation examination, you'll uncover the nuances that define retail operations and customer dynamics. EDA isn't just about numbers—it's about storytelling with data and extracting meaningful insights that can influence strategic decisions.

Embrace the Retail Sales and Customer Demographics Dataset as your canvas for discovery. As you traverse the landscape of this synthetic retail environment, you'll refine your analytical skills, pose intriguing questions, and contribute to the ever-evolving narrative of the retail industry. Happy exploring!
r
POPLINK - Demographic Data Base
researchdata.se
Updated Jun 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Umeå university (2025). POPLINK - Demographic Data Base [Dataset]. https://researchdata.se/en/catalogue/dataset/ext0086-1
Explore at:
Dataset updated
Jun 24, 2025
Dataset provided by
Umeå University
Authors
Umeå university
Time period covered
1700 - 1950
Description
The POPLINK database contains personal records entered from parish registers from large parts of the Västerbotten county and the inland of Norrland. The database covers parish registers from the end of the 17th century until 1950. The personal records stem from different types of parish registers such as catechetical registers, birth and baptism registers, banns and marriage registers, migrations registers, and death registers.

These datasets, ranging from birth to death, provides possibilities to link together individuals and generations. The life stories of individuals as well as family histories can be tracked. Moreover, the dataset offers the possibility to study demographical and socio-economic changes over time across different regions and parishes.

The purpose of the POPLINK database is to provide detailed individual records from church books between the late 1600s and around 1950 for research. It enables studies on demography, social structures, and historical development in Sweden. POPUM offers data on, among other things, population development, migration patterns, economic changes, industrialization, agricultural communities, small-scale farming, Sámi history, and urbanization.

The database is a resource in many areas of research. Not least because of the possibility to link together POPLINK with other modern registers. This means, among other things, that the life sciences can investigate how nature and nurture influence the development of our most common national diseases, such as cardiovascular diseases, cancer, and diabetes. Within the social sciences and the humanities, the increasing access to personal records brings about an opportunity for new perspectives on the rise of the welfare state and the time that shaped the modern Sweden.

POPLINK is a research database, which means that only academic researchers can be given access. In some cases it requires approval from The Swedish Ethical Review Authority.
f
Demographic monitoring of wild muriqui populations: Criteria for defining...
plos.figshare.com
datasetcatalog.nlm.nih.gov
+1more
pdf
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Karen B. Strier; Carla B. Possamai; Fernanda P. Tabacow; Alcides Pissinatti; Andre M. Lanna; Fabiano Rodrigues de Melo; Leandro Moreira; Maurício Talebi; Paula Breves; Sérgio L. Mendes; Leandro Jerusalinsky (2023). Demographic monitoring of wild muriqui populations: Criteria for defining priority areas and monitoring intensity [Dataset]. http://doi.org/10.1371/journal.pone.0188922
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0188922
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Karen B. Strier; Carla B. Possamai; Fernanda P. Tabacow; Alcides Pissinatti; Andre M. Lanna; Fabiano Rodrigues de Melo; Leandro Moreira; Maurício Talebi; Paula Breves; Sérgio L. Mendes; Leandro Jerusalinsky
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Demographic data are essential to assessments of the status of endangered species. However, establishing an integrated monitoring program to obtain useful data on contemporary and future population trends requires both the identification of priority areas and populations and realistic evaluations of the kinds of data that can be obtained under different monitoring regimes. We analyzed all known populations of a critically endangered primate, the muriqui (genus: Brachyteles) using population size, genetic uniqueness, geographic importance (including potential importance in corridor programs) and implementability scores to define monitoring priorities. Our analyses revealed nine priority populations for the northern muriqui (B. hypoxanthus) and nine for the southern muriqui (B. arachnoides). In addition, we employed knowledge of muriqui developmental and life history characteristics to define the minimum monitoring intensity needed to evaluate demographic trends along a continuum ranging from simple descriptive changes in population size to predictions of population changes derived from individual based life histories. Our study, stimulated by the Brazilian government’s National Action Plan for the Conservation of Muriquis, is fundamental to meeting the conservation goals for this genus, and also provides a model for defining priorities and methods for the implementation of integrated demographic monitoring programs for other endangered and critically endangered species of primates.
d
U.S. Voting by Census Block Groups
search.dataone.org
Updated Nov 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bryan, Michael (2023). U.S. Voting by Census Block Groups [Dataset]. http://doi.org/10.7910/DVN/NKNWBX
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/NKNWBX
Dataset updated
Nov 9, 2023
Dataset provided by
Harvard Dataverse
Authors
Bryan, Michael
Area covered
United States
Description
PROBLEM AND OPPORTUNITY In the United States, voting is largely a private matter. A registered voter is given a randomized ballot form or machine to prevent linkage between their voting choices and their identity. This disconnect supports confidence in the election process, but it provides obstacles to an election's analysis. A common solution is to field exit polls, interviewing voters immediately after leaving their polling location. This method is rife with bias, however, and functionally limited in direct demographics data collected. For the 2020 general election, though, most states published their election results for each voting location. These publications were additionally supported by the geographical areas assigned to each location, the voting precincts. As a result, geographic processing can now be applied to project precinct election results onto Census block groups. While precinct have few demographic traits directly, their geographies have characteristics that make them projectable onto U.S. Census geographies. Both state voting precincts and U.S. Census block groups: are exclusive, and do not overlap are adjacent, fully covering their corresponding state and potentially county have roughly the same size in area, population and voter presence Analytically, a projection of local demographics does not allow conclusions about voters themselves. However, the dataset does allow statements related to the geographies that yield voting behavior. One could say, for example, that an area dominated by a particular voting pattern would have mean traits of age, race, income or household structure. The dataset that results from this programming provides voting results allocated by Census block groups. The block group identifier can be joined to Census Decennial and American Community Survey demographic estimates. DATA SOURCES The state election results and geographies have been compiled by Voting and Election Science team on Harvard's dataverse. State voting precincts lie within state and county boundaries. The Census Bureau, on the other hand, publishes its estimates across a variety of geographic definitions including a hierarchy of states, counties, census tracts and block groups. Their definitions can be found here. The geometric shapefiles for each block group are available here. The lowest level of this geography changes often and can obsolesce before the next census survey (Decennial or American Community Survey programs). The second to lowest census level, block groups, have the benefit of both granularity and stability however. The 2020 Decennial survey details US demographics into 217,740 block groups with between a few hundred and a few thousand people. Dataset Structure The dataset's columns include: Column Definition BLOCKGROUP_GEOID 12 digit primary key. Census GEOID of the block group row. This code concatenates: 2 digit state 3 digit county within state 6 digit Census Tract identifier 1 digit Census Block Group identifier within tract STATE State abbreviation, redundent with 2 digit state FIPS code above REP Votes for Republican party candidate for president DEM Votes for Democratic party candidate for president LIB Votes for Libertarian party candidate for president OTH Votes for presidential candidates other than Republican, Democratic or Libertarian AREA square kilometers of area associated with this block group GAP total area of the block group, net of area attributed to voting precincts PRECINCTS Number of voting precincts that intersect this block group ASSUMPTIONS, NOTES AND CONCERNS: Votes are attributed based upon the proportion of the precinct's area that intersects the corresponding block group. Alternative methods are left to the analyst's initiative. 50 states and the District of Columbia are in scope as those U.S. possessions voting in the general election for the U.S. Presidency. Three states did not report their results at the precinct level: South Dakota, Kentucky and West Virginia. A dummy block group is added for each of these states to maintain national totals. These states represent 2.1% of all votes cast. Counties are commonly coded using FIPS codes. However, each election result file may have the county field named differently. Also, three states do not share county definitions - Delaware, Massachusetts, Alaska and the District of Columbia. Block groups may be used to capture geographies that do not have population like bodies of water. As a result, block groups without intersection voting precincts are not uncommon. In the U.S., elections are administered at a state level with the Federal Elections Commission compiling state totals against the Electoral College weights. The states have liberty, though, to define and change their own voting precincts https://en.wikipedia.org/wiki/Electoral_precinct. The Census Bureau... Visit https://dataone.org/datasets/sha256%3A05707c1dc04a814129f751937a6ea56b08413546b18b351a85bc96da16a7f8b5 for complete metadata about this dataset.
D
[Archived] COVID-19 Deaths by Population Characteristics Over Time
data.sfgov.org
healthdata.gov
+1more
csv, xlsx, xml
Updated Jun 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). [Archived] COVID-19 Deaths by Population Characteristics Over Time [Dataset]. https://data.sfgov.org/Health-and-Social-Services/-Archived-COVID-19-Deaths-by-Population-Characteri/kkr3-wq7h
Explore at:
xlsx, xml, csvAvailable download formats
Dataset updated
Jun 27, 2024
License
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Description
As of July 2nd, 2024 the COVID-19 Deaths by Population Characteristics Over Time dataset has been retired. This dataset is archived and will no longer update. We will be publishing a cumulative deaths by population characteristics dataset that will update moving forward.

A. SUMMARY This dataset shows San Francisco COVID-19 deaths by population characteristics and by date. This data may not be immediately available for recently reported deaths. Data updates as more information becomes available. Because of this, death totals for previous days may increase or decrease. More recent data is less reliable.

Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how deaths have been distributed among different subgroups. This information can reveal trends and disparities among groups.

B. HOW THE DATASET IS CREATED As of January 1, 2023, COVID-19 deaths are defined as persons who had COVID-19 listed as a cause of death or a significant condition contributing to their death on their death certificate. This definition is in alignment with the California Department of Public Health and the national https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">Council of State and Territorial Epidemiologists. Death certificates are maintained by the California Department of Public Health.

Data on the population characteristics of COVID-19 deaths are from: *Case reports *Medical records *Electronic lab reports *Death certificates

Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.

To protect resident privacy, we summarize COVID-19 data by only one characteristic at a time. Data are not shown until cumulative citywide deaths reach five or more.

Data notes on each population characteristic type is listed below.

Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases.

Gender * The City collects information on gender identity using these guidelines.

C. UPDATE PROCESS Updates automatically at 06:30 and 07:30 AM Pacific Time on Wednesday each week.

Dataset will not update on the business day following any federal holiday.

D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).

This dataset includes many different types of characteristics. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of deaths on each date.

New deaths are the count of deaths within that characteristic group on that specific date. Cumulative deaths are the running total of all San Francisco COVID-19 deaths in that characteristic group up to the date listed.

This data may not be immediately available for more recent deaths. Data updates as more information becomes available.

To explore data on the total number of deaths, use the COVID-19 Deaths Over Time dataset.

E. CHANGE LOG
9/11/2023 - on this date, we began using an updated definition of a COVID-19 death to align with the California Department of Public Health. This change was applied to COVID-19 deaths retrospectively beginning on 1/1/2023. More information about the recommendation by the Council of State and Territorial Epidemiologists that motivated this change can be found https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">here.
6/6/2023 - data on deaths by transmission type have been removed. See section ARCHIVED DATA for more detail.
5/16/2023 - data on deaths by sexual orientation, comorbidities, homelessness, and single room occupancy have been removed. See section ARCHIVED DATA for more detail.
4/6/2023 - the State implemented system updates to improve the integrity of historical data.
1/31/2023 - column “population_estimate” added.
3/23/2022 - ‘Native American’ changed to ‘American Indian or Alaska Native’ to align with the census.
1/22/2022 - system updates to improve timeliness and accuracy of cases and deaths data were implemented.
COVID-19 Deaths by Population Characteristics
healthdata.gov
data.sfgov.org
+3more
application/rdfxml +5
Updated Apr 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.sfgov.org (2025). COVID-19 Deaths by Population Characteristics [Dataset]. https://healthdata.gov/dataset/COVID-19-Deaths-by-Population-Characteristics/wzjr-br3v
Explore at:
application/rdfxml, tsv, application/rssxml, csv, xml, jsonAvailable download formats
Dataset updated
Apr 8, 2025
Dataset provided by
data.sfgov.org
Description
A. SUMMARY This dataset shows San Francisco COVID-19 deaths by population characteristics. This data may not be immediately available for recently reported deaths. Data updates as more information becomes available. Because of this, death totals may increase or decrease.

Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how deaths have been distributed among different subgroups. This information can reveal trends and disparities among groups.

B. HOW THE DATASET IS CREATED As of January 1, 2023, COVID-19 deaths are defined as persons who had COVID-19 listed as a cause of death or a significant condition contributing to their death on their death certificate. This definition is in alignment with the California Department of Public Health and the national https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">Council of State and Territorial Epidemiologists. Death certificates are maintained by the California Department of Public Health.

Data on the population characteristics of COVID-19 deaths are from: *Case reports *Medical records *Electronic lab reports *Death certificates

Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.

To protect resident privacy, we summarize COVID-19 data by only one population characteristic at a time. Data are not shown until cumulative citywide deaths reach five or more.

Data notes on select population characteristic types are listed below.

Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases.

Gender * The City collects information on gender identity using these guidelines.

C. UPDATE PROCESS Updates automatically at 06:30 and 07:30 AM Pacific Time on Wednesday each week.

Dataset will not update on the business day following any federal holiday.

D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a dataset based on the San Francisco Population and Demographic Census dataset.These population estimates are from the 2018-2022 5-year American Community Survey (ACS).

This dataset includes several characteristic types. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cumulative deaths.

Cumulative deaths are the running total of all San Francisco COVID-19 deaths in that characteristic group up to the date listed.

To explore data on the total number of deaths, use the COVID-19 Deaths Over Time dataset.

E. CHANGE LOG
2020 Census Tracts
catalog.data.gov
data.oregon.gov
+3more
Updated Jan 31, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Department of Commerce, U.S. Census Bureau, Geography Division, Spatial Data Collection and Products Branch (2025). 2020 Census Tracts [Dataset]. https://catalog.data.gov/dataset/census-tracts
Explore at:
Dataset updated
Jan 31, 2025
Dataset provided by
United States Census Bureauhttp://census.gov/
Description
This data layer is an element of the Oregon GIS Framework. The TIGER/Line shapefiles and related database files (.dbf) are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) Database (MTDB). The MTDB represents a seamless national file with no overlaps or gaps between parts, however, each TIGER/Line shapefile is designed to stand alone as an independent data set, or they can be combined to cover the entire nation. Census tracts are small, relatively permanent statistical subdivisions of a county or equivalent entity, and were defined by local participants as part of the 2020 Census Participant Statistical Areas Program. The Census Bureau delineated the census tracts in situations where no local participant existed or where all the potential participants declined to participate. The primary purpose of census tracts is to provide a stable set of geographic units for the presentation of census data and comparison back to previous decennial censuses. Census tracts generally have a population size between 1,200 and 8,000 people, with an optimum size of 4,000 people. When first delineated, census tracts were designed to be homogeneous with respect to population characteristics, economic status, and living conditions. The spatial size of census tracts varies widely depending on the density of settlement. Physical changes in street patterns caused by highway construction, new development, and so forth, may require boundary revisions. In addition, census tracts occasionally are split due to population growth, or combined as a result of substantial population decline. Census tract boundaries generally follow visible and identifiable features. They may follow legal boundaries such as minor civil division (MCD) or incorporated place boundaries in some States and situations to allow for census tract-to-governmental unit relationships where the governmental boundaries tend to remain unchanged between censuses. State and county boundaries always are census tract boundaries in the standard census geographic hierarchy. In a few rare instances, a census tract may consist of noncontiguous areas. These noncontiguous areas may occur where the census tracts are coextensive with all or parts of legal entities that are themselves noncontiguous. For the 2010 Census and beyond, the census tract code range of 9400 through 9499 was enforced for census tracts that include a majority American Indian population according to Census 2000 data and/or their area was primarily covered by federally recognized American Indian reservations and/or off-reservation trust lands; the code range 9800 through 9899 was enforced for those census tracts that contained little or no population and represented a relatively large special land use area such as a National Park, military installation, or a business/industrial park; and the code range 9900 through 9998 was enforced for those census tracts that contained only water area, no land area.
WSDOT - Population Centers
geo.wa.gov
data-wutc.opendata.arcgis.com
+2more
Updated Nov 30, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
WSDOT Online Map Center (2021). WSDOT - Population Centers [Dataset]. https://geo.wa.gov/datasets/WSDOT::wsdot-population-centers
Explore at:
Dataset updated
Nov 30, 2021
Dataset provided by
Washington State Department of Transportationhttps://wsdot.wa.gov/
Authors
WSDOT Online Map Center
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Area covered

Description
Per RCW 47.04.010, "population center" includes incorporated cities and towns, including their urban growth areas, and census-designated places in Washington State. The WSDOT Population Center dataset combines the WSDOT Incorporated City Limits dataset (May 2021) with the Office of Financial Management’s Census Designated Places (2020 Census) Dataset. Identification of Population Centers enables WSDOT to address the Complete Streets requirement under RCW 47.04.035 and to otherwise identify locations prioritized in the 2021 WSDOT Active Transportation Plan (ATP). WSDOT may also recognize other developed areas as exhibiting land use patterns consistent with the definition of population center, that are not currently captured by this data layer.This data layer assists WSDOT in prioritizing active transportation improvements in areas where people congregate and access destinations, and where travel distances between destinations align with typical distances travelled by users of pedestrian and bicycle modes. These areas are a priority because they serve the broadest range of users and potential users of the transportation system, including the very young, very old, and people with disabilities. In this dataset, each Population Center includes information for the “Place Name”, the “Place Type” (city/town, Urban Growth Area outside of city limits, or Census Designated Place), and whether or not the Population Center intersects a State Route (“yes” indicates that there is an intersection with a State Route, “no” indicates that there is no intersection.). The dataset will be updated as needed. Please direct questions about the Population Centers dataset to: Grace.Young@wsdot.wa.gov.
Population Estimates: Estimates by Age Group, Sex, Race, and Hispanic Origin...
catalog.data.gov
s.cnmilf.com
Updated Jul 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Census Bureau (2023). Population Estimates: Estimates by Age Group, Sex, Race, and Hispanic Origin [Dataset]. https://catalog.data.gov/dataset/population-estimates-estimates-by-age-group-sex-race-and-hispanic-origin
Explore at:
Dataset updated
Jul 19, 2023
Dataset provided by
United States Census Bureauhttp://census.gov/
Description
Annual Resident Population Estimates by Age Group, Sex, Race, and Hispanic Origin; for the United States, States, Counties; and for Puerto Rico and its Municipios: April 1, 2010 to July 1, 2019 // Source: U.S. Census Bureau, Population Division // The contents of this file are released on a rolling basis from December through June. // Note: 'In combination' means in combination with one or more other races. The sum of the five race-in-combination groups adds to more than the total population because individuals may report more than one race. Hispanic origin is considered an ethnicity, not a race. Hispanics may be of any race. Responses of 'Some Other Race' from the 2010 Census are modified. This results in differences between the population for specific race categories shown for the 2010 Census population in this file versus those in the original 2010 Census data. The estimates are based on the 2010 Census and reflect changes to the April 1, 2010 population due to the Count Question Resolution program and geographic program revisions. // Current data on births, deaths, and migration are used to calculate population change since the 2010 Census. An annual time series of estimates is produced, beginning with the census and extending to the vintage year. The vintage year (e.g., Vintage 2019) refers to the final year of the time series. The reference date for all estimates is July 1, unless otherwise specified. With each new issue of estimates, the entire estimates series is revised. Additional information, including historical and intercensal estimates, evaluation estimates, demographic analysis, research papers, and methodology is available on website: https://www.census.gov/programs-surveys/popest.html.
d
Census_sum_15
catalog.data.gov
datasets.ai
+2more
Updated Jul 6, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Census_sum_15 [Dataset]. https://catalog.data.gov/dataset/census-sum-15
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Description
The GIS layer "Census_sum_15" provides a standardized tool for examining spatial patterns in abundance and demographic trends of the southern sea otter (Enhydra lutris nereis), based on data collected during the spring 2015 range-wide census. The USGS range-wide sea otter census has been undertaken twice a year since 1982, once in May and once in October, using consistent methodology involving both ground-based and aerial-based counts. The spring census is considered more accurate than the fall count, and provides the primary basis for gauging population trends by State and Federal management agencies. This Shape file includes a series of summary statistics derived from the raw census data, including sea otter density (otters per square km of habitat), linear density (otters per km of coastline), relative pup abundance (ratio of pups to independent animals) and 5-year population trend (calculated as exponential rate of change). All statistics are calculated and plotted for small sections of habitat in order to illustrate local variation in these statistics across the entire mainland distribution of sea otters in California (as of 2015). Sea otter habitat is considered to extend offshore from the mean low tide line and out to the 60m isobath: this depth range includes over 99% of sea otter feeding dives, based on dive-depth data from radio tagged sea otters (Tinker et al 2006, 2007). Sea otter distribution in California (the mainland range) is considered to comprise this band of potential habitat stretching along the coast of California, and bounded to the north and south by range limits defined as "the points farthest from the range center at which 5 or more otters are counted within a 10km contiguous stretch of coastline (as measured along the 10m bathymetric contour) during the two most recent spring censuses, or at which these same criteria were met in the previous year". The polygon corresponding to the range definition was then sub-divided into onshore/offshore strips roughly 500 meters in width. The boundaries between these strips correspond to ATOS (As-The-Otter-Swims) points, which are arbitrary locations established approximately every 500 meters along a smoothed 5 fathom bathymetric contour (line) offshore of the State of California.
f
Prevalence and patterns of multi-morbidity among 30-69 years old population...
figshare.com
xls
Updated Sep 29, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rohini; Panniyammakal Jeemon (2020). Prevalence and patterns of multi-morbidity among 30-69 years old population of rural Pathanamthitta, a district of Kerala, India: A cross-sectional study [Dataset]. http://doi.org/10.6084/m9.figshare.12494681.v4
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.12494681.v4
Dataset updated
Sep 29, 2020
Dataset provided by
figshare
Authors
Rohini; Panniyammakal Jeemon
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Kerala
Description
Data set of a community based cross-sectional survey done to find the prevalence , its correlates and patterns in a population of a district in southern Kerala, IndiaBackground: Multi-morbidity is the coexistence of multiple chronic conditions in the same individual. With advancing epidemiological and demographic transitions, the burden of multi-morbidity is expected to increase India. The state of Kerala in India is also in an advanced phase of epidemiological transition. However, very limited data on prevalence of multi-morbidity are available in the Kerala population.

Methods: A cross sectional survey was conducted among 410 participants in the age group of 30-69 years. A multi-stage cluster sampling method was employed to identify the study participants. Every eligible participant in the household were interviewed to assess the household prevalence. A structured interview schedule was used to assess socio-demographic variables, behavioral risk factors and prevailing clinical conditions, PHQ-9 questionnaire for screening of depression and active measurement of blood sugar and blood pressure. Co-existence of two or more conditions out of 11 was used as multi-morbidity case definition. Bivariate analyses were done to understand the association between socio-demographic factors and multi-morbidity. Logistic regression analyses were performed to estimate the effect size of these variables on multi-morbidity.

Results: Overall, the prevalence of multi-morbidity was 45.4% (95% CI: 40.5-50.3%). Nearly a quarter of study participants (25.4%) reported only one chronic condition (21.3-29.9%). Further, 30.7% (26.3-35.5), 10.7% (7.9-14.2), 3.7% (2.1-6.0) and 0.2% reported two, three, four and five chronic conditions, respectively. Nearly seven out of ten households (72%, 95%CI: 65-78%) had at least one person in the household with multi-morbidity and one in five households (22%, 95%CI: 16.7-28.9%) had more than one person with multi-morbidity. With every year increase in age, the propensity for multi-morbidity increased by 10 percent (OR=1.1; 95% CI: 1.1-1.2). Males and participants with low levels of education were less likely to suffer from multi-morbidity while unemployed and who do recommended level of physical activity were significantly more likely to suffer from multi-morbidity. Diabetes and hypertension was the most frequent dyad.

Conclusion: One of two participants in the productive age group of 30-69 years report multi-morbidity. Further, seven of ten households have at least one person with multi-morbidity. Preventive and management guidelines for chronic non-communicable conditions should focus on multi-morbidity especially in the older age group. Health-care systems that function within the limits of vertical disease management and episodic care (e.g., maternal health, tuberculosis, malaria, cardiovascular disease, mental health etc.) require optimal re-organization and horizontal integration of care across disease domains in managing people with multiple chronic conditions.

Key words: Multi-morbidity, cross-sectional, household, active measurement, rural, India, pattern
c
Consumer Behavior and Shopping Habits Dataset:
cubig.ai
Updated May 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2025). Consumer Behavior and Shopping Habits Dataset: [Dataset]. https://cubig.ai/store/products/352/consumer-behavior-and-shopping-habits-dataset
Explore at:
Dataset updated
May 28, 2025
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
Description
1) Data Introduction • The Consumer Behavior and Shopping Habits Dataset is a tabular collection of customer demographics, purchase history, product preferences, shopping frequency, and online and offline purchasing behavior.

2) Data Utilization (1) Consumer Behavior and Shopping Habits Dataset has characteristics that: • Each row contains detailed consumer and transaction information such as customer ID, age, gender, purchased goods and categories, purchase amount, region, product attributes (size, color, season), review rating, subscription status, delivery method, discount/promotion usage, payment method, purchase frequency, etc. • Data is organized to cover a variety of variables and purchasing patterns to help segment customers, establish marketing strategies, analyze product preferences, and more. (2) Consumer Behavior and Shopping Habits Dataset can be used to: • Customer Segmentation and Target Marketing: You can analyze demographics and purchasing patterns to define different customer groups and use them to develop customized marketing strategies. • Product and service improvement: Based on purchase history, review ratings, discount/promotional responses, etc., it can be applied to product and service improvements such as identifying popular products, managing inventory, and analyzing promotion effects.
e
Dataset Direct Download Service (WFS): Legislative constituencies of the...
data.europa.eu
unknown
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dataset Direct Download Service (WFS): Legislative constituencies of the department of Aveyron [Dataset]. https://data.europa.eu/data/datasets/fr-120066022-srv-0373cd0c-8e54-4d5a-acc7-8f6741c2c93f
Explore at:
unknownAvailable download formats
Description
Legislative constituencies of the Department of Aveyron. Legislative constituencies are the territorial framework for the election of each deputy to the National Assembly, elected by direct universal suffrage for a renewable term of five years (unless the legislature is interrupted by dissolution, Article 24 of the Constitution).

The legislative constituency is designated by a serial number within each department.

The establishment of the deputies’ electoral districts, the method of election of the deputies and the first division into legislative constituencies date from the advent of the Fifth Republic in 1958.

Note

According to Article 2 of Law No 86-825 of 11 July 1986: “The electoral boundaries shall be revised in accordance with demographic trends after the second general census of the population following the last delimitation.” (INSEE definition).
Demographic and Health Survey 2017 - Indonesia
microdata.worldbank.org
catalog.ihsn.org
+1more
Updated Jul 12, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ministry of Health (Kemenkes) (2019). Demographic and Health Survey 2017 - Indonesia [Dataset]. https://microdata.worldbank.org/index.php/catalog/3477
Explore at:
Dataset updated
Jul 12, 2019
Dataset provided by
Statistics Indonesiahttp://www.bps.go.id/
National Population and Family Planning Board (BKKBN)
Ministry of Health (Kemenkes)
Time period covered
2017
Area covered
Indonesia
Description
Abstract

The primary objective of the 2017 Indonesia Dmographic and Health Survey (IDHS) is to provide up-to-date estimates of basic demographic and health indicators. The IDHS provides a comprehensive overview of population and maternal and child health issues in Indonesia. More specifically, the IDHS was designed to: - provide data on fertility, family planning, maternal and child health, and awareness of HIV/AIDS and sexually transmitted infections (STIs) to help program managers, policy makers, and researchers to evaluate and improve existing programs; - measure trends in fertility and contraceptive prevalence rates, and analyze factors that affect such changes, such as residence, education, breastfeeding practices, and knowledge, use, and availability of contraceptive methods; - evaluate the achievement of goals previously set by national health programs, with special focus on maternal and child health; - assess married men’s knowledge of utilization of health services for their family’s health and participation in the health care of their families; - participate in creating an international database to allow cross-country comparisons in the areas of fertility, family planning, and health.

Geographic coverage

National coverage

Analysis unit

Household

Individual

Children age 0-5

Woman age 15-49

Man age 15-54

Universe

The survey covered all de jure household members (usual residents), all women age 15-49 years resident in the household, and all men age 15-54 years resident in the household.

Kind of data

Sample survey data [ssd]

Sampling procedure

The 2017 IDHS sample covered 1,970 census blocks in urban and rural areas and was expected to obtain responses from 49,250 households. The sampled households were expected to identify about 59,100 women age 15-49 and 24,625 never-married men age 15-24 eligible for individual interview. Eight households were selected in each selected census block to yield 14,193 married men age 15-54 to be interviewed with the Married Man's Questionnaire. The sample frame of the 2017 IDHS is the Master Sample of Census Blocks from the 2010 Population Census. The frame for the household sample selection is the updated list of ordinary households in the selected census blocks. This list does not include institutional households, such as orphanages, police/military barracks, and prisons, or special households (boarding houses with a minimum of 10 people).

The sampling design of the 2017 IDHS used two-stage stratified sampling: Stage 1: Several census blocks were selected with systematic sampling proportional to size, where size is the number of households listed in the 2010 Population Census. In the implicit stratification, the census blocks were stratified by urban and rural areas and ordered by wealth index category.

Stage 2: In each selected census block, 25 ordinary households were selected with systematic sampling from the updated household listing. Eight households were selected systematically to obtain a sample of married men.

For further details on sample design, see Appendix B of the final report.

Mode of data collection

Face-to-face [f2f]

Research instrument

The 2017 IDHS used four questionnaires: the Household Questionnaire, Woman’s Questionnaire, Married Man’s Questionnaire, and Never Married Man’s Questionnaire. Because of the change in survey coverage from ever-married women age 15-49 in the 2007 IDHS to all women age 15-49, the Woman’s Questionnaire had questions added for never married women age 15-24. These questions were part of the 2007 Indonesia Young Adult Reproductive Survey Questionnaire. The Household Questionnaire and the Woman’s Questionnaire are largely based on standard DHS phase 7 questionnaires (2015 version). The model questionnaires were adapted for use in Indonesia. Not all questions in the DHS model were included in the IDHS. Response categories were modified to reflect the local situation.

Cleaning operations

All completed questionnaires, along with the control forms, were returned to the BPS central office in Jakarta for data processing. The questionnaires were logged and edited, and all open-ended questions were coded. Responses were entered in the computer twice for verification, and they were corrected for computer-identified errors. Data processing activities were carried out by a team of 34 editors, 112 data entry operators, 33 compare officers, 19 secondary data editors, and 2 data entry supervisors. The questionnaires were entered twice and the entries were compared to detect and correct keying errors. A computer package program called Census and Survey Processing System (CSPro), which was specifically designed to process DHS-type survey data, was used in the processing of the 2017 IDHS.

Response rate

Of the 49,261 eligible households, 48,216 households were found by the interviewer teams. Among these households, 47,963 households were successfully interviewed, a response rate of almost 100%.

In the interviewed households, 50,730 women were identified as eligible for individual interview and, from these, completed interviews were conducted with 49,627 women, yielding a response rate of 98%. From the selected household sample of married men, 10,440 married men were identified as eligible for interview, of which 10,009 were successfully interviewed, yielding a response rate of 96%. The lower response rate for men was due to the more frequent and longer absence of men from the household. In general, response rates in rural areas were higher than those in urban areas.

Sampling error estimates

The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors result from mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2017 Indonesia Demographic and Health Survey (2017 IDHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.

Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2017 IDHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling error is a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.

A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.

If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2017 IDHS sample is the result of a multi-stage stratified design, and, consequently, it was necessary to use more complex formulas. The computer software used to calculate sampling errors for the 2017 IDHS is a STATA program. This program used the Taylor linearization method for variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.

A more detailed description of estimates of sampling errors are presented in Appendix C of the survey final report.

Data appraisal

Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Age distribution of eligible and interviewed men - Completeness of reporting - Births by calendar year - Reporting of age at death in days - Reporting of age at death in months

See details of the data quality tables in Appendix D of the survey final report.
Trends in COVID-19 Cases and Deaths in the United States, by County-level...
data.virginia.gov
healthdata.gov
+1more
csv, json, rdf, xsl
Updated Jan 13, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention (2025). Trends in COVID-19 Cases and Deaths in the United States, by County-level Population Factors - ARCHIVED [Dataset]. https://data.virginia.gov/dataset/trends-in-covid-19-cases-and-deaths-in-the-united-states-by-county-level-population-factors-arc
Explore at:
csv, json, xsl, rdfAvailable download formats
Dataset updated
Jan 13, 2025
Dataset provided by
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Area covered
United States
Description
Reporting of Aggregate Case and Death Count data was discontinued on May 11, 2023, with the expiration of the COVID-19 public health emergency declaration. Although these data will continue to be publicly available, this dataset will no longer be updated.

The surveillance case definition for COVID-19, a nationally notifiable disease, was first described in a position statement from the Council for State and Territorial Epidemiologists, which was later revised. However, there is some variation in how jurisdictions implemented these case definitions. More information on how CDC collects COVID-19 case surveillance data can be found at FAQ: COVID-19 Data and Surveillance.

Aggregate Data Collection Process Since the beginning of the COVID-19 pandemic, data were reported from state and local health departments through a robust process with the following steps:
Aggregate county-level counts were obtained indirectly, via automated overnight web collection, or directly, via a data submission process.
If more than one official county data source existed, CDC used a comprehensive data selection process comparing each official county data source to retrieve the highest case and death counts, unless otherwise specified by the state.
A CDC data team reviewed counts for congruency prior to integration and set up alerts to monitor for discrepancies in the data.
CDC routinely compiled these data and post the finalized information on COVID Data Tracker.
County level data were aggregated to obtain state- and territory- specific totals.
Counting of cases and deaths is based on date of report and not on the date of symptom onset. CDC calculates rates in these data by using population estimates provided by the US Census Bureau Population Estimates Program (2019 Vintage).
COVID-19 aggregate case and death data are organized in a time series that includes cumulative number of cases and deaths as reported by a jurisdiction on a given date. New case and death counts are calculated as the week-to-week change in cumulative counts of cases and deaths reported (i.e., newly reported cases and deaths = cumulative number of cases/deaths reported this week minus the cumulative total reported the prior week.

This process was collaborative, with CDC and jurisdictions working together to ensure the accuracy of COVID-19 case and death numbers. County counts provided the most up-to-date numbers on cases and deaths by report date. Throughout data collection, CDC retrospectively updated counts to correct known data quality issues.

Description This archived public use dataset focuses on the cumulative and weekly case and death rates per 100,000 persons within various sociodemographic factors across all states and their counties. All resulting data are expressed as rates calculated as the number of cases or deaths per 100,000 persons in counties meeting various classification criteria using the US Census Bureau Population Estimates Program (2019 Vintage).

Each county within jurisdictions is classified into multiple categories for each factor. All rates in this dataset are based on classification of counties by the characteristics of their population, not individual-level factors. This applies to each of the available factors observed in this dataset. Specific factors and their corresponding categories are detailed below.

Population-level factors Each unique population factor is detailed below. Please note that the “Classification” column describes each of the 12 factors in the dataset, including a data dict
H
2020 General Election Voting by US Census Block Group
dataverse.harvard.edu
Updated Mar 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michael Bryan (2025). 2020 General Election Voting by US Census Block Group [Dataset]. http://doi.org/10.7910/DVN/NKNWBX
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/NKNWBX
Dataset updated
Mar 10, 2025
Dataset provided by
Harvard Dataverse
Authors
Michael Bryan
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
PROBLEM AND OPPORTUNITY In the United States, voting is largely a private matter. A registered voter is given a randomized ballot form or machine to prevent linkage between their voting choices and their identity. This disconnect supports confidence in the election process, but it provides obstacles to an election's analysis. A common solution is to field exit polls, interviewing voters immediately after leaving their polling location. This method is rife with bias, however, and functionally limited in direct demographics data collected. For the 2020 general election, though, most states published their election results for each voting location. These publications were additionally supported by the geographical areas assigned to each location, the voting precincts. As a result, geographic processing can now be applied to project precinct election results onto Census block groups. While precinct have few demographic traits directly, their geographies have characteristics that make them projectable onto U.S. Census geographies. Both state voting precincts and U.S. Census block groups: are exclusive, and do not overlap are adjacent, fully covering their corresponding state and potentially county have roughly the same size in area, population and voter presence Analytically, a projection of local demographics does not allow conclusions about voters themselves. However, the dataset does allow statements related to the geographies that yield voting behavior. One could say, for example, that an area dominated by a particular voting pattern would have mean traits of age, race, income or household structure. The dataset that results from this programming provides voting results allocated by Census block groups. The block group identifier can be joined to Census Decennial and American Community Survey demographic estimates. DATA SOURCES The state election results and geographies have been compiled by Voting and Election Science team on Harvard's dataverse. State voting precincts lie within state and county boundaries. The Census Bureau, on the other hand, publishes its estimates across a variety of geographic definitions including a hierarchy of states, counties, census tracts and block groups. Their definitions can be found here. The geometric shapefiles for each block group are available here. The lowest level of this geography changes often and can obsolesce before the next census survey (Decennial or American Community Survey programs). The second to lowest census level, block groups, have the benefit of both granularity and stability however. The 2020 Decennial survey details US demographics into 217,740 block groups with between a few hundred and a few thousand people. Dataset Structure The dataset's columns include: Column Definition BLOCKGROUP_GEOID 12 digit primary key. Census GEOID of the block group row. This code concatenates: 2 digit state 3 digit county within state 6 digit Census Tract identifier 1 digit Census Block Group identifier within tract STATE State abbreviation, redundent with 2 digit state FIPS code above REP Votes for Republican party candidate for president DEM Votes for Democratic party candidate for president LIB Votes for Libertarian party candidate for president OTH Votes for presidential candidates other than Republican, Democratic or Libertarian AREA square kilometers of area associated with this block group GAP total area of the block group, net of area attributed to voting precincts PRECINCTS Number of voting precincts that intersect this block group ASSUMPTIONS, NOTES AND CONCERNS: Votes are attributed based upon the proportion of the precinct's area that intersects the corresponding block group. Alternative methods are left to the analyst's initiative. 50 states and the District of Columbia are in scope as those U.S. possessions voting in the general election for the U.S. Presidency. Three states did not report their results at the precinct level: South Dakota, Kentucky and West Virginia. A dummy block group is added for each of these states to maintain national totals. These states represent 2.1% of all votes cast. Counties are commonly coded using FIPS codes. However, each election result file may have the county field named differently. Also, three states do not share county definitions - Delaware, Massachusetts, Alaska and the District of Columbia. Block groups may be used to capture geographies that do not have population like bodies of water. As a result, block groups without intersection voting precincts are not uncommon. In the U.S., elections are administered at a state level with the Federal Elections Commission compiling state totals against the Electoral College weights. The states have liberty, though, to define and change their own voting precincts https://en.wikipedia.org/wiki/Electoral_precinct. The Census Bureau practices "data suppression", filtering some block groups from demographic publication because they do not meet a population threshold. This practice...
a
Electricity Access, Africa
hub.arcgis.com
Updated Jan 20, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UN Environment, Early Warning &Data Analytics (2016). Electricity Access, Africa [Dataset]. https://hub.arcgis.com/maps/9ec221b2a63745e586ac258e0827c6a5
Explore at:
Dataset updated
Jan 20, 2016
Dataset authored and provided by
UN Environment, Early Warning &Data Analytics
Area covered

Description
This map shows electricity access in Africa. The data source is from the International Energy Agency’s World Energy Outlook. The International Energy Agency’s World Energy Outlook first constructed a database on electrification rates for WEO-2002. The database once again was updated for WEO-2015, showing detailed data on national, urban and rural electrification.

The general paucity of data on electricity access means that it must be gathered through a combination of sources, including: IEA energy statistics; a network of contacts spanning governments, multilateral development banks and country-level representatives of various international organisations; and, other publicly available statistics, such as US Agency for International Development (USAID) supported DHS survey data, the World Bank’s Living Standards Measurement Surveys (LSMS), the UN Economic Commission for Latin America and the Caribbean’s (ECLAC) statistical publications, and data from national statistics agencies. In the small number of cases where no data could be provided through these channels other sources were used. If electricity access data for 2013 was not available, data for the latest available year was used.

For many countries, data on the urban and rural breakdown was collected, but if not available an estimate was made on the basis of pre-existing data or a comparison to the average correlation between urban and national electrification rates. Often only the percentage of households with a connection is known and assumptions about an average household size are used to determine access rates as a percentage of the population. To estimate the number of people without access, population data comes from OECD statistics in conjunction with the United Nations Population Division reports World Urbanization Prospects: the 2014 Revision Population Database, and World Population Prospects: the 2012 Revision. Electricity access data is adjusted to be consistent with demographic patterns of urban and rural population. Due to differences in definitions and methodology from different sources, data quality may vary from country to country. Where country data appeared contradictory, outdated or unreliable, the IEA Secretariat made estimates based on cross-country comparisons and earlier surveys.
2023 Census internal migration by TALB
datafinder.stats.govt.nz
csv, dbf (dbase iii) +4
Updated Mar 7, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stats NZ (2023). 2023 Census internal migration by TALB [Dataset]. https://datafinder.stats.govt.nz/table/122425-2023-census-internal-migration-by-talb/
Explore at:
csv, geodatabase, mapinfo tab, mapinfo mif, geopackage / sqlite, dbf (dbase iii)Available download formats
Dataset updated
Mar 7, 2023
Dataset provided by
Statistics New Zealandhttp://www.stats.govt.nz/
Authors
Stats NZ
License
https://datafinder.stats.govt.nz/license/attribution-4-0-international/https://datafinder.stats.govt.nz/license/attribution-4-0-international/
Description
Dataset contains counts for territorial authority local board area (TALB) of usual residence by TALB of usual residence address one year ago and five years ago, and by life cycle age group, for the census usually resident population count, 2023 Census.

This dataset compares usual residence at the 2023 Census with usual residence one and five years earlier to show population mobility and internal migration patterns of people within New Zealand.

‘Usual residence address’ is the address of the dwelling where a person considers that they usually live.

‘Usual residence one year ago address’ identifies an individual’s usual residence on 7 March 2022, which may be different to their current usual residence on census night 2023 (7 March 2023).

‘Usual residence five years ago address’ identifies an individual’s usual residence on 6 March 2018, which may be different to their current usual residence on census night 2023 (7 March 2023).

Note: This dataset only includes usual residence address information for individuals whose usual residence address one year ago and five years ago is available at TALB.

Life cycle age groups are categorised as:

under 15 years

15–29 years

30–64 years

65 years and over.

This dataset can be used in conjunction with the following spatial files by joining on the TALB code values:

Territorial authority local board (generalised)

Footnotes

Geographical boundaries

Statistical standard for geographic areas 2023 (updated December 2023) has information about geographic boundaries as of 1 January 2023. Address data from 2013 and 2018 Censuses was updated to be consistent with the 2023 areas. Due to the changes in area boundaries and coding methodologies, 2013 and 2018 counts published in 2023 may be slightly different to those published in 2013 or 2018.

Subnational census usually resident population

The census usually resident population count of an area (subnational count) is a count of all people who usually live in that area and were present in New Zealand on census night. It excludes visitors from overseas, visitors from elsewhere in New Zealand, and residents temporarily overseas on census night. For example, a person who usually lives in Christchurch city and is visiting Wellington city on census night will be included in the census usually resident population count of Christchurch city. 

Population counts

Stats NZ publishes a number of different population counts, each using a different definition and methodology. Population statistics – user guide has more information about different counts. 

Rows excluded from the dataset

Rows show TALB of usual residence by TALB of usual residence one year ago and five years ago, by life cycle age group. Cells with a number less than six have been confidentialised. Responses to categories unable to be mapped, such as response unidentifiable, not stated, and Auckland (not further defined), have also been excluded from this dataset.

About the 2023 Census dataset

For information on the 2023 dataset see Using a combined census model for the 2023 Census. We combined data from the census forms with administrative data to create the 2023 Census dataset, which meets Stats NZ's quality criteria for population structure information. We added real data about real people to the dataset where we were confident the people who hadn’t completed a census form (which is known as admin enumeration) will be counted. We also used data from the 2018 and 2013 Censuses, administrative data sources, and statistical imputation methods to fill in some missing characteristics of people and dwellings.

Data quality

The quality of data in the 2023 Census is assessed using the quality rating scale and the quality assurance framework to determine whether data is fit for purpose and suitable for release. Data quality assurance in the 2023 Census has more information.

Quality rating of a variable

The quality rating of a variable provides an overall evaluation of data quality for that variable, usually at the highest levels of classification. The quality ratings shown are for the 2023 Census unless stated. There is variability in the quality of data at smaller geographies. Data quality may also vary between censuses, for subpopulations, or when cross tabulated with other variables or at lower levels of the classification. Data quality ratings for 2023 Census variables has more information on quality ratings by variable.

Age quality rating

Age is rated as very high quality.

Age – 2023 Census: Information by concept has more information, for example, definitions and data quality.

Census usually resident population quality rating

The census usually resident population count is rated as very high quality.

Census usually resident population count – 2023 Census: Information by concept has more information, for example, definitions and data quality.

Usual residence address quality rating

Usual residence address is rated as high quality.

Usual residence address – 2023 Census: Information by concept has more information, for example, definitions and data quality.

Usual residence one year ago quality rating

Usual residence one year ago area is rated as high quality.

Usual residence one year ago – 2023 Census: Information by concept has more information, for example, definitions and data quality.

Usual residence five years ago quality rating

Usual residence five years ago area is rated as high quality.

Usual residence five years ago – 2023 Census: Information by concept has more information, for example, definitions and data quality.

Using data for good

Stats NZ expects that, when working with census data, it is done so with a positive purpose, as outlined in the Māori Data Governance Model (Data Iwi Leaders Group, 2023). This model states that "data should support transformative outcomes and should uplift and strengthen our relationships with each other and with our environments. The avoidance of harm is the minimum expectation for data use. Māori data should also contribute to iwi and hapū tino rangatiratanga”.

Confidentiality

The 2023 Census confidentiality rules have been applied to 2013, 2018, and 2023 data. These rules protect the confidentiality of individuals, families, households, dwellings, and undertakings in 2023 Census data. Counts are calculated using fixed random rounding to base 3 (FRR3) and suppression of ‘sensitive’ counts less than six, where tables report multiple geographic variables and/or small populations. Individual figures may not always sum to stated totals. Applying confidentiality rules to 2023 Census data and summary of changes since 2018 and 2013 Censuses has more information about 2023 Census confidentiality rules.

Symbol

-999 Confidential

Inconsistencies in definitions

Please note that there may be differences in definitions between census classifications and those used for other data collections.
S
Trends in COVID-19 Cases and Deaths in the United States, by County-level...
splitgraph.com
Updated Jun 8, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
cdc-gov (2023). Trends in COVID-19 Cases and Deaths in the United States, by County-level Population Factors - ARCHIVED [Dataset]. https://www.splitgraph.com/cdc-gov/trends-in-covid19-cases-and-deaths-in-the-united-njmz-dpbc/
Explore at:
application/vnd.splitgraph.image, application/openapi+json, jsonAvailable download formats
Dataset updated
Jun 8, 2023
Authors
cdc-gov
Area covered
United States
Description
Reporting of Aggregate Case and Death Count data was discontinued on May 11, 2023, with the expiration of the COVID-19 public health emergency declaration. Although these data will continue to be publicly available, this dataset will no longer be updated.

The surveillance case definition for COVID-19, a nationally notifiable disease, was first described in a position statement from the Council for State and Territorial Epidemiologists, which was later revised. However, there is some variation in how jurisdictions implemented these case definitions. More information on how CDC collects COVID-19 case surveillance data can be found at FAQ: COVID-19 Data and Surveillance.

Aggregate Data Collection Process

Since the beginning of the COVID-19 pandemic, data were reported from state and local health departments through a robust process with the following steps:

This process was collaborative, with CDC and jurisdictions working together to ensure the accuracy of COVID-19 case and death numbers. County counts provided the most up-to-date numbers on cases and deaths by report date. Throughout data collection, CDC retrospectively updated counts to correct known data quality issues.

Description

This archived public use dataset focuses on the cumulative and weekly case and death rates per 100,000 persons within various sociodemographic factors across all states and their counties. All resulting data are expressed as rates calculated as the number of cases or deaths per 100,000 persons in counties meeting various classification criteria using the US Census Bureau Population Estimates Program (2019 Vintage).

Each county within jurisdictions is classified into multiple categories for each factor. All rates in this dataset are based on classification of counties by the characteristics of their population, not individual-level factors. This applies to each of the available factors observed in this dataset. Specific factors and their corresponding categories are detailed below.

Population-level factors

Each unique population factor is detailed below. Please note that the “Classification” column describes each of the 12 factors in the dataset, including a data dict

Splitgraph serves as an HTTP API that lets you run SQL queries directly on this data to power Web applications. For example:

See the Splitgraph documentation for more information.

Facebook

Twitter

Click to copy link

Link copied

Cite

(2025). Population Projection [Dataset]. https://datasource.kapsarc.org/explore/dataset/population-projection/

Population Projection

Explore at:

Dataset updated

Mar 10, 2025

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Explore population projections for China on this dataset webpage. Get valuable insights into the future demographic trends of one of the world's most populous countries.

Population, China, projections ChinaFollow data.kapsarc.org for timely data to advance energy economics research..Total population is based on the de facto definition of population, which counts all residents regardless of legal status or citizenship. The values shown are midyear estimatesSource: (1) United Nations Population Division. World Population Prospects: 2019 Revision. (2) Census reports and other statistical publications from national statistical offices, (3) Eurostat: Demographic Statistics, (4) United Nations Statistical Division. Population and Vital Statistics Reprot (various years), (5) U.S. Census Bureau: International Database, and (6) Secretariat of the Pacific Community: Statistics and Demography Programme.

Clear search

Close search

Google apps

Main menu

Population Projection

Retail Sales Dataset

POPLINK - Demographic Data Base

Demographic monitoring of wild muriqui populations: Criteria for defining...

U.S. Voting by Census Block Groups

[Archived] COVID-19 Deaths by Population Characteristics Over Time

COVID-19 Deaths by Population Characteristics

2020 Census Tracts

WSDOT - Population Centers

Population Estimates: Estimates by Age Group, Sex, Race, and Hispanic Origin...

Census_sum_15

Prevalence and patterns of multi-morbidity among 30-69 years old population...

Consumer Behavior and Shopping Habits Dataset:

Dataset Direct Download Service (WFS): Legislative constituencies of the...

Demographic and Health Survey 2017 - Indonesia

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Sampling error estimates

Data appraisal

Trends in COVID-19 Cases and Deaths in the United States, by County-level...

2020 General Election Voting by US Census Block Group

Electricity Access, Africa

2023 Census internal migration by TALB

Trends in COVID-19 Cases and Deaths in the United States, by County-level...

Population Projection