100+ datasets found

r
The database TABVERK
researchdata.se
Updated Jun 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Umeå university (2025). The database TABVERK [Dataset]. https://researchdata.se/en/catalogue/dataset/ext0084-1
Explore at:
Dataset updated
Jun 24, 2025
Dataset provided by
Umeå University
Authors
Umeå university
Time period covered
1749 - 1859
Description
The database TABVERK contains population statistics in the Swedish parishes during the period 1749-1859. It constitutes the earliest population statistics throughout the world making it a unique source. Merely in Finland, a similar time serial of data exists. The database gathers all the information about the parishes’ populations that Swedish clergymens filled out in large forms of tables which were reported to the Tabellkommissionen in Stockholm. Now this information is accessible through the agency of the Demographic Data Base.

The database contain information on births, deaths, marriage, and occupation. The population is describe by sex, age, civil status, and by occupation and social status every fifth year. Yearly statistics is given about the demographical events, fertility, mortality and nuptiality, as well as migrations in the 19th century. The statistics in Tabellverket (TABVERK) enable demographical studies from several angles on both local and regional level in Sweden during the pre-industrial society.

TABVERK holds an online search tool called SHiPS (Swedish Historical Population Statistics), which can be reached and utilized from the home page of the Demographic Data Base.

Purpose:

The Demographic Database (DDB) is a special unit at Umeå University with activities focused on two main areas: data production and research. DDB creates and makes available population databases, mainly based on historical information in parish registers during the 18th, 19th, and 20th centuries.
u
Population and Family Health Survey 2012 - Jordan
microdata.unhcr.org
catalog.ihsn.org
+2more
Updated May 19, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Statistics (DoS) (2021). Population and Family Health Survey 2012 - Jordan [Dataset]. https://microdata.unhcr.org/index.php/catalog/405
Explore at:
Dataset updated
May 19, 2021
Dataset authored and provided by
Department of Statistics (DoS)
Time period covered
2012
Area covered
Jordan
Description
Abstract

The Jordan Population and Family Health Survey (JPFHS) is part of the worldwide Demographic and Health Surveys Program, which is designed to collect data on fertility, family planning, and maternal and child health.

The primary objective of the 2012 Jordan Population and Family Health Survey (JPFHS) is to provide reliable estimates of demographic parameters, such as fertility, mortality, family planning, and fertility preferences, as well as maternal and child health and nutrition, that can be used by program managers and policymakers to evaluate and improve existing programs. The JPFHS data will be useful to researchers and scholars interested in analyzing demographic trends in Jordan, as well as those conducting comparative, regional, or cross-national studies.

Geographic coverage

National coverage

Analysis unit

Household

Women age 15-49

Kind of data

Sample survey data [ssd]

Sampling procedure

Sample Design The 2012 JPFHS sample was designed to produce reliable estimates of major survey variables for the country as a whole, urban and rural areas, each of the 12 governorates, and for the two special domains: the Badia areas and people living in refugee camps. To facilitate comparisons with previous surveys, the sample was also designed to produce estimates for the three regions (North, Central, and South). The grouping of the governorates into regions is as follows: the North consists of Irbid, Jarash, Ajloun, and Mafraq governorates; the Central region consists of Amman, Madaba, Balqa, and Zarqa governorates; and the South region consists of Karak, Tafiela, Ma'an, and Aqaba governorates.

The 2012 JPFHS sample was selected from the 2004 Jordan Population and Housing Census sampling frame. The frame excludes the population living in remote areas (most of whom are nomads), as well as those living in collective housing units such as hotels, hospitals, work camps, prisons, and the like. For the 2004 census, the country was subdivided into convenient area units called census blocks. For the purposes of the household surveys, the census blocks were regrouped to form a general statistical unit of moderate size (30 households or more), called a "cluster", which is widely used in surveys as a primary sampling unit (PSU).

Stratification was achieved by first separating each governorate into urban and rural areas and then, within each urban and rural area, by Badia areas, refugee camps, and other. A two-stage sampling procedure was employed. In the first stage, 806 clusters were selected with probability proportional to the cluster size, that is, the number of residential households counted in the 2004 census. A household listing operation was then carried out in all of the selected clusters, and the resulting lists of households served as the sampling frame for the selection of households in the second stage. In the second stage of selection, a fixed number of 20 households was selected in each cluster with an equal probability systematic selection. A subsample of two-thirds of the selected households was identified for anthropometry measurements.

Refer to Appendix A in the final report (Jordan Population and Family Health Survey 2012) for details of sampling weights calculation.

Mode of data collection

Face-to-face [f2f]

Research instrument

The 2012 JPFHS used two questionnaires, namely the Household Questionnaire and the Woman’s Questionnaire (see Appendix D). The Household Questionnaire was used to list all usual members of the sampled households, and visitors who slept in the household the night before the interview, and to obtain information on each household member’s age, sex, educational attainment, relationship to the head of the household, and marital status. In addition, questions were included on the socioeconomic characteristics of the household, such as source of water, sanitation facilities, and the availability of durable goods. Moreover, the questionnaire included questions about child discipline. The Household Questionnaire was also used to identify women who were eligible for the individual interview (ever-married women age 15-49 years). In addition, all women age 15-49 and children under age 5 living in the subsample of households were eligible for height and weight measurement and anemia testing.

The Woman’s Questionnaire was administered to ever-married women age 15-49 and collected information on the following topics: • Respondent’s background characteristics • Birth history • Knowledge, attitudes, and practice of family planning and exposure to family planning messages • Maternal health (antenatal, delivery, and postnatal care) • Immunization and health of children under age 5 • Breastfeeding and infant feeding practices • Marriage and husband’s background characteristics • Fertility preferences • Respondent’s employment • Knowledge of AIDS and sexually transmitted infections (STIs) • Other health issues specific to women • Early childhood development • Domestic violence

In addition, information on births, pregnancies, and contraceptive use and discontinuation during the five years prior to the survey was collected using a monthly calendar.

The Household and Woman’s Questionnaires were based on the model questionnaires developed by the MEASURE DHS program. Additions and modifications to the model questionnaires were made in order to provide detailed information specific to Jordan. The questionnaires were then translated into Arabic.

Anthropometric data were collected during the 2012 JPFHS in a subsample of two-thirds of the selected households in each cluster. All women age 15-49 and children age 0-4 in these households were measured for height using Shorr height boards and for weight using electronic Seca scales. In addition, a drop of capillary blood was taken from these women and children in the field to measure their hemoglobin level using the HemoCue system. Hemoglobin testing was used to estimate the prevalence of anemia.

Cleaning operations

Fieldwork and data processing activities overlapped. Data processing began two weeks after the start of the fieldwork. After field editing of questionnaires for completeness and consistency, the questionnaires for each cluster were packaged together and sent to the central office in Amman, where they were registered and stored. Special teams were formed to carry out office editing and coding of the openended questions.

Data entry and verification started after two weeks of office data processing. The process of data entry, including 100 percent reentry, editing, and cleaning, was done by using PCs and the CSPro (Census and Survey Processing) computer package, developed specially for such surveys. The CSPro program allows data to be edited while being entered. Data processing operations were completed by early January 2013. A data processing specialist from ICF International made a trip to Jordan in February 2013 to follow up on data editing and cleaning and to work on the tabulation of results for the survey preliminary report, which was published in March 2013. The tabulations for this report were completed in April 2013.

Response rate

In all, 16,120 households were selected for the survey and, of these, 15,722 were found to be occupied households. Of these households, 15,190 (97 percent) were successfully interviewed.

In the households interviewed, 11,673 ever-married women age 15-49 were identified and interviews were completed with 11,352 women, or 97 percent of all eligible women.

Sampling error estimates

The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2012 Jordan Population and Family Health Survey (JPFHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.

Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2012 JPFHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling error is a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.

A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.

If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2012 JPFHS sample is the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulae. The computer
n
Census Microdata Samples Project
neuinfo.org
dknet.org
+2more
Updated Jan 29, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Census Microdata Samples Project [Dataset]. http://identifiers.org/RRID:SCR_008902
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_008902
Dataset updated
Jan 29, 2022
Description
A data set of cross-nationally comparable microdata samples for 15 Economic Commission for Europe (ECE) countries (Bulgaria, Canada, Czech Republic, Estonia, Finland, Hungary, Italy, Latvia, Lithuania, Romania, Russia, Switzerland, Turkey, UK, USA) based on the 1990 national population and housing censuses in countries of Europe and North America to study the social and economic conditions of older persons. These samples have been designed to allow research on a wide range of issues related to aging, as well as on other social phenomena. A common set of nomenclatures and classifications, derived on the basis of a study of census data comparability in Europe and North America, was adopted as a standard for recoding. This series was formerly called Dynamics of Population Aging in ECE Countries. The recommendations regarding the design and size of the samples drawn from the 1990 round of censuses envisaged: (1) drawing individual-based samples of about one million persons; (2) progressive oversampling with age in order to ensure sufficient representation of various categories of older people; and (3) retaining information on all persons co-residing in the sampled individual''''s dwelling unit. Estonia, Latvia and Lithuania provided the entire population over age 50, while Finland sampled it with progressive over-sampling. Canada, Italy, Russia, Turkey, UK, and the US provided samples that had not been drawn specially for this project, and cover the entire population without over-sampling. Given its wide user base, the US 1990 PUMS was not recoded. Instead, PAU offers mapping modules, which recode the PUMS variables into the project''''s classifications, nomenclatures, and coding schemes. Because of the high sampling density, these data cover various small groups of older people; contain as much geographic detail as possible under each country''''s confidentiality requirements; include more extensive information on housing conditions than many other data sources; and provide information for a number of countries whose data were not accessible until recently. Data Availability: Eight of the fifteen participating countries have signed the standard data release agreement making their data available through NACDA/ICPSR (see links below). Hungary and Switzerland require a clearance to be obtained from their national statistical offices for the use of microdata, however the documents signed between the PAU and these countries include clauses stipulating that, in general, all scholars interested in social research will be granted access. Russia requested that certain provisions for archiving the microdata samples be removed from its data release arrangement. The PAU has an agreement with several British scholars to facilitate access to the 1991 UK data through collaborative arrangements. Statistics Canada and the Italian Institute of statistics (ISTAT) provide access to data from Canada and Italy, respectively. * Dates of Study: 1989-1992 * Study Features: International, Minority Oversamples * Sample Size: Approx. 1 million/country Links: * Bulgaria (1992), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/02200 * Czech Republic (1991), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06857 * Estonia (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06780 * Finland (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06797 * Romania (1992), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06900 * Latvia (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/02572 * Lithuania (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/03952 * Turkey (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/03292 * U.S. (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06219
Census of Population and Housing, 2010 [United States]: Summary File 2 With...
icpsr.umich.edu
search.datacite.org
Updated Jul 18, 2013
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States. Bureau of the Census (2013). Census of Population and Housing, 2010 [United States]: Summary File 2 With National Update [Dataset]. http://doi.org/10.3886/ICPSR34755.v1
Explore at:
Unique identifier
https://doi.org/10.3886/ICPSR34755.v1
Dataset updated
Jul 18, 2013
Dataset provided by
Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
Authors
United States. Bureau of the Census
License
https://www.icpsr.umich.edu/web/ICPSR/studies/34755/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/34755/terms
Time period covered
2010
Area covered
United States
Description
This data collection contains summary statistics on population and housing subjects derived from the responses to the 2010 Census questionnaire. Population items include sex, age, average household size, household type, and relationship to householder such as nonrelative or child. Housing items include tenure (whether a housing unit is owner-occupied or renter-occupied), age of householder, and household size for occupied housing units. Selected aggregates and medians also are provided. The summary statistics are presented in 71 tables, which are tabulated for multiple levels of observation (called "summary levels" in the Census Bureau's nomenclature), including, but not limited to, regions, divisions, states, metropolitan/micropolitan areas, counties, county subdivisions, places, ZIP Code Tabulation Areas (ZCTAs), school districts, census tracts, American Indian and Alaska Native areas, tribal subdivisions, and Hawaiian home lands. There are 10 population tables shown down to the county level and 47 population tables and 14 housing tables shown down to the census tract level. Every table cell is represented by a separate variable in the data. Each table is iterated for up to 330 population groups, which are called "characteristic iterations" in the Census Bureau's nomenclature: the total population, 74 race categories, 114 American Indian and Alaska Native categories, 47 Asian categories, 43 Native Hawaiian and Other Pacific Islander categories, and 51 Hispanic/not Hispanic groups. Moreover, the tables for some large summary areas (e.g., regions, divisions, and states) are iterated for portions of geographic areas ("geographic components" in the Census Bureau's nomenclature) such as metropolitan/micropolitan statistical areas and the principal cities of metropolitan statistical areas. The collection has a separate set of files for every state, the District of Columbia, Puerto Rico, and the National File. Each file set has 11 data files per characteristic iteration, a data file with geographic variables called the "geographic header file," and a documentation file called the "packing list" with information about the files in the file set. Altogether, the 53 file sets have 110,416 data files and 53 packing list files. Each file set is compressed in a separate ZIP archive (Datasets 1-56, 72, and 99). Another ZIP archive (Dataset 100) contains a Microsoft Access database shell and additional documentation files besides the codebook. The National File (Dataset 99) constitutes the National Update for Summary File 2. The National Update added summary levels for the United States as a whole, regions, divisions, and geographic areas that cross state lines such as Core Based Statistical Areas.
Worldwide Population Data🌎 🌎
kaggle.com
zip
Updated Oct 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shiv_D24Coder (2023). Worldwide Population Data🌎 🌎 [Dataset]. https://www.kaggle.com/shivd24coder/worldwide-population-data
Explore at:
zip(48744075 bytes)Available download formats
Dataset updated
Oct 9, 2023
Authors
Shiv_D24Coder
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
World
Description
This Dataset provides comprehensive demographic information on global populations from 1950 to the present. It offers insights into various aspects of population dynamics, including population counts, gender ratios, birth and death rates, life expectancy, and migration patterns.

Column Descriptions:

SortOrder: Numeric identifier for sorting.

LocID: Location identifier.

Notes: Additional notes or comments (blank in this dataset).

ISO3_code: ISO 3-character country code.

ISO2_code: ISO 2-character country code.

SDMX_code: Statistical Data and Metadata Exchange code.

LocTypeID: Location type identifier.

LocTypeName: Location type name.

ParentID: Identifier for the parent location.

Location: Name of the location.

VarID: Identifier for the variant.

Variant: Type of population variant.

Time: Year or time period.

TPopulation1Jan: Total population on January 1st.

TPopulation1July: Total population on July 1st.

TPopulationMale1July: Total male population on July 1st.

TPopulationFemale1July: Total female population on July 1st.

PopDensity: Population density (people per square kilometer).

PopSexRatio: Population sex ratio (male/female).

MedianAgePop: Median age of the population.

NatChange: Natural change in population.

NatChangeRT: Natural change rate (per 1,000 people).

PopChange: Population change.

PopGrowthRate: Population growth rate (percentage).

DoublingTime: Time for population to double (in years).

Births: Total number of births.

Births1519: Births to mothers aged 15-19.

CBR: Crude birth rate (per 1,000 people).

TFR: Total fertility rate (average number of children per woman).

NRR: Net reproduction rate.

MAC: Mean age at childbearing.

SRB: Sex ratio at birth (male/female).

Deaths: Total number of deaths.

DeathsMale: Total male deaths.

DeathsFemale: Total female deaths.

CDR: Crude death rate (per 1,000 people).

LEx: Life expectancy at birth.

LExMale: Life expectancy for males at birth.

LExFemale: Life expectancy for females at birth.

LE15: Life expectancy at age 15.

LE15Male: Life expectancy for males at age 15.

LE15Female: Life expectancy for females at age 15.

LE65: Life expectancy at age 65.

LE65Male: Life expectancy for males at age 65.

LE65Female: Life expectancy for females at age 65.

LE80: Life expectancy at age 80.

LE80Male: Life expectancy for males at age 80.

LE80Female: Life expectancy for females at age 80.

InfantDeaths: Number of infant deaths.

IMR: Infant mortality rate (per 1,000 live births).

LBsurvivingAge1: Children surviving to age 1.

Under5Deaths: Number of deaths under age 5.

NetMigrations: Net migration rate (per 1,000 people).

CNMR: Crude net migration rate.

How to Use the Dataset:

Researchers can analyze demographic trends, birth and death rates, and population growth over time.

Policymakers can use population data to inform decisions on healthcare, education, and social services.

Data scientists can visualize and model population dynamics for various regions.

Journalists can use the dataset to report on global population trends and disparities.

Educators can incorporate real-world population data into lessons and research.

Please upvote and show your support if you find this dataset valuable for your research or analysis. Your feedback and contributions help make this dataset more accessible to the Kaggle community. Thank you!
N
Michigan City, IN Population Breakdown by Gender and Age Dataset: Male and...
neilsberg.com
csv, json
Updated Feb 19, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2024). Michigan City, IN Population Breakdown by Gender and Age Dataset: Male and Female Population Distribution Across 18 Age Groups // 2024 Edition [Dataset]. https://www.neilsberg.com/research/datasets/8e20aa53-c989-11ee-9145-3860777c1fe6/
Explore at:
json, csvAvailable download formats
Dataset updated
Feb 19, 2024
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Michigan City
Variables measured
Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the population of Michigan City by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for Michigan City. The dataset can be utilized to understand the population distribution of Michigan City by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in Michigan City. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for Michigan City.

Key observations

Largest age group (population): Male # 25-29 years (1,643) | Female # 35-39 years (1,182). Source: U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Scope of gender :

Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

Variables / Data Columns

Age Group: This column displays the age group for the Michigan City population analysis. Total expected values are 18 and are define above in the age groups section.

Population (Male): The male population in the Michigan City is shown in the following column.

Population (Female): The female population in the Michigan City is shown in the following column.

Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in Michigan City for each age group.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Michigan City Population by Gender. You can refer the same here
World Cities, Countries & Languages Dataset
kaggle.com
zip
Updated Sep 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Adil Shamim (2025). World Cities, Countries & Languages Dataset [Dataset]. https://www.kaggle.com/datasets/adilshamim8/world-cities-countries-and-languages-dataset/versions/3
Explore at:
zip(82890 bytes)Available download formats
Dataset updated
Sep 10, 2025
Authors
Adil Shamim
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset provides a structured view of the world’s cities, countries, and languages, derived from the well-known World Database (SQL → CSV). It is designed to be beginner-friendly yet powerful for researchers, analysts, and data scientists who want to explore global demographics, population distribution, and linguistic diversity.

The dataset is split into three clean, relational tables:

🔹 city.csv

Contains information about the world’s cities.

Key columns:

ID → Unique city identifier

Name → City name

CountryCode → Links each city to its country

District → Administrative division

Population → Population of the city

🔹 country.csv

Describes countries and their attributes.

Key columns:

Code → Unique country code

Name → Country name

Continent, Region → Geographic classification

SurfaceArea → Area in square kilometers

Population → Country’s population

GovernmentForm, HeadOfState → Political details

🔹 countrylanguage.csv

Captures languages spoken across countries.

Key columns:

CountryCode → Links to country.csv

Language → Language name

IsOfficial → Whether the language is official

Percentage → Percentage of speakers in the population

Why Use This Dataset?

Study urbanization and population trends across the globe.

Explore language diversity and compare official vs. non-official usage.

Perform SQL-style joins across the three tables for deeper analysis.

Great for data visualization projects, machine learning experiments, or teaching relational databases.

Possible Use Cases

📊 Build dashboards to visualize population growth by continent or country.

🌍 Rank cities by size, density, or region.

🗣️ Analyze global language distribution and multilingual countries.

🤖 Use as a practice dataset for SQL queries, joins, and normalization in machine learning pipelines.

This dataset offers a balanced mix of geography, demography, and linguistics — perfect for analysts, students, and Kaggle competitors alike.
2021 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates...
data.census.gov
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ACS, 2021 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates Subject Tables) [Dataset]. https://data.census.gov/table/ACSST1Y2021.S0101?q=S0101:+AGE+AND+SEX
Explore at:
Dataset provided by
United States Census Bureauhttp://census.gov/
Authors
ACS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
2021
Description
Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the Technical Documentation section.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2021 American Community Survey 1-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..The age dependency ratio is derived by dividing the combined under-18 and 65-and-over populations by the 18-to-64 population and multiplying by 100..The old-age dependency ratio is derived by dividing the population 65 and over by the 18-to-64 population and multiplying by 100..The child dependency ratio is derived by dividing the population under 18 by the 18-to-64 population and multiplying by 100..When information is missing or inconsistent, the Census Bureau logically assigns an acceptable value using the response to a related question or questions. If a logical assignment is not possible, data are filled using a statistical process called allocation, which uses a similar individual or household to provide a donor value. The "Allocated" section is the number of respondents who received an allocated value for a particular subject..The 2021 American Community Survey (ACS) data generally reflect the March 2020 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineations due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution. For a 5-year median estimate, the margin of error associated with a median was larger than the median itself.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.
w
Population and Family Health Survey 2023 - Jordan
microdata.worldbank.org
datacatalog.ihsn.org
+1more
Updated Aug 23, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Statistics (DoS) (2024). Population and Family Health Survey 2023 - Jordan [Dataset]. https://microdata.worldbank.org/index.php/catalog/6288
Explore at:
Dataset updated
Aug 23, 2024
Dataset authored and provided by
Department of Statistics (DoS)
Time period covered
2023
Area covered
Jordan
Description
Abstract

The 2023 Jordan Population and Family Health Survey (JPFHS) is the eighth Population and Family Health Survey conducted in Jordan, following those conducted in 1990, 1997, 2002, 2007, 2009, 2012, and 2017–18. It was implemented by the Department of Statistics (DoS) at the request of the Ministry of Health (MoH).

The primary objective of the 2023 JPFHS is to provide up-to-date estimates of key demographic and health indicators. Specifically, the 2023 JPFHS: • Collected data at the national level that allowed calculation of key demographic indicators • Explored the direct and indirect factors that determine levels of and trends in fertility and childhood mortality • Measured contraceptive knowledge and practice • Collected data on key aspects of family health, including immunisation coverage among children, prevalence and treatment of diarrhoea and other diseases among children under age 5, and maternity care indicators such as antenatal visits and assistance at delivery • Obtained data on child feeding practices, including breastfeeding, and conducted anthropometric measurements to assess the nutritional status of children under age 5 and women age 15–49 • Conducted haemoglobin testing with eligible children age 6–59 months and women age 15–49 to gather information on the prevalence of anaemia • Collected data on women’s and men’s knowledge and attitudes regarding sexually transmitted infections and HIV/AIDS • Obtained data on women’s experience of emotional, physical, and sexual violence • Gathered data on disability among household members

The information collected through the 2023 JPFHS is intended to assist policymakers and programme managers in evaluating and designing programmes and strategies for improving the health of the country’s population. The survey also provides indicators relevant to the Sustainable Development Goals (SDGs) for Jordan.

Geographic coverage

National coverage

Analysis unit

Household

Individual

Children age 0-5

Woman age 15-49

Man age 15-59

Universe

The survey covered all de jure household members (usual residents), all women aged 15-49, men aged 15-59, and all children aged 0-4 resident in the household.

Kind of data

Sample survey data [ssd]

Sampling procedure

The sampling frame used for the 2023 JPFHS was the 2015 Jordan Population and Housing Census (JPHC) frame. The survey was designed to produce representative results for the country as a whole, for urban and rural areas separately, for each of the country’s 12 governorates, and for four nationality domains: the Jordanian population, the Syrian population living in refugee camps, the Syrian population living outside of camps, and the population of other nationalities. Each of the 12 governorates is subdivided into districts, each district into subdistricts, each subdistrict into localities, and each locality into areas and subareas. In addition to these administrative units, during the 2015 JPHC each subarea was divided into convenient area units called census blocks. An electronic file of a complete list of all of the census blocks is available from DoS. The list contains census information on households, populations, geographical locations, and socioeconomic characteristics of each block. Based on this list, census blocks were regrouped to form a general statistical unit of moderate size, called a cluster, which is widely used in various surveys as the primary sampling unit (PSU). The sample clusters for the 2023 JPFHS were selected from the frame of cluster units provided by the DoS.

The sample for the 2023 JPFHS was a stratified sample selected in two stages from the 2015 census frame. Stratification was achieved by separating each governorate into urban and rural areas. In addition, the Syrian refugee camps in Zarqa and Mafraq each formed a special sampling stratum. In total, 26 sampling strata were constructed. Samples were selected independently in each sampling stratum, through a twostage selection process, according to the sample allocation. Before the sample selection, the sampling frame was sorted by district and subdistrict within each sampling stratum. By using a probability proportional to size selection at the first stage of sampling, an implicit stratification and proportional allocation were achieved at each of the lower administrative levels.

For further details on sample design, see APPENDIX A of the final report.

Mode of data collection

Computer Assisted Personal Interview [capi]

Research instrument

Five questionnaires were used for the 2023 JPFHS: (1) the Household Questionnaire, (2) the Woman’s Questionnaire, (3) the Man’s Questionnaire, (4) the Biomarker Questionnaire, and (5) the Fieldworker Questionnaire. The questionnaires, based on The DHS Program’s model questionnaires, were adapted to reflect the population and health issues relevant to Jordan. Input was solicited from various stakeholders representing government ministries and agencies, nongovernmental organisations, and international donors. After all questionnaires were finalised in English, they were translated into Arabic.

Cleaning operations

All electronic data files for the 2023 JPFHS were transferred via SynCloud to the DoS central office in Amman, where they were stored on a password-protected computer. The data processing operation included secondary editing, which required resolution of computer-identified inconsistencies and coding of open-ended questions. Data editing was accomplished using CSPro software. During the duration of fieldwork, tables were generated to check various data quality parameters, and specific feedback was given to the teams to improve performance. Secondary editing and data processing were initiated in July and completed in September 2023.

Response rate

A total of 20,054 households were selected for the sample, of which 19,809 were occupied. Of the occupied households, 19,475 were successfully interviewed, yielding a response rate of 98%.

In the interviewed households, 13,020 eligible women age 15–49 were identified for individual interviews; interviews were completed with 12,595 women, yielding a response rate of 97%. In the subsample of households selected for the male survey, 6,506 men age 15–59 were identified as eligible for individual interviews and 5,873 were successfully interviewed, yielding a response rate of 90%.

Sampling error estimates

The estimates from a sample survey are affected by two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and in data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2023 Jordan Population and Family Health Survey (2023 JPFHS) to minimise this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.

Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2023 JPFHS is only one of many samples that could have been selected from the same population, using the same design and sample size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.

Sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.

If the sample of respondents had been selected by simple random sampling, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2023 JPFHS sample was the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulas. Sampling errors are computed using SAS programs developed by ICF. These programs use the Taylor linearisation method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.

A more detailed description of estimates of sampling errors are presented in APPENDIX B of the survey report.

Data appraisal

Data Quality Tables

Household age distribution

Age distribution of eligible and interviewed women

Age distribution of eligible and interviewed men

Age displacement at age 14/15

Age displacement at age 49/50

Pregnancy outcomes by years preceding the survey

Completeness of reporting

Standardization exercise results from anthropometry training

Height and weight data completeness and quality for children

Height measurements from random subsample of measured children

Interference in height and weight measurements of children

Interference in height and weight measurements of women

Heaping in
Improved Statistical Analysis of Low Abundance Phenomena in Bimodal...
plos.figshare.com
pdf
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Friedrich Reinhard; Jan Roelof van der Meer (2023). Improved Statistical Analysis of Low Abundance Phenomena in Bimodal Bacterial Populations [Dataset]. http://doi.org/10.1371/journal.pone.0078288
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0078288
Dataset updated
May 31, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Friedrich Reinhard; Jan Roelof van der Meer
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Accurate detection of subpopulation size determinations in bimodal populations remains problematic yet it represents a powerful way by which cellular heterogeneity under different environmental conditions can be compared. So far, most studies have relied on qualitative descriptions of population distribution patterns, on population-independent descriptors, or on arbitrary placement of thresholds distinguishing biological ON from OFF states. We found that all these methods fall short of accurately describing small population sizes in bimodal populations. Here we propose a simple, statistics-based method for the analysis of small subpopulation sizes for use in the free software environment R and test this method on real as well as simulated data. Four so-called population splitting methods were designed with different algorithms that can estimate subpopulation sizes from bimodal populations. All four methods proved more precise than previously used methods when analyzing subpopulation sizes of transfer competent cells arising in populations of the bacterium Pseudomonas knackmussii B13. The methods’ resolving powers were further explored by bootstrapping and simulations. Two of the methods were not severely limited by the proportions of subpopulations they could estimate correctly, but the two others only allowed accurate subpopulation quantification when this amounted to less than 25% of the total population. In contrast, only one method was still sufficiently accurate with subpopulations smaller than 1% of the total population. This study proposes a number of rational approximations to quantifying small subpopulations and offers an easy-to-use protocol for their implementation in the open source statistical software environment R.
Analyzing and interpreting spatial and temporal variability of the United...
plos.figshare.com
docx
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Meng Xu; Joel E. Cohen (2023). Analyzing and interpreting spatial and temporal variability of the United States county population distributions using Taylor's law [Dataset]. http://doi.org/10.1371/journal.pone.0226096
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0226096
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Meng Xu; Joel E. Cohen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Description
We study the spatial and temporal variation of the human population in the United States (US) counties from 1790 to 2010, using an ecological scaling pattern called Taylor's law (TL). TL states that the variance of population abundance is a power function of the mean population abundance. Despite extensive studies of TL for non-human populations, testing and interpreting TL using data on human populations are rare. Here we examine three types of TL that quantify the spatial and temporal variation of US county population abundance. Our results show that TL and its quadratic extension describe the mean-variance relationship of county population distribution well. The slope and statistics of TL reveal economic and demographic trends of the county populations. We propose TL as a useful statistical tool for analyzing human population variability. We suggest new ways of using TL to select and make population projections.
“Population” in Biology and Statistics: Topic Model and Analyses
figshare.com
txt
Updated Dec 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Charles Pence; Nicola Bertoldi (2024). “Population” in Biology and Statistics: Topic Model and Analyses [Dataset]. http://doi.org/10.6084/m9.figshare.27987335.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.27987335.v1
Dataset updated
Dec 7, 2024
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Charles Pence; Nicola Bertoldi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
LicenseThe code present in code is licensed under the GNU GPL v3.All other data and figures present here are released under CC-BY 4.0.CodeA static version of the code that we used to generate the topic models is preserved here in the code directory. The latest version of that code, called quick-topic, is kept in a repository here:https://codeberg.org/cpence/quick-topicIf this repository is moved or closed, please look for future versions of this code at https://codeberg.org/cpence/ or https://codeberg.org/pencelab/.CorpusNote that not all files in this directory will work as expected, because the original, full-text journal articles cannot be uploaded here due to copyright provisions. Please contact us if you would like to make an arrangement to access this data.
National Health Interview Survey
odgavaprod.ogopendata.com
healthdata.gov
+3more
Updated Jul 25, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Centers for Disease Control and Prevention, Department of Health & Human Services (2023). National Health Interview Survey [Dataset]. https://odgavaprod.ogopendata.com/dataset/national-health-interview-survey
Explore at:
Dataset updated
Jul 25, 2023
Dataset provided by
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Centers for Disease Control and Preventionhttp://www.cdc.gov/
Description
The National Health Interview Survey (NHIS) is the principal source of information on the health of the civilian noninstitutionalized population of the United States and is one of the major data collection programs of the National Center for Health Statistics (NCHS) which is part of the Centers for Disease Control and Prevention (CDC). The National Health Survey Act of 1956 provided for a continuing survey and special studies to secure accurate and current statistical information on the amount, distribution, and effects of illness and disability in the United States and the services rendered for or because of such conditions. The survey referred to in the Act, now called the National Health Interview Survey, was initiated in July 1957. Since 1960, the survey has been conducted by NCHS, which was formed when the National Health Survey and the National Vital Statistics Division were combined.
NHIS data are used widely throughout the Department of Health and Human Services (DHHS) to monitor trends in illness and disability and to track progress toward achieving national health objectives. The data are also used by the public health research community for epidemiologic and policy analysis of such timely issues as characterizing those with various health problems, determining barriers to accessing and using appropriate health care, and evaluating Federal health programs.
The NHIS also has a central role in the ongoing integration of household surveys in DHHS. The designs of two major DHHS national household surveys have been or are linked to the NHIS. The National Survey of Family Growth used the NHIS sampling frame in its first five cycles and the Medical Expenditure Panel Survey currently uses half of the NHIS sampling frame. Other linkage includes linking NHIS data to death certificates in the National Death Index (NDI).
While the NHIS has been conducted continuously since 1957, the content of the survey has been updated about every 10-15 years. In 1996, a substantially revised NHIS questionnaire began field testing. This revised questionnaire, described in detail below, was implemented in 1997 and has improved the ability of the NHIS to provide important health information.
N
Malta, OH Population Breakdown by Gender and Age
neilsberg.com
csv, json
Updated Sep 14, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2023). Malta, OH Population Breakdown by Gender and Age [Dataset]. https://www.neilsberg.com/research/datasets/6703b61d-3d85-11ee-9abe-0aa64bf2eeb2/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Sep 14, 2023
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Ohio, Malta
Variables measured
Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the population of Malta by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for Malta. The dataset can be utilized to understand the population distribution of Malta by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in Malta. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for Malta.

Key observations

Largest age group (population): Male # 35-39 years (52) | Female # 25-29 years (62). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Scope of gender :

Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

Variables / Data Columns

Age Group: This column displays the age group for the Malta population analysis. Total expected values are 18 and are define above in the age groups section.

Population (Male): The male population in the Malta is shown in the following column.

Population (Female): The female population in the Malta is shown in the following column.

Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in Malta for each age group.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Malta Population by Gender. You can refer the same here
Population of the world 10,000BCE-2100
statista.com
Updated Nov 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Population of the world 10,000BCE-2100 [Dataset]. https://www.statista.com/statistics/1006502/global-population-ten-thousand-bc-to-2050/
Explore at:
Dataset updated
Nov 28, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
World
Description
Until the 1800s, population growth was incredibly slow on a global level. The global population was estimated to have been around 188 million people in the year 1CE, and did not reach one billion until around 1803. However, since the 1800s, a phenomenon known as the demographic transition has seen population growth skyrocket, reaching eight billion people in 2023, and this is expected to peak at over 10 billion in the 2080s.
2018 American Community Survey: S0101 | AGE AND SEX (ACS 5-Year Estimates...
data.census.gov
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ACS, 2018 American Community Survey: S0101 | AGE AND SEX (ACS 5-Year Estimates Subject Tables) [Dataset]. https://data.census.gov/table/ACSST5Y2018.S0101?g=040XX00US39
Explore at:
Dataset provided by
United States Census Bureauhttp://census.gov/
Authors
ACS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
2018
Description
Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the .Technical Documentation.. section......Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the .Methodology.. section..Source: U.S. Census Bureau, 2014-2018 American Community Survey 5-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see .ACS Technical Documentation..). The effect of nonsampling error is not represented in these tables..The age dependency ratio is derived by dividing the combined under-18 and 65-and-over populations by the 18-to-64 population and multiplying by 100..The old-age dependency ratio is derived by dividing the population 65 and over by the 18-to-64 population and multiplying by 100..The child dependency ratio is derived by dividing the population under 18 by the 18-to-64 population and multiplying by 100..When information is missing or inconsistent, the Census Bureau logically assigns an acceptable value using the response to a related question or questions. If a logical assignment is not possible, data are filled using a statistical process called allocation, which uses a similar individual or household to provide a donor value. The "Allocated" section is the number of respondents who received an allocated value for a particular subject..While the 2014-2018 American Community Survey (ACS) data generally reflect the February 2013 Office of Management and Budget (OMB) definitions of metropolitan and micropolitan statistical areas; in certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB definitions due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:..An "**" entry in the margin of error column indicates that either no sample observations or too few sample observations were available to compute a standard error and thus the margin of error. A statistical test is not appropriate..An "-" entry in the estimate column indicates that either no sample observations or too few sample observations were available to compute an estimate, or a ratio of medians cannot be calculated because one or both of the median estimates falls in the lowest interval or upper interval of an open-ended distribution, or the margin of error associated with a median was larger than the median itself..An "-" following a median estimate means the median falls in the lowest interval of an open-ended distribution..An "+" following a median estimate means the median falls in the upper interval of an open-ended distribution..An "***" entry in the margin of error column indicates that the median falls in the lowest interval or upper interval of an open-ended distribution. A statistical test is not appropriate..An "*****" entry in the margin of error column indicates that the estimate is controlled. A statistical test for sampling variability is not appropriate. .An "N" entry in the estimate and margin of error columns indicates that data for this geographic area cannot be displayed because the number of sample cases is too small..An "(X)" means that the estimate is not applicable or not available....
d
Current Population Survey (CPS)
search.dataone.org
dataverse.harvard.edu
Updated Nov 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Damico, Anthony (2023). Current Population Survey (CPS) [Dataset]. http://doi.org/10.7910/DVN/AK4FDD
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/AK4FDD
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Damico, Anthony
Description
analyze the current population survey (cps) annual social and economic supplement (asec) with r the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics ( bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups b y state - consider pooling multiple years. county-level is a no-no. despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be t reated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population. the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show. this new github repository contains three scripts: 2005-2012 asec - download all microdata.R down load the fixed-width file containing household, family, and person records import by separating this file into three tables, then merge 'em together at the person-level download the fixed-width file containing the person-level replicate weights merge the rectangular person-level file with the replicate weights, then store it in a sql database create a new variable - one - in the data table 2012 asec - analysis examples.R connect to the sql database created by the 'download all microdata' progr am create the complex sample survey object, using the replicate weights perform a boatload of analysis examples replicate census estimates - 2011.R connect to the sql database created by the 'download all microdata' program create the complex sample survey object, using the replicate weights match the sas output shown in the png file below 2011 asec replicate weight sas output.png statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document. click here to view these three scripts for more detail about the current population survey - annual social and economic supplement (cps-asec), visit: the census bureau's current population survey page the bureau of labor statistics' current population survey page the current population survey's wikipedia article notes: interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current populat ion survey to talk about america, subract a year from the data file name. as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research. confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D
N
Illinois Population Breakdown by Gender and Age Dataset: Male and Female...
neilsberg.com
csv, json
Updated Feb 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Illinois Population Breakdown by Gender and Age Dataset: Male and Female Population Distribution Across 18 Age Groups // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/e1e81541-f25d-11ef-8c1b-3860777c1fe6/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Feb 24, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Illinois
Variables measured
Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the population of Illinois by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for Illinois. The dataset can be utilized to understand the population distribution of Illinois by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in Illinois. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for Illinois.

Key observations

Largest age group (population): Male # 30-34 years (437,015) | Female # 30-34 years (429,453). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Scope of gender :

Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

Variables / Data Columns

Age Group: This column displays the age group for the Illinois population analysis. Total expected values are 18 and are define above in the age groups section.

Population (Male): The male population in the Illinois is shown in the following column.

Population (Female): The female population in the Illinois is shown in the following column.

Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in Illinois for each age group.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Illinois Population by Gender. You can refer the same here
undefined undefined: undefined | undefined (undefined)
data.census.gov
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States Census Bureau, undefined undefined: undefined | undefined (undefined) [Dataset]. https://data.census.gov/table?g=050XX00US06025&tid=ACSST1Y2021.S0701
Explore at:
Dataset provided by
United States Census Bureauhttp://census.gov/
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the Technical Documentation section.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2021 American Community Survey 1-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..Foreign born excludes people born outside the United States to a parent who is a U.S. citizen..When information is missing or inconsistent, the Census Bureau logically assigns an acceptable value using the response to a related question or questions. If a logical assignment is not possible, data are filled using a statistical process called allocation, which uses a similar individual or household to provide a donor value. The "Allocated" section is the number of respondents who received an allocated value for a particular subject..The 2021 American Community Survey (ACS) data generally reflect the March 2020 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineations due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution. For a 5-year median estimate, the margin of error associated with a median was larger than the median itself.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.
2019 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates...
data.census.gov
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ACS, 2019 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates Subject Tables) [Dataset]. https://data.census.gov/cedsci/table?q=2019%20acs
Explore at:
Dataset provided by
United States Census Bureauhttp://census.gov/
Authors
ACS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
2019
Description
Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the Technical Documentation section.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2019 American Community Survey 1-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..The age dependency ratio is derived by dividing the combined under-18 and 65-and-over populations by the 18-to-64 population and multiplying by 100..The old-age dependency ratio is derived by dividing the population 65 and over by the 18-to-64 population and multiplying by 100..The child dependency ratio is derived by dividing the population under 18 by the 18-to-64 population and multiplying by 100..When information is missing or inconsistent, the Census Bureau logically assigns an acceptable value using the response to a related question or questions. If a logical assignment is not possible, data are filled using a statistical process called allocation, which uses a similar individual or household to provide a donor value. The "Allocated" section is the number of respondents who received an allocated value for a particular subject..The 2019 American Community Survey (ACS) data generally reflect the September 2018 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineations due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:An "**" entry in the margin of error column indicates that either no sample observations or too few sample observations were available to compute a standard error and thus the margin of error. A statistical test is not appropriate.An "-" entry in the estimate column indicates that either no sample observations or too few sample observations were available to compute an estimate, or a ratio of medians cannot be calculated because one or both of the median estimates falls in the lowest interval or upper interval of an open-ended distribution, or the margin of error associated with a median was larger than the median itself.An "-" following a median estimate means the median falls in the lowest interval of an open-ended distribution.An "+" following a median estimate means the median falls in the upper interval of an open-ended distribution.An "***" entry in the margin of error column indicates that the median falls in the lowest interval or upper interval of an open-ended distribution. A statistical test is not appropriate.An "*****" entry in the margin of error column indicates that the estimate is controlled. A statistical test for sampling variability is not appropriate. An "N" entry in the estimate and margin of error columns indicates that data for this geographic area cannot be displayed because the number of sample cases is too small.An "(X)" means that the estimate is not applicable or not available.

Facebook

Twitter

Click to copy link

Link copied

Cite

Umeå university (2025). The database TABVERK [Dataset]. https://researchdata.se/en/catalogue/dataset/ext0084-1

The database TABVERK

Explore at:

5 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Jun 24, 2025

Dataset provided by

Umeå University

Authors

Umeå university

Time period covered

1749 - 1859

Description

The database TABVERK contains population statistics in the Swedish parishes during the period 1749-1859. It constitutes the earliest population statistics throughout the world making it a unique source. Merely in Finland, a similar time serial of data exists. The database gathers all the information about the parishes’ populations that Swedish clergymens filled out in large forms of tables which were reported to the Tabellkommissionen in Stockholm. Now this information is accessible through the agency of the Demographic Data Base.

The database contain information on births, deaths, marriage, and occupation. The population is describe by sex, age, civil status, and by occupation and social status every fifth year. Yearly statistics is given about the demographical events, fertility, mortality and nuptiality, as well as migrations in the 19th century. The statistics in Tabellverket (TABVERK) enable demographical studies from several angles on both local and regional level in Sweden during the pre-industrial society.

TABVERK holds an online search tool called SHiPS (Swedish Historical Population Statistics), which can be reached and utilized from the home page of the Demographic Data Base.

Purpose:

The Demographic Database (DDB) is a special unit at Umeå University with activities focused on two main areas: data production and research. DDB creates and makes available population databases, mainly based on historical information in parish registers during the 18th, 19th, and 20th centuries.

Clear search

Close search

Google apps

Main menu

The database TABVERK

Population and Family Health Survey 2012 - Jordan

Abstract

Geographic coverage

Analysis unit

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Sampling error estimates

Census Microdata Samples Project

Census of Population and Housing, 2010 [United States]: Summary File 2 With...

Worldwide Population Data🌎 🌎

Column Descriptions:

How to Use the Dataset:

Michigan City, IN Population Breakdown by Gender and Age Dataset: Male and...

About this dataset

Content

Inspiration

Recommended for further research

World Cities, Countries & Languages Dataset

🔹 city.csv

🔹 country.csv

🔹 countrylanguage.csv

Why Use This Dataset?

Possible Use Cases

2021 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates...

Population and Family Health Survey 2023 - Jordan

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Sampling error estimates

Data appraisal

Improved Statistical Analysis of Low Abundance Phenomena in Bimodal...

Analyzing and interpreting spatial and temporal variability of the United...

“Population” in Biology and Statistics: Topic Model and Analyses

National Health Interview Survey

Malta, OH Population Breakdown by Gender and Age

About this dataset

Content

Inspiration

Recommended for further research

Population of the world 10,000BCE-2100

2018 American Community Survey: S0101 | AGE AND SEX (ACS 5-Year Estimates...

Current Population Survey (CPS)

Illinois Population Breakdown by Gender and Age Dataset: Male and Female...

About this dataset

Content

Inspiration

Recommended for further research

undefined undefined: undefined | undefined (undefined)

2019 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates...

The database TABVERK