100+ datasets found
  1. r

    The database TABVERK

    • researchdata.se
    Updated Jun 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Umeå university (2025). The database TABVERK [Dataset]. https://researchdata.se/en/catalogue/dataset/ext0084-1
    Explore at:
    Dataset updated
    Jun 24, 2025
    Dataset provided by
    Umeå University
    Authors
    Umeå university
    Time period covered
    1749 - 1859
    Description

    The database TABVERK contains population statistics in the Swedish parishes during the period 1749-1859. It constitutes the earliest population statistics throughout the world making it a unique source. Merely in Finland, a similar time serial of data exists. The database gathers all the information about the parishes’ populations that Swedish clergymens filled out in large forms of tables which were reported to the Tabellkommissionen in Stockholm. Now this information is accessible through the agency of the Demographic Data Base.

    The database contain information on births, deaths, marriage, and occupation. The population is describe by sex, age, civil status, and by occupation and social status every fifth year. Yearly statistics is given about the demographical events, fertility, mortality and nuptiality, as well as migrations in the 19th century. The statistics in Tabellverket (TABVERK) enable demographical studies from several angles on both local and regional level in Sweden during the pre-industrial society.

    TABVERK holds an online search tool called SHiPS (Swedish Historical Population Statistics), which can be reached and utilized from the home page of the Demographic Data Base.

    Purpose:

    The Demographic Database (DDB) is a special unit at Umeå University with activities focused on two main areas: data production and research. DDB creates and makes available population databases, mainly based on historical information in parish registers during the 18th, 19th, and 20th centuries.

  2. u

    Population and Family Health Survey 2012 - Jordan

    • microdata.unhcr.org
    • catalog.ihsn.org
    • +2more
    Updated May 19, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Statistics (DoS) (2021). Population and Family Health Survey 2012 - Jordan [Dataset]. https://microdata.unhcr.org/index.php/catalog/405
    Explore at:
    Dataset updated
    May 19, 2021
    Dataset authored and provided by
    Department of Statistics (DoS)
    Time period covered
    2012
    Area covered
    Jordan
    Description

    Abstract

    The Jordan Population and Family Health Survey (JPFHS) is part of the worldwide Demographic and Health Surveys Program, which is designed to collect data on fertility, family planning, and maternal and child health.

    The primary objective of the 2012 Jordan Population and Family Health Survey (JPFHS) is to provide reliable estimates of demographic parameters, such as fertility, mortality, family planning, and fertility preferences, as well as maternal and child health and nutrition, that can be used by program managers and policymakers to evaluate and improve existing programs. The JPFHS data will be useful to researchers and scholars interested in analyzing demographic trends in Jordan, as well as those conducting comparative, regional, or cross-national studies.

    Geographic coverage

    National coverage

    Analysis unit

    • Household
    • Women age 15-49

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    Sample Design The 2012 JPFHS sample was designed to produce reliable estimates of major survey variables for the country as a whole, urban and rural areas, each of the 12 governorates, and for the two special domains: the Badia areas and people living in refugee camps. To facilitate comparisons with previous surveys, the sample was also designed to produce estimates for the three regions (North, Central, and South). The grouping of the governorates into regions is as follows: the North consists of Irbid, Jarash, Ajloun, and Mafraq governorates; the Central region consists of Amman, Madaba, Balqa, and Zarqa governorates; and the South region consists of Karak, Tafiela, Ma'an, and Aqaba governorates.

    The 2012 JPFHS sample was selected from the 2004 Jordan Population and Housing Census sampling frame. The frame excludes the population living in remote areas (most of whom are nomads), as well as those living in collective housing units such as hotels, hospitals, work camps, prisons, and the like. For the 2004 census, the country was subdivided into convenient area units called census blocks. For the purposes of the household surveys, the census blocks were regrouped to form a general statistical unit of moderate size (30 households or more), called a "cluster", which is widely used in surveys as a primary sampling unit (PSU).

    Stratification was achieved by first separating each governorate into urban and rural areas and then, within each urban and rural area, by Badia areas, refugee camps, and other. A two-stage sampling procedure was employed. In the first stage, 806 clusters were selected with probability proportional to the cluster size, that is, the number of residential households counted in the 2004 census. A household listing operation was then carried out in all of the selected clusters, and the resulting lists of households served as the sampling frame for the selection of households in the second stage. In the second stage of selection, a fixed number of 20 households was selected in each cluster with an equal probability systematic selection. A subsample of two-thirds of the selected households was identified for anthropometry measurements.

    Refer to Appendix A in the final report (Jordan Population and Family Health Survey 2012) for details of sampling weights calculation.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The 2012 JPFHS used two questionnaires, namely the Household Questionnaire and the Woman’s Questionnaire (see Appendix D). The Household Questionnaire was used to list all usual members of the sampled households, and visitors who slept in the household the night before the interview, and to obtain information on each household member’s age, sex, educational attainment, relationship to the head of the household, and marital status. In addition, questions were included on the socioeconomic characteristics of the household, such as source of water, sanitation facilities, and the availability of durable goods. Moreover, the questionnaire included questions about child discipline. The Household Questionnaire was also used to identify women who were eligible for the individual interview (ever-married women age 15-49 years). In addition, all women age 15-49 and children under age 5 living in the subsample of households were eligible for height and weight measurement and anemia testing.

    The Woman’s Questionnaire was administered to ever-married women age 15-49 and collected information on the following topics: • Respondent’s background characteristics • Birth history • Knowledge, attitudes, and practice of family planning and exposure to family planning messages • Maternal health (antenatal, delivery, and postnatal care) • Immunization and health of children under age 5 • Breastfeeding and infant feeding practices • Marriage and husband’s background characteristics • Fertility preferences • Respondent’s employment • Knowledge of AIDS and sexually transmitted infections (STIs) • Other health issues specific to women • Early childhood development • Domestic violence

    In addition, information on births, pregnancies, and contraceptive use and discontinuation during the five years prior to the survey was collected using a monthly calendar.

    The Household and Woman’s Questionnaires were based on the model questionnaires developed by the MEASURE DHS program. Additions and modifications to the model questionnaires were made in order to provide detailed information specific to Jordan. The questionnaires were then translated into Arabic.

    Anthropometric data were collected during the 2012 JPFHS in a subsample of two-thirds of the selected households in each cluster. All women age 15-49 and children age 0-4 in these households were measured for height using Shorr height boards and for weight using electronic Seca scales. In addition, a drop of capillary blood was taken from these women and children in the field to measure their hemoglobin level using the HemoCue system. Hemoglobin testing was used to estimate the prevalence of anemia.

    Cleaning operations

    Fieldwork and data processing activities overlapped. Data processing began two weeks after the start of the fieldwork. After field editing of questionnaires for completeness and consistency, the questionnaires for each cluster were packaged together and sent to the central office in Amman, where they were registered and stored. Special teams were formed to carry out office editing and coding of the openended questions.

    Data entry and verification started after two weeks of office data processing. The process of data entry, including 100 percent reentry, editing, and cleaning, was done by using PCs and the CSPro (Census and Survey Processing) computer package, developed specially for such surveys. The CSPro program allows data to be edited while being entered. Data processing operations were completed by early January 2013. A data processing specialist from ICF International made a trip to Jordan in February 2013 to follow up on data editing and cleaning and to work on the tabulation of results for the survey preliminary report, which was published in March 2013. The tabulations for this report were completed in April 2013.

    Response rate

    In all, 16,120 households were selected for the survey and, of these, 15,722 were found to be occupied households. Of these households, 15,190 (97 percent) were successfully interviewed.

    In the households interviewed, 11,673 ever-married women age 15-49 were identified and interviews were completed with 11,352 women, or 97 percent of all eligible women.

    Sampling error estimates

    The estimates from a sample survey are affected by two types of errors: (1) nonsampling errors and (2) sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2012 Jordan Population and Family Health Survey (JPFHS) to minimize this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.

    Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2012 JPFHS is only one of many samples that could have been selected from the same population, using the same design and identical size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling error is a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.

    A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.

    If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2012 JPFHS sample is the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulae. The computer

  3. n

    Census Microdata Samples Project

    • neuinfo.org
    • dknet.org
    • +2more
    Updated Jan 29, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Census Microdata Samples Project [Dataset]. http://identifiers.org/RRID:SCR_008902
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    A data set of cross-nationally comparable microdata samples for 15 Economic Commission for Europe (ECE) countries (Bulgaria, Canada, Czech Republic, Estonia, Finland, Hungary, Italy, Latvia, Lithuania, Romania, Russia, Switzerland, Turkey, UK, USA) based on the 1990 national population and housing censuses in countries of Europe and North America to study the social and economic conditions of older persons. These samples have been designed to allow research on a wide range of issues related to aging, as well as on other social phenomena. A common set of nomenclatures and classifications, derived on the basis of a study of census data comparability in Europe and North America, was adopted as a standard for recoding. This series was formerly called Dynamics of Population Aging in ECE Countries. The recommendations regarding the design and size of the samples drawn from the 1990 round of censuses envisaged: (1) drawing individual-based samples of about one million persons; (2) progressive oversampling with age in order to ensure sufficient representation of various categories of older people; and (3) retaining information on all persons co-residing in the sampled individual''''s dwelling unit. Estonia, Latvia and Lithuania provided the entire population over age 50, while Finland sampled it with progressive over-sampling. Canada, Italy, Russia, Turkey, UK, and the US provided samples that had not been drawn specially for this project, and cover the entire population without over-sampling. Given its wide user base, the US 1990 PUMS was not recoded. Instead, PAU offers mapping modules, which recode the PUMS variables into the project''''s classifications, nomenclatures, and coding schemes. Because of the high sampling density, these data cover various small groups of older people; contain as much geographic detail as possible under each country''''s confidentiality requirements; include more extensive information on housing conditions than many other data sources; and provide information for a number of countries whose data were not accessible until recently. Data Availability: Eight of the fifteen participating countries have signed the standard data release agreement making their data available through NACDA/ICPSR (see links below). Hungary and Switzerland require a clearance to be obtained from their national statistical offices for the use of microdata, however the documents signed between the PAU and these countries include clauses stipulating that, in general, all scholars interested in social research will be granted access. Russia requested that certain provisions for archiving the microdata samples be removed from its data release arrangement. The PAU has an agreement with several British scholars to facilitate access to the 1991 UK data through collaborative arrangements. Statistics Canada and the Italian Institute of statistics (ISTAT) provide access to data from Canada and Italy, respectively. * Dates of Study: 1989-1992 * Study Features: International, Minority Oversamples * Sample Size: Approx. 1 million/country Links: * Bulgaria (1992), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/02200 * Czech Republic (1991), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06857 * Estonia (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06780 * Finland (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06797 * Romania (1992), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06900 * Latvia (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/02572 * Lithuania (1989), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/03952 * Turkey (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/03292 * U.S. (1990), http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/06219

  4. Census of Population and Housing, 2010 [United States]: Summary File 2 With...

    • icpsr.umich.edu
    • search.datacite.org
    Updated Jul 18, 2013
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States. Bureau of the Census (2013). Census of Population and Housing, 2010 [United States]: Summary File 2 With National Update [Dataset]. http://doi.org/10.3886/ICPSR34755.v1
    Explore at:
    Dataset updated
    Jul 18, 2013
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    United States. Bureau of the Census
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/34755/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/34755/terms

    Time period covered
    2010
    Area covered
    United States
    Description

    This data collection contains summary statistics on population and housing subjects derived from the responses to the 2010 Census questionnaire. Population items include sex, age, average household size, household type, and relationship to householder such as nonrelative or child. Housing items include tenure (whether a housing unit is owner-occupied or renter-occupied), age of householder, and household size for occupied housing units. Selected aggregates and medians also are provided. The summary statistics are presented in 71 tables, which are tabulated for multiple levels of observation (called "summary levels" in the Census Bureau's nomenclature), including, but not limited to, regions, divisions, states, metropolitan/micropolitan areas, counties, county subdivisions, places, ZIP Code Tabulation Areas (ZCTAs), school districts, census tracts, American Indian and Alaska Native areas, tribal subdivisions, and Hawaiian home lands. There are 10 population tables shown down to the county level and 47 population tables and 14 housing tables shown down to the census tract level. Every table cell is represented by a separate variable in the data. Each table is iterated for up to 330 population groups, which are called "characteristic iterations" in the Census Bureau's nomenclature: the total population, 74 race categories, 114 American Indian and Alaska Native categories, 47 Asian categories, 43 Native Hawaiian and Other Pacific Islander categories, and 51 Hispanic/not Hispanic groups. Moreover, the tables for some large summary areas (e.g., regions, divisions, and states) are iterated for portions of geographic areas ("geographic components" in the Census Bureau's nomenclature) such as metropolitan/micropolitan statistical areas and the principal cities of metropolitan statistical areas. The collection has a separate set of files for every state, the District of Columbia, Puerto Rico, and the National File. Each file set has 11 data files per characteristic iteration, a data file with geographic variables called the "geographic header file," and a documentation file called the "packing list" with information about the files in the file set. Altogether, the 53 file sets have 110,416 data files and 53 packing list files. Each file set is compressed in a separate ZIP archive (Datasets 1-56, 72, and 99). Another ZIP archive (Dataset 100) contains a Microsoft Access database shell and additional documentation files besides the codebook. The National File (Dataset 99) constitutes the National Update for Summary File 2. The National Update added summary levels for the United States as a whole, regions, divisions, and geographic areas that cross state lines such as Core Based Statistical Areas.

  5. Worldwide Population Data🌎 🌎

    • kaggle.com
    zip
    Updated Oct 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shiv_D24Coder (2023). Worldwide Population Data🌎 🌎 [Dataset]. https://www.kaggle.com/shivd24coder/worldwide-population-data
    Explore at:
    zip(48744075 bytes)Available download formats
    Dataset updated
    Oct 9, 2023
    Authors
    Shiv_D24Coder
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    World
    Description

    This Dataset provides comprehensive demographic information on global populations from 1950 to the present. It offers insights into various aspects of population dynamics, including population counts, gender ratios, birth and death rates, life expectancy, and migration patterns.

    Column Descriptions:

    SortOrder: Numeric identifier for sorting.

    LocID: Location identifier.

    Notes: Additional notes or comments (blank in this dataset).

    ISO3_code: ISO 3-character country code.

    ISO2_code: ISO 2-character country code.

    SDMX_code: Statistical Data and Metadata Exchange code.

    LocTypeID: Location type identifier.

    LocTypeName: Location type name.

    ParentID: Identifier for the parent location.

    Location: Name of the location.

    VarID: Identifier for the variant.

    Variant: Type of population variant.

    Time: Year or time period.

    TPopulation1Jan: Total population on January 1st.

    TPopulation1July: Total population on July 1st.

    TPopulationMale1July: Total male population on July 1st.

    TPopulationFemale1July: Total female population on July 1st.

    PopDensity: Population density (people per square kilometer).

    PopSexRatio: Population sex ratio (male/female).

    MedianAgePop: Median age of the population.

    NatChange: Natural change in population.

    NatChangeRT: Natural change rate (per 1,000 people).

    PopChange: Population change.

    PopGrowthRate: Population growth rate (percentage).

    DoublingTime: Time for population to double (in years).

    Births: Total number of births.

    Births1519: Births to mothers aged 15-19.

    CBR: Crude birth rate (per 1,000 people).

    TFR: Total fertility rate (average number of children per woman).

    NRR: Net reproduction rate.

    MAC: Mean age at childbearing.

    SRB: Sex ratio at birth (male/female).

    Deaths: Total number of deaths.

    DeathsMale: Total male deaths.

    DeathsFemale: Total female deaths.

    CDR: Crude death rate (per 1,000 people).

    LEx: Life expectancy at birth.

    LExMale: Life expectancy for males at birth.

    LExFemale: Life expectancy for females at birth.

    LE15: Life expectancy at age 15.

    LE15Male: Life expectancy for males at age 15.

    LE15Female: Life expectancy for females at age 15.

    LE65: Life expectancy at age 65.

    LE65Male: Life expectancy for males at age 65.

    LE65Female: Life expectancy for females at age 65.

    LE80: Life expectancy at age 80.

    LE80Male: Life expectancy for males at age 80.

    LE80Female: Life expectancy for females at age 80.

    InfantDeaths: Number of infant deaths.

    IMR: Infant mortality rate (per 1,000 live births).

    LBsurvivingAge1: Children surviving to age 1.

    Under5Deaths: Number of deaths under age 5.

    NetMigrations: Net migration rate (per 1,000 people).

    CNMR: Crude net migration rate.

    How to Use the Dataset:

    1. Researchers can analyze demographic trends, birth and death rates, and population growth over time.
    2. Policymakers can use population data to inform decisions on healthcare, education, and social services.
    3. Data scientists can visualize and model population dynamics for various regions.
    4. Journalists can use the dataset to report on global population trends and disparities.
    5. Educators can incorporate real-world population data into lessons and research.

    Please upvote and show your support if you find this dataset valuable for your research or analysis. Your feedback and contributions help make this dataset more accessible to the Kaggle community. Thank you!

  6. N

    Michigan City, IN Population Breakdown by Gender and Age Dataset: Male and...

    • neilsberg.com
    csv, json
    Updated Feb 19, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2024). Michigan City, IN Population Breakdown by Gender and Age Dataset: Male and Female Population Distribution Across 18 Age Groups // 2024 Edition [Dataset]. https://www.neilsberg.com/research/datasets/8e20aa53-c989-11ee-9145-3860777c1fe6/
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Feb 19, 2024
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Michigan City
    Variables measured
    Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Michigan City by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for Michigan City. The dataset can be utilized to understand the population distribution of Michigan City by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in Michigan City. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for Michigan City.

    Key observations

    Largest age group (population): Male # 25-29 years (1,643) | Female # 35-39 years (1,182). Source: U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates.

    Age groups:

    • Under 5 years
    • 5 to 9 years
    • 10 to 14 years
    • 15 to 19 years
    • 20 to 24 years
    • 25 to 29 years
    • 30 to 34 years
    • 35 to 39 years
    • 40 to 44 years
    • 45 to 49 years
    • 50 to 54 years
    • 55 to 59 years
    • 60 to 64 years
    • 65 to 69 years
    • 70 to 74 years
    • 75 to 79 years
    • 80 to 84 years
    • 85 years and over

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

    Variables / Data Columns

    • Age Group: This column displays the age group for the Michigan City population analysis. Total expected values are 18 and are define above in the age groups section.
    • Population (Male): The male population in the Michigan City is shown in the following column.
    • Population (Female): The female population in the Michigan City is shown in the following column.
    • Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in Michigan City for each age group.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Michigan City Population by Gender. You can refer the same here

  7. World Cities, Countries & Languages Dataset

    • kaggle.com
    zip
    Updated Sep 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adil Shamim (2025). World Cities, Countries & Languages Dataset [Dataset]. https://www.kaggle.com/datasets/adilshamim8/world-cities-countries-and-languages-dataset/versions/3
    Explore at:
    zip(82890 bytes)Available download formats
    Dataset updated
    Sep 10, 2025
    Authors
    Adil Shamim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset provides a structured view of the world’s cities, countries, and languages, derived from the well-known World Database (SQL → CSV). It is designed to be beginner-friendly yet powerful for researchers, analysts, and data scientists who want to explore global demographics, population distribution, and linguistic diversity.

    The dataset is split into three clean, relational tables:

    🔹 city.csv

    • Contains information about the world’s cities.
    • Key columns:

      • ID → Unique city identifier
      • Name → City name
      • CountryCode → Links each city to its country
      • District → Administrative division
      • Population → Population of the city

    🔹 country.csv

    • Describes countries and their attributes.
    • Key columns:

      • Code → Unique country code
      • Name → Country name
      • Continent, Region → Geographic classification
      • SurfaceArea → Area in square kilometers
      • Population → Country’s population
      • GovernmentForm, HeadOfState → Political details

    🔹 countrylanguage.csv

    • Captures languages spoken across countries.
    • Key columns:

      • CountryCode → Links to country.csv
      • Language → Language name
      • IsOfficial → Whether the language is official
      • Percentage → Percentage of speakers in the population

    Why Use This Dataset?

    • Study urbanization and population trends across the globe.
    • Explore language diversity and compare official vs. non-official usage.
    • Perform SQL-style joins across the three tables for deeper analysis.
    • Great for data visualization projects, machine learning experiments, or teaching relational databases.

    Possible Use Cases

    • 📊 Build dashboards to visualize population growth by continent or country.
    • 🌍 Rank cities by size, density, or region.
    • 🗣️ Analyze global language distribution and multilingual countries.
    • 🤖 Use as a practice dataset for SQL queries, joins, and normalization in machine learning pipelines.

    This dataset offers a balanced mix of geography, demography, and linguistics — perfect for analysts, students, and Kaggle competitors alike.

  8. 2021 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates...

    • data.census.gov
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ACS, 2021 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates Subject Tables) [Dataset]. https://data.census.gov/table/ACSST1Y2021.S0101?q=S0101:+AGE+AND+SEX
    Explore at:
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Authors
    ACS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2021
    Description

    Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the Technical Documentation section.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2021 American Community Survey 1-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..The age dependency ratio is derived by dividing the combined under-18 and 65-and-over populations by the 18-to-64 population and multiplying by 100..The old-age dependency ratio is derived by dividing the population 65 and over by the 18-to-64 population and multiplying by 100..The child dependency ratio is derived by dividing the population under 18 by the 18-to-64 population and multiplying by 100..When information is missing or inconsistent, the Census Bureau logically assigns an acceptable value using the response to a related question or questions. If a logical assignment is not possible, data are filled using a statistical process called allocation, which uses a similar individual or household to provide a donor value. The "Allocated" section is the number of respondents who received an allocated value for a particular subject..The 2021 American Community Survey (ACS) data generally reflect the March 2020 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineations due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution. For a 5-year median estimate, the margin of error associated with a median was larger than the median itself.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.

  9. w

    Population and Family Health Survey 2023 - Jordan

    • microdata.worldbank.org
    • datacatalog.ihsn.org
    • +1more
    Updated Aug 23, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Statistics (DoS) (2024). Population and Family Health Survey 2023 - Jordan [Dataset]. https://microdata.worldbank.org/index.php/catalog/6288
    Explore at:
    Dataset updated
    Aug 23, 2024
    Dataset authored and provided by
    Department of Statistics (DoS)
    Time period covered
    2023
    Area covered
    Jordan
    Description

    Abstract

    The 2023 Jordan Population and Family Health Survey (JPFHS) is the eighth Population and Family Health Survey conducted in Jordan, following those conducted in 1990, 1997, 2002, 2007, 2009, 2012, and 2017–18. It was implemented by the Department of Statistics (DoS) at the request of the Ministry of Health (MoH).

    The primary objective of the 2023 JPFHS is to provide up-to-date estimates of key demographic and health indicators. Specifically, the 2023 JPFHS: • Collected data at the national level that allowed calculation of key demographic indicators • Explored the direct and indirect factors that determine levels of and trends in fertility and childhood mortality • Measured contraceptive knowledge and practice • Collected data on key aspects of family health, including immunisation coverage among children, prevalence and treatment of diarrhoea and other diseases among children under age 5, and maternity care indicators such as antenatal visits and assistance at delivery • Obtained data on child feeding practices, including breastfeeding, and conducted anthropometric measurements to assess the nutritional status of children under age 5 and women age 15–49 • Conducted haemoglobin testing with eligible children age 6–59 months and women age 15–49 to gather information on the prevalence of anaemia • Collected data on women’s and men’s knowledge and attitudes regarding sexually transmitted infections and HIV/AIDS • Obtained data on women’s experience of emotional, physical, and sexual violence • Gathered data on disability among household members

    The information collected through the 2023 JPFHS is intended to assist policymakers and programme managers in evaluating and designing programmes and strategies for improving the health of the country’s population. The survey also provides indicators relevant to the Sustainable Development Goals (SDGs) for Jordan.

    Geographic coverage

    National coverage

    Analysis unit

    • Household
    • Individual
    • Children age 0-5
    • Woman age 15-49
    • Man age 15-59

    Universe

    The survey covered all de jure household members (usual residents), all women aged 15-49, men aged 15-59, and all children aged 0-4 resident in the household.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sampling frame used for the 2023 JPFHS was the 2015 Jordan Population and Housing Census (JPHC) frame. The survey was designed to produce representative results for the country as a whole, for urban and rural areas separately, for each of the country’s 12 governorates, and for four nationality domains: the Jordanian population, the Syrian population living in refugee camps, the Syrian population living outside of camps, and the population of other nationalities. Each of the 12 governorates is subdivided into districts, each district into subdistricts, each subdistrict into localities, and each locality into areas and subareas. In addition to these administrative units, during the 2015 JPHC each subarea was divided into convenient area units called census blocks. An electronic file of a complete list of all of the census blocks is available from DoS. The list contains census information on households, populations, geographical locations, and socioeconomic characteristics of each block. Based on this list, census blocks were regrouped to form a general statistical unit of moderate size, called a cluster, which is widely used in various surveys as the primary sampling unit (PSU). The sample clusters for the 2023 JPFHS were selected from the frame of cluster units provided by the DoS.

    The sample for the 2023 JPFHS was a stratified sample selected in two stages from the 2015 census frame. Stratification was achieved by separating each governorate into urban and rural areas. In addition, the Syrian refugee camps in Zarqa and Mafraq each formed a special sampling stratum. In total, 26 sampling strata were constructed. Samples were selected independently in each sampling stratum, through a twostage selection process, according to the sample allocation. Before the sample selection, the sampling frame was sorted by district and subdistrict within each sampling stratum. By using a probability proportional to size selection at the first stage of sampling, an implicit stratification and proportional allocation were achieved at each of the lower administrative levels.

    For further details on sample design, see APPENDIX A of the final report.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    Five questionnaires were used for the 2023 JPFHS: (1) the Household Questionnaire, (2) the Woman’s Questionnaire, (3) the Man’s Questionnaire, (4) the Biomarker Questionnaire, and (5) the Fieldworker Questionnaire. The questionnaires, based on The DHS Program’s model questionnaires, were adapted to reflect the population and health issues relevant to Jordan. Input was solicited from various stakeholders representing government ministries and agencies, nongovernmental organisations, and international donors. After all questionnaires were finalised in English, they were translated into Arabic.

    Cleaning operations

    All electronic data files for the 2023 JPFHS were transferred via SynCloud to the DoS central office in Amman, where they were stored on a password-protected computer. The data processing operation included secondary editing, which required resolution of computer-identified inconsistencies and coding of open-ended questions. Data editing was accomplished using CSPro software. During the duration of fieldwork, tables were generated to check various data quality parameters, and specific feedback was given to the teams to improve performance. Secondary editing and data processing were initiated in July and completed in September 2023.

    Response rate

    A total of 20,054 households were selected for the sample, of which 19,809 were occupied. Of the occupied households, 19,475 were successfully interviewed, yielding a response rate of 98%.

    In the interviewed households, 13,020 eligible women age 15–49 were identified for individual interviews; interviews were completed with 12,595 women, yielding a response rate of 97%. In the subsample of households selected for the male survey, 6,506 men age 15–59 were identified as eligible for individual interviews and 5,873 were successfully interviewed, yielding a response rate of 90%.

    Sampling error estimates

    The estimates from a sample survey are affected by two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the results of mistakes made in implementing data collection and in data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the 2023 Jordan Population and Family Health Survey (2023 JPFHS) to minimise this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.

    Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the 2023 JPFHS is only one of many samples that could have been selected from the same population, using the same design and sample size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.

    Sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design.

    If the sample of respondents had been selected by simple random sampling, it would have been possible to use straightforward formulas for calculating sampling errors. However, the 2023 JPFHS sample was the result of a multistage stratified design, and, consequently, it was necessary to use more complex formulas. Sampling errors are computed using SAS programs developed by ICF. These programs use the Taylor linearisation method to estimate variances for survey estimates that are means, proportions, or ratios. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rates.

    A more detailed description of estimates of sampling errors are presented in APPENDIX B of the survey report.

    Data appraisal

    Data Quality Tables

    • Household age distribution
    • Age distribution of eligible and interviewed women
    • Age distribution of eligible and interviewed men
    • Age displacement at age 14/15
    • Age displacement at age 49/50
    • Pregnancy outcomes by years preceding the survey
    • Completeness of reporting
    • Standardization exercise results from anthropometry training
    • Height and weight data completeness and quality for children
    • Height measurements from random subsample of measured children
    • Interference in height and weight measurements of children
    • Interference in height and weight measurements of women
    • Heaping in
  10. Improved Statistical Analysis of Low Abundance Phenomena in Bimodal...

    • plos.figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Friedrich Reinhard; Jan Roelof van der Meer (2023). Improved Statistical Analysis of Low Abundance Phenomena in Bimodal Bacterial Populations [Dataset]. http://doi.org/10.1371/journal.pone.0078288
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Friedrich Reinhard; Jan Roelof van der Meer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Accurate detection of subpopulation size determinations in bimodal populations remains problematic yet it represents a powerful way by which cellular heterogeneity under different environmental conditions can be compared. So far, most studies have relied on qualitative descriptions of population distribution patterns, on population-independent descriptors, or on arbitrary placement of thresholds distinguishing biological ON from OFF states. We found that all these methods fall short of accurately describing small population sizes in bimodal populations. Here we propose a simple, statistics-based method for the analysis of small subpopulation sizes for use in the free software environment R and test this method on real as well as simulated data. Four so-called population splitting methods were designed with different algorithms that can estimate subpopulation sizes from bimodal populations. All four methods proved more precise than previously used methods when analyzing subpopulation sizes of transfer competent cells arising in populations of the bacterium Pseudomonas knackmussii B13. The methods’ resolving powers were further explored by bootstrapping and simulations. Two of the methods were not severely limited by the proportions of subpopulations they could estimate correctly, but the two others only allowed accurate subpopulation quantification when this amounted to less than 25% of the total population. In contrast, only one method was still sufficiently accurate with subpopulations smaller than 1% of the total population. This study proposes a number of rational approximations to quantifying small subpopulations and offers an easy-to-use protocol for their implementation in the open source statistical software environment R.

  11. Analyzing and interpreting spatial and temporal variability of the United...

    • plos.figshare.com
    docx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meng Xu; Joel E. Cohen (2023). Analyzing and interpreting spatial and temporal variability of the United States county population distributions using Taylor's law [Dataset]. http://doi.org/10.1371/journal.pone.0226096
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Meng Xu; Joel E. Cohen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    We study the spatial and temporal variation of the human population in the United States (US) counties from 1790 to 2010, using an ecological scaling pattern called Taylor's law (TL). TL states that the variance of population abundance is a power function of the mean population abundance. Despite extensive studies of TL for non-human populations, testing and interpreting TL using data on human populations are rare. Here we examine three types of TL that quantify the spatial and temporal variation of US county population abundance. Our results show that TL and its quadratic extension describe the mean-variance relationship of county population distribution well. The slope and statistics of TL reveal economic and demographic trends of the county populations. We propose TL as a useful statistical tool for analyzing human population variability. We suggest new ways of using TL to select and make population projections.

  12. “Population” in Biology and Statistics: Topic Model and Analyses

    • figshare.com
    txt
    Updated Dec 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Charles Pence; Nicola Bertoldi (2024). “Population” in Biology and Statistics: Topic Model and Analyses [Dataset]. http://doi.org/10.6084/m9.figshare.27987335.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Dec 7, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Charles Pence; Nicola Bertoldi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    LicenseThe code present in code is licensed under the GNU GPL v3.All other data and figures present here are released under CC-BY 4.0.CodeA static version of the code that we used to generate the topic models is preserved here in the code directory. The latest version of that code, called quick-topic, is kept in a repository here:https://codeberg.org/cpence/quick-topicIf this repository is moved or closed, please look for future versions of this code at https://codeberg.org/cpence/ or https://codeberg.org/pencelab/.CorpusNote that not all files in this directory will work as expected, because the original, full-text journal articles cannot be uploaded here due to copyright provisions. Please contact us if you would like to make an arrangement to access this data.

  13. National Health Interview Survey

    • odgavaprod.ogopendata.com
    • healthdata.gov
    • +3more
    Updated Jul 25, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention, Department of Health & Human Services (2023). National Health Interview Survey [Dataset]. https://odgavaprod.ogopendata.com/dataset/national-health-interview-survey
    Explore at:
    Dataset updated
    Jul 25, 2023
    Description

    The National Health Interview Survey (NHIS) is the principal source of information on the health of the civilian noninstitutionalized population of the United States and is one of the major data collection programs of the National Center for Health Statistics (NCHS) which is part of the Centers for Disease Control and Prevention (CDC). The National Health Survey Act of 1956 provided for a continuing survey and special studies to secure accurate and current statistical information on the amount, distribution, and effects of illness and disability in the United States and the services rendered for or because of such conditions. The survey referred to in the Act, now called the National Health Interview Survey, was initiated in July 1957. Since 1960, the survey has been conducted by NCHS, which was formed when the National Health Survey and the National Vital Statistics Division were combined.
    NHIS data are used widely throughout the Department of Health and Human Services (DHHS) to monitor trends in illness and disability and to track progress toward achieving national health objectives. The data are also used by the public health research community for epidemiologic and policy analysis of such timely issues as characterizing those with various health problems, determining barriers to accessing and using appropriate health care, and evaluating Federal health programs.
    The NHIS also has a central role in the ongoing integration of household surveys in DHHS. The designs of two major DHHS national household surveys have been or are linked to the NHIS. The National Survey of Family Growth used the NHIS sampling frame in its first five cycles and the Medical Expenditure Panel Survey currently uses half of the NHIS sampling frame. Other linkage includes linking NHIS data to death certificates in the National Death Index (NDI).
    While the NHIS has been conducted continuously since 1957, the content of the survey has been updated about every 10-15 years. In 1996, a substantially revised NHIS questionnaire began field testing. This revised questionnaire, described in detail below, was implemented in 1997 and has improved the ability of the NHIS to provide important health information.

  14. N

    Malta, OH Population Breakdown by Gender and Age

    • neilsberg.com
    csv, json
    Updated Sep 14, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2023). Malta, OH Population Breakdown by Gender and Age [Dataset]. https://www.neilsberg.com/research/datasets/6703b61d-3d85-11ee-9abe-0aa64bf2eeb2/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Sep 14, 2023
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Ohio, Malta
    Variables measured
    Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Malta by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for Malta. The dataset can be utilized to understand the population distribution of Malta by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in Malta. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for Malta.

    Key observations

    Largest age group (population): Male # 35-39 years (52) | Female # 25-29 years (62). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.

    Age groups:

    • Under 5 years
    • 5 to 9 years
    • 10 to 14 years
    • 15 to 19 years
    • 20 to 24 years
    • 25 to 29 years
    • 30 to 34 years
    • 35 to 39 years
    • 40 to 44 years
    • 45 to 49 years
    • 50 to 54 years
    • 55 to 59 years
    • 60 to 64 years
    • 65 to 69 years
    • 70 to 74 years
    • 75 to 79 years
    • 80 to 84 years
    • 85 years and over

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

    Variables / Data Columns

    • Age Group: This column displays the age group for the Malta population analysis. Total expected values are 18 and are define above in the age groups section.
    • Population (Male): The male population in the Malta is shown in the following column.
    • Population (Female): The female population in the Malta is shown in the following column.
    • Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in Malta for each age group.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Malta Population by Gender. You can refer the same here

  15. Population of the world 10,000BCE-2100

    • statista.com
    Updated Nov 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Population of the world 10,000BCE-2100 [Dataset]. https://www.statista.com/statistics/1006502/global-population-ten-thousand-bc-to-2050/
    Explore at:
    Dataset updated
    Nov 28, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    World
    Description

    Until the 1800s, population growth was incredibly slow on a global level. The global population was estimated to have been around 188 million people in the year 1CE, and did not reach one billion until around 1803. However, since the 1800s, a phenomenon known as the demographic transition has seen population growth skyrocket, reaching eight billion people in 2023, and this is expected to peak at over 10 billion in the 2080s.

  16. 2018 American Community Survey: S0101 | AGE AND SEX (ACS 5-Year Estimates...

    • data.census.gov
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ACS, 2018 American Community Survey: S0101 | AGE AND SEX (ACS 5-Year Estimates Subject Tables) [Dataset]. https://data.census.gov/table/ACSST5Y2018.S0101?g=040XX00US39
    Explore at:
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Authors
    ACS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2018
    Description

    Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the .Technical Documentation.. section......Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the .Methodology.. section..Source: U.S. Census Bureau, 2014-2018 American Community Survey 5-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see .ACS Technical Documentation..). The effect of nonsampling error is not represented in these tables..The age dependency ratio is derived by dividing the combined under-18 and 65-and-over populations by the 18-to-64 population and multiplying by 100..The old-age dependency ratio is derived by dividing the population 65 and over by the 18-to-64 population and multiplying by 100..The child dependency ratio is derived by dividing the population under 18 by the 18-to-64 population and multiplying by 100..When information is missing or inconsistent, the Census Bureau logically assigns an acceptable value using the response to a related question or questions. If a logical assignment is not possible, data are filled using a statistical process called allocation, which uses a similar individual or household to provide a donor value. The "Allocated" section is the number of respondents who received an allocated value for a particular subject..While the 2014-2018 American Community Survey (ACS) data generally reflect the February 2013 Office of Management and Budget (OMB) definitions of metropolitan and micropolitan statistical areas; in certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB definitions due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:..An "**" entry in the margin of error column indicates that either no sample observations or too few sample observations were available to compute a standard error and thus the margin of error. A statistical test is not appropriate..An "-" entry in the estimate column indicates that either no sample observations or too few sample observations were available to compute an estimate, or a ratio of medians cannot be calculated because one or both of the median estimates falls in the lowest interval or upper interval of an open-ended distribution, or the margin of error associated with a median was larger than the median itself..An "-" following a median estimate means the median falls in the lowest interval of an open-ended distribution..An "+" following a median estimate means the median falls in the upper interval of an open-ended distribution..An "***" entry in the margin of error column indicates that the median falls in the lowest interval or upper interval of an open-ended distribution. A statistical test is not appropriate..An "*****" entry in the margin of error column indicates that the estimate is controlled. A statistical test for sampling variability is not appropriate. .An "N" entry in the estimate and margin of error columns indicates that data for this geographic area cannot be displayed because the number of sample cases is too small..An "(X)" means that the estimate is not applicable or not available....

  17. d

    Current Population Survey (CPS)

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Damico, Anthony (2023). Current Population Survey (CPS) [Dataset]. http://doi.org/10.7910/DVN/AK4FDD
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Damico, Anthony
    Description

    analyze the current population survey (cps) annual social and economic supplement (asec) with r the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics ( bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups b y state - consider pooling multiple years. county-level is a no-no. despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be t reated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population. the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show. this new github repository contains three scripts: 2005-2012 asec - download all microdata.R down load the fixed-width file containing household, family, and person records import by separating this file into three tables, then merge 'em together at the person-level download the fixed-width file containing the person-level replicate weights merge the rectangular person-level file with the replicate weights, then store it in a sql database create a new variable - one - in the data table 2012 asec - analysis examples.R connect to the sql database created by the 'download all microdata' progr am create the complex sample survey object, using the replicate weights perform a boatload of analysis examples replicate census estimates - 2011.R connect to the sql database created by the 'download all microdata' program create the complex sample survey object, using the replicate weights match the sas output shown in the png file below 2011 asec replicate weight sas output.png statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document. click here to view these three scripts for more detail about the current population survey - annual social and economic supplement (cps-asec), visit: the census bureau's current population survey page the bureau of labor statistics' current population survey page the current population survey's wikipedia article notes: interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current populat ion survey to talk about america, subract a year from the data file name. as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research. confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D

  18. N

    Illinois Population Breakdown by Gender and Age Dataset: Male and Female...

    • neilsberg.com
    csv, json
    Updated Feb 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2025). Illinois Population Breakdown by Gender and Age Dataset: Male and Female Population Distribution Across 18 Age Groups // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/e1e81541-f25d-11ef-8c1b-3860777c1fe6/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Feb 24, 2025
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Illinois
    Variables measured
    Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
    Measurement technique
    The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the population of Illinois by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for Illinois. The dataset can be utilized to understand the population distribution of Illinois by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in Illinois. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for Illinois.

    Key observations

    Largest age group (population): Male # 30-34 years (437,015) | Female # 30-34 years (429,453). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

    Age groups:

    • Under 5 years
    • 5 to 9 years
    • 10 to 14 years
    • 15 to 19 years
    • 20 to 24 years
    • 25 to 29 years
    • 30 to 34 years
    • 35 to 39 years
    • 40 to 44 years
    • 45 to 49 years
    • 50 to 54 years
    • 55 to 59 years
    • 60 to 64 years
    • 65 to 69 years
    • 70 to 74 years
    • 75 to 79 years
    • 80 to 84 years
    • 85 years and over

    Scope of gender :

    Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

    Variables / Data Columns

    • Age Group: This column displays the age group for the Illinois population analysis. Total expected values are 18 and are define above in the age groups section.
    • Population (Male): The male population in the Illinois is shown in the following column.
    • Population (Female): The female population in the Illinois is shown in the following column.
    • Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in Illinois for each age group.

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Illinois Population by Gender. You can refer the same here

  19. undefined undefined: undefined | undefined (undefined)

    • data.census.gov
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States Census Bureau, undefined undefined: undefined | undefined (undefined) [Dataset]. https://data.census.gov/table?g=050XX00US06025&tid=ACSST1Y2021.S0701
    Explore at:
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the Technical Documentation section.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2021 American Community Survey 1-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..Foreign born excludes people born outside the United States to a parent who is a U.S. citizen..When information is missing or inconsistent, the Census Bureau logically assigns an acceptable value using the response to a related question or questions. If a logical assignment is not possible, data are filled using a statistical process called allocation, which uses a similar individual or household to provide a donor value. The "Allocated" section is the number of respondents who received an allocated value for a particular subject..The 2021 American Community Survey (ACS) data generally reflect the March 2020 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineations due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution. For a 5-year median estimate, the margin of error associated with a median was larger than the median itself.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.

  20. 2019 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates...

    • data.census.gov
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ACS, 2019 American Community Survey: S0101 | AGE AND SEX (ACS 1-Year Estimates Subject Tables) [Dataset]. https://data.census.gov/cedsci/table?q=2019%20acs
    Explore at:
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Authors
    ACS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2019
    Description

    Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, it is the Census Bureau's Population Estimates Program that produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the Technical Documentation section.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2019 American Community Survey 1-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..The age dependency ratio is derived by dividing the combined under-18 and 65-and-over populations by the 18-to-64 population and multiplying by 100..The old-age dependency ratio is derived by dividing the population 65 and over by the 18-to-64 population and multiplying by 100..The child dependency ratio is derived by dividing the population under 18 by the 18-to-64 population and multiplying by 100..When information is missing or inconsistent, the Census Bureau logically assigns an acceptable value using the response to a related question or questions. If a logical assignment is not possible, data are filled using a statistical process called allocation, which uses a similar individual or household to provide a donor value. The "Allocated" section is the number of respondents who received an allocated value for a particular subject..The 2019 American Community Survey (ACS) data generally reflect the September 2018 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineations due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:An "**" entry in the margin of error column indicates that either no sample observations or too few sample observations were available to compute a standard error and thus the margin of error. A statistical test is not appropriate.An "-" entry in the estimate column indicates that either no sample observations or too few sample observations were available to compute an estimate, or a ratio of medians cannot be calculated because one or both of the median estimates falls in the lowest interval or upper interval of an open-ended distribution, or the margin of error associated with a median was larger than the median itself.An "-" following a median estimate means the median falls in the lowest interval of an open-ended distribution.An "+" following a median estimate means the median falls in the upper interval of an open-ended distribution.An "***" entry in the margin of error column indicates that the median falls in the lowest interval or upper interval of an open-ended distribution. A statistical test is not appropriate.An "*****" entry in the margin of error column indicates that the estimate is controlled. A statistical test for sampling variability is not appropriate. An "N" entry in the estimate and margin of error columns indicates that data for this geographic area cannot be displayed because the number of sample cases is too small.An "(X)" means that the estimate is not applicable or not available.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Umeå university (2025). The database TABVERK [Dataset]. https://researchdata.se/en/catalogue/dataset/ext0084-1

The database TABVERK

Explore at:
5 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jun 24, 2025
Dataset provided by
Umeå University
Authors
Umeå university
Time period covered
1749 - 1859
Description

The database TABVERK contains population statistics in the Swedish parishes during the period 1749-1859. It constitutes the earliest population statistics throughout the world making it a unique source. Merely in Finland, a similar time serial of data exists. The database gathers all the information about the parishes’ populations that Swedish clergymens filled out in large forms of tables which were reported to the Tabellkommissionen in Stockholm. Now this information is accessible through the agency of the Demographic Data Base.

The database contain information on births, deaths, marriage, and occupation. The population is describe by sex, age, civil status, and by occupation and social status every fifth year. Yearly statistics is given about the demographical events, fertility, mortality and nuptiality, as well as migrations in the 19th century. The statistics in Tabellverket (TABVERK) enable demographical studies from several angles on both local and regional level in Sweden during the pre-industrial society.

TABVERK holds an online search tool called SHiPS (Swedish Historical Population Statistics), which can be reached and utilized from the home page of the Demographic Data Base.

Purpose:

The Demographic Database (DDB) is a special unit at Umeå University with activities focused on two main areas: data production and research. DDB creates and makes available population databases, mainly based on historical information in parish registers during the 18th, 19th, and 20th centuries.

Search
Clear search
Close search
Google apps
Main menu