32 datasets found

Population estimates time series dataset
ons.gov.uk
cy.ons.gov.uk
csv, xlsx
Updated Nov 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics (2025). Population estimates time series dataset [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/populationandmigration/populationestimates/datasets/populationestimatestimeseriesdataset
Explore at:
csv, xlsxAvailable download formats
Dataset updated
Nov 27, 2025
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
The mid-year estimates refer to the population on 30 June of the reference year and are produced in line with the standard United Nations (UN) definition for population estimates. They are the official set of population estimates for the UK and its constituent countries, the regions and counties of England, and local authorities and their equivalents.
Demographic balances and indicators by type of projection and NUTS 3 region
ec.europa.eu
Updated Apr 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat (2025). Demographic balances and indicators by type of projection and NUTS 3 region [Dataset]. http://doi.org/10.2908/PROJ_19RDBI3
Explore at:
application/vnd.sdmx.genericdata+xml;version=2.1, application/vnd.sdmx.data+csv;version=1.0.0, json, tsv, application/vnd.sdmx.data+csv;version=2.0.0, application/vnd.sdmx.data+xml;version=3.0.0Available download formats
Unique identifier
https://doi.org/10.2908/PROJ_19RDBI3
Dataset updated
Apr 14, 2025
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2019 - 2100
Area covered
Silistra, Miesbach, Merzig-Wadern, Ferrara, Teramo, Worms, Kreisfreie Stadt, Rheinisch-Bergischer Kreis, West-Noord-Brabant, Pohjois-Savo (NUTS 2021), Solingen, Kreisfreie Stadt
Description
EUROPOP2019 are the latest Eurostat population projections produced at national and subnational levels for 31 countries: all 27 European Union (EU) Member States and four European Free Trade Association (EFTA) countries, covering the time horizon from 2019 to 2100.

Population projections are 'what-if scenario' that aim to show the hypothetically developments of the population size and its structure based on a sets of assumptions for fertility, mortality and net migration; they are presented for a long time period that covers more than a half-century (50 years).

The datasets at national level are composed by the baseline population projections and five sensitivity tests, namely:

no migration – it is assumed that the net migration is set to zero in each year of the entire horizon of projections;

lower migration – it is assumed that the net migration is 33% lower than in the baseline assumptions, in each year of the entire horizon of projections;

higher migration – it is assumed that the net migration is 33% higher than in the baseline assumptions, in each year of the entire horizon of projections;

lower fertility - it is assumed that the fertility rates are lower 20% than in the baseline assumptions, in each year of the entire horizon of projections;

lower mortality - it is assumed that the mortality rates are decreased such that the life expectancy at birth will increase of about two years by 2070 when compared with the baseline assumptions.

Data are available by single year time interval, as follows:

Projected population on 1 January by age and sex;

Assumptions on future age-specific fertility rates, probabilities of dying and net migration levels;

Projected life expectancy by age (in completed years) and sex.

Moreover, the demographic balances and indicators are available for the baseline projections and the five sensitive variants:

Total numbers of the projected live births and deaths;

Projected population structure indicators: proportions of broad age groups in total population, age dependency ratios and median ages of the population (for each sex component).

The dataset at regional level is composed by the baseline population projections and covers all 1169 regions classified as NUTS level 3 corresponding to the NUTS-2016 classification (the Nomenclature of Territorial Units for Statistics) and the 47 Statistical Regions (SR) agreed between European Commission and EFTA countries. Statistical regions are defined according to principles similar to those used in the establishment of the NUTS classification.

For all 1216 regions NUTS-3 level, data are available by single year time interval as follows:

Projected population on 1 January by age and sex;

Assumptions on future age-specific fertility rates, probabilities of dying and net migration levels;

Projected deaths by age and sex;

Projected life expectancy by age (reached during the year) and sex, which is computed according to the method described in the https://ec.europa.eu/eurostat/cache/metadata/Annexes/proj_19n_esms_an_24.pdf" target="_self">Technical note - Alternative life table (with annex)

In addition to the baseline projections, datasets on projected population at regional level are available for two sensitivity tests:

no migration - it is assumed that migration is zero for both international and internal components in each year of the entire horizon of projections;

no inter-regional migration - it is assumed that only internal migration is zero in each year of the entire horizon of projections.

Moreover, the demographic balances and indicators are available for the baseline projections and the two sensitive variants:

Total numbers of the projected live births by sex and deaths;

Projected population structure indicators: proportions of broad age groups in total population, age dependency ratios and median ages of the population (for each sex component).

The additional dataset called ‘Short-term update of the projected population (2022-2032)’ [proj_stp22] was published on 28 September 2022. While EUROPOP2019 remain the main set of reference for population projections, this new dataset includes updates of baseline projections for the total population, population in the age group 15 to 74 years (considered as the population in the working-age group), and its share in the total population. In addition, two sensitivity tests are carried out – high and very high number of refugees – by introducing in the baseline projections a shock due to the mass-influx of refugees fleeing the war in Ukraine, and who have received temporary protection in the EU countries.

The updated EUROPOP2019 projections were constructed from cumulative sums of weighted averages of annual population changes of two series: the original EUROPOP2019 projection and a new short-term population projection computed from the latest available data over the period of 10 years.

The two sensitivity tests were built on the following assumptions:

High number of refugees sensitivity test – assumes that the influx of refugees occurs during 2022 only, and is followed by annual returns at a constant rate such that at the end of 2031 the remaining number of refugees is 10% of the total influx in 2022;

Very high number of refugees sensitivity test – assumes that the influx of refugees occurs during 2022 and 2023, and is followed by annual returns at a constant rate such that at the end of 2031 the remaining number of refugees is 15% of the cumulated influx in 2022 and 2023.
New_Zealand_Births_and_Deaths_by_Region
kaggle.com
zip
Updated Oct 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ogundele Oluwanishola (2025). New_Zealand_Births_and_Deaths_by_Region [Dataset]. https://www.kaggle.com/datasets/ogundeleoluwanishola/new-zealand-births-and-deaths-by-region
Explore at:
zip(820402 bytes)Available download formats
Dataset updated
Oct 30, 2025
Authors
Ogundele Oluwanishola
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
New Zealand
Description
Dataset Overview

This dataset contains comprehensive birth and death statistics for New Zealand regions spanning 18 years (2005-2022).

Content

Period: Years from 2005 to 2022 (18 years)

Regions: 16 regions across New Zealand

Metrics: Birth counts and death counts per region per year

Total Records: 576+ data points

Columns

Period - Year (2005-2022)

Birth_Death - Category (Births or Deaths)

Region - New Zealand region name

Count - Number of births or deaths

Potential Use Cases

✅ Demographic trend analysis ✅ Regional population studies
✅ Time series forecasting ✅ Machine learning prediction models ✅ COVID-19 impact analysis ✅ Statistical analysis practice

Related Work

This dataset is used in my comprehensive analysis project: - GitHub: https://github.com/0luwanishola/New-Zealand-birth-analysis-and-model - Kaggle Notebooks: [Links will be added after publishing]

Data Source

New Zealand official statistics (December 2022)

Inspiration

What patterns can you find in New Zealand's demographic trends? How did COVID-19 impact birth rates? Can you predict future birth rates using machine learning? Tags: Add these tags (type and press Enter after each) demographics new zealand births time series analysis beginner healthcare social science License: Select CC0: Public Domain or CC BY-SA 4.0
Assumptions for probability of dying by age, sex and type of projection
ec.europa.eu
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat, Assumptions for probability of dying by age, sex and type of projection [Dataset]. http://doi.org/10.2908/PROJ_19NAASMR
Explore at:
json, application/vnd.sdmx.data+csv;version=1.0.0, tsv, application/vnd.sdmx.data+csv;version=2.0.0, application/vnd.sdmx.genericdata+xml;version=2.1, application/vnd.sdmx.data+xml;version=3.0.0Available download formats
Unique identifier
https://doi.org/10.2908/PROJ_19NAASMR
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2019 - 2100
Area covered
Romania, Cyprus, Belgium, Hungary, Denmark, Switzerland, Austria, Norway, Czechia, Iceland
Description
EUROPOP2019 are the latest Eurostat population projections produced at national and subnational levels for 31 countries: all 27 European Union (EU) Member States and four European Free Trade Association (EFTA) countries, covering the time horizon from 2019 to 2100.

Population projections are 'what-if scenario' that aim to show the hypothetically developments of the population size and its structure based on a sets of assumptions for fertility, mortality and net migration; they are presented for a long time period that covers more than a half-century (50 years).

The datasets at national level are composed by the baseline population projections and five sensitivity tests, namely:

no migration – it is assumed that the net migration is set to zero in each year of the entire horizon of projections;

lower migration – it is assumed that the net migration is 33% lower than in the baseline assumptions, in each year of the entire horizon of projections;

higher migration – it is assumed that the net migration is 33% higher than in the baseline assumptions, in each year of the entire horizon of projections;

lower fertility - it is assumed that the fertility rates are lower 20% than in the baseline assumptions, in each year of the entire horizon of projections;

lower mortality - it is assumed that the mortality rates are decreased such that the life expectancy at birth will increase of about two years by 2070 when compared with the baseline assumptions.

Data are available by single year time interval, as follows:

Projected population on 1 January by age and sex;

Assumptions on future age-specific fertility rates, probabilities of dying and net migration levels;

Projected life expectancy by age (in completed years) and sex.

Moreover, the demographic balances and indicators are available for the baseline projections and the five sensitive variants:

Total numbers of the projected live births and deaths;

Projected population structure indicators: proportions of broad age groups in total population, age dependency ratios and median ages of the population (for each sex component).

The dataset at regional level is composed by the baseline population projections and covers all 1169 regions classified as NUTS level 3 corresponding to the NUTS-2016 classification (the Nomenclature of Territorial Units for Statistics) and the 47 Statistical Regions (SR) agreed between European Commission and EFTA countries. Statistical regions are defined according to principles similar to those used in the establishment of the NUTS classification.

For all 1216 regions NUTS-3 level, data are available by single year time interval as follows:

Projected population on 1 January by age and sex;

Assumptions on future age-specific fertility rates, probabilities of dying and net migration levels;

Projected deaths by age and sex;

Projected life expectancy by age (reached during the year) and sex, which is computed according to the method described in the https://ec.europa.eu/eurostat/cache/metadata/Annexes/proj_19n_esms_an_24.pdf" target="_self">Technical note - Alternative life table (with annex)

In addition to the baseline projections, datasets on projected population at regional level are available for two sensitivity tests:

no migration - it is assumed that migration is zero for both international and internal components in each year of the entire horizon of projections;

no inter-regional migration - it is assumed that only internal migration is zero in each year of the entire horizon of projections.

Moreover, the demographic balances and indicators are available for the baseline projections and the two sensitive variants:

Total numbers of the projected live births by sex and deaths;

Projected population structure indicators: proportions of broad age groups in total population, age dependency ratios and median ages of the population (for each sex component).

The additional dataset called ‘Short-term update of the projected population (2022-2032)’ [proj_stp22] was published on 28 September 2022. While EUROPOP2019 remain the main set of reference for population projections, this new dataset includes updates of baseline projections for the total population, population in the age group 15 to 74 years (considered as the population in the working-age group), and its share in the total population. In addition, two sensitivity tests are carried out – high and very high number of refugees – by introducing in the baseline projections a shock due to the mass-influx of refugees fleeing the war in Ukraine, and who have received temporary protection in the EU countries.

The updated EUROPOP2019 projections were constructed from cumulative sums of weighted averages of annual population changes of two series: the original EUROPOP2019 projection and a new short-term population projection computed from the latest available data over the period of 10 years.

The two sensitivity tests were built on the following assumptions:

High number of refugees sensitivity test – assumes that the influx of refugees occurs during 2022 only, and is followed by annual returns at a constant rate such that at the end of 2031 the remaining number of refugees is 10% of the total influx in 2022;

Very high number of refugees sensitivity test – assumes that the influx of refugees occurs during 2022 and 2023, and is followed by annual returns at a constant rate such that at the end of 2031 the remaining number of refugees is 15% of the cumulated influx in 2022 and 2023.
World Development Indicators
kaggle.com
zip
Updated Dec 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Umit Kayaardi (2025). World Development Indicators [Dataset]. https://www.kaggle.com/datasets/umitka/world-development-indicators
Explore at:
zip(66543746 bytes)Available download formats
Dataset updated
Dec 10, 2025
Authors
Umit Kayaardi
License
https://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets
Description
Content & Context:

This dataset provides a comprehensive collection of annual country-level indicators spanning social, economic, environmental, financial, and demographic domains for 265 countries and regions from 1960 to 2024. Each row corresponds to a unique combination of country/region, indicator, sex, and age group, with values reported annually in wide format. The indicators cover areas such as population demographics, health, education, labor, income, trade, government finance, agriculture, energy, environmental sustainability, and infrastructure, making it suitable for trend analysis, cross-country comparisons, policy research, and predictive modeling. Missing values are marked as NaN, and metadata columns provide additional context including units, aggregation methods, and data sources.

It is ideal for:

Trend analysis over time (e.g., population growth, GDP, education levels)

Comparisons between countries or regions

Research and policy studies in development, economics, health, agriculture, and sustainability

Machine learning and predictive modeling using historical indicators

Example Usage:

Visualizing trends: Compare GDP per capita or fertility rates over decades.

Country comparisons: Examine health or education indicators across regions.

Predictive modeling: Forecast future economic or demographic trends.

Notes:

Wide format is suitable for trend analysis; can be converted to long format using pd.melt() if needed.

Early years (1960s–1970s) have more missing data.

Dataset is cleaned and contains no duplicate rows.

Columns:

FREQ – Data frequency code (e.g., 'A' = Annual)

FREQ_LABEL – Frequency label (e.g., 'Annual')

REF_AREA – Country/region ISO code

REF_AREA_LABEL – Country/region name

INDICATOR – Indicator code (WDI code)

INDICATOR_LABEL – Indicator name and description

SEX – Sex code (_T=Total, M=Male, F=Female)

SEX_LABEL – Sex description

AGE – Age group code (_T=All ages or no breakdown)

AGE_LABEL – Age group description

UNIT_MEASURE – Unit code

UNIT_MEASURE_LABEL – Unit description

AGG_METHOD – Aggregation method code

AGG_METHOD_LABEL – Aggregation method description

DECIMALS – Number of decimal places reported

DECIMALS_LABEL – Decimal description

DATABASE_ID – Source database code

DATABASE_ID_LABEL – Source database name

UNIT_MULT – Unit multiplier (e.g., 1, 1000, 1e6)

UNIT_MULT_LABEL – Multiplier description

DATA_SOURCE – Data source code

DATA_SOURCE_LABEL – Data source name

OBS_STATUS – Observation status code

OBS_STATUS_LABEL – Observation status description

OBS_CONF – Confidence level code

OBS_CONF_LABEL – Confidence level description

1960, 1961, 1962, …, 2024 – Annual values for each indicator (float64, missing values marked as NaN)
Wikipedia Dataset
kaggle.com
zip
Updated Oct 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vincent Amonde (2025). Wikipedia Dataset [Dataset]. https://www.kaggle.com/datasets/vincentdsc/wikipedia-dataset
Explore at:
zip(12363201 bytes)Available download formats
Dataset updated
Oct 13, 2025
Authors
Vincent Amonde
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset 1: Wikipedia Article Metadata and Content Distribution (2019–2023)

This dataset represents metadata and structural information extracted from Wikipedia articles across multiple language editions between January 2019 and December 2023. The data was collected through the Wikimedia REST API and Wikidata Query Service, focusing on high-level article characteristics such as content length, number of references, topic classification, and readership activity. Each row corresponds to a unique Wikipedia article identified by an article_id and includes metadata describing its topic category (e.g., Politics, Science, Culture), geographic focus, and quality assessment.

The dataset was designed to help quantify content inequality and topic bias across languages. For example, English and German editions tend to have more extensive coverage of scientific and technological topics, while Swahili and Arabic editions show higher representation of local cultural and geographical content but fewer high-quality (“Featured Article”) designations. Article-level metrics like word_count, references_count, and page_views were gathered to provide indicators of article depth, credibility, and public engagement. The last_edit_date variable helps capture how frequently articles are updated, indicating editorial activity over time.

Temporal coverage: 2019–2023 Data sources: Wikimedia REST API, Wikidata Query Service, Pageview Analytics Primary purpose: To analyze disparities in article depth, topic diversity, and regional focus across Wikipedia’s major language editions.

Dataset 2: Wikipedia Editor Demographics and Contribution Data (2018–2023)

This dataset summarizes demographic and contribution patterns of active Wikipedia editors from 2018 to 2023, based on public edit histories available through the Wikimedia Dumps and MediaWiki API. Each record corresponds to a unique editor identified by editor_id, containing attributes such as country, primary language of editing, total edit counts, and dominant topic area.

Although Wikipedia does not directly record personal information, country and language data were inferred using IP-based geolocation for anonymous edits and user-declared data for registered contributors. The dataset was sampled to capture editors across seven major languages (English, French, Spanish, German, Swahili, Arabic, and Chinese). Demographic variables like gender and education_level are approximations derived from community surveys conducted by the Wikimedia Foundation in 2019 and 2021, used here to represent broad participation trends rather than individual identities.

This dataset provides insight into editorial imbalance, highlighting, for example, that editors from Europe and North America contribute disproportionately more to technical and scientific topics compared to those from Africa or South America. Fields such as total_edits, articles_edited, and avg_edit_size reflect productivity and depth of engagement, while active_since helps trace editor retention and historical participation.

Temporal coverage: 2018–2023 Data sources: Wikimedia Dumps, MediaWiki API, Wikimedia Community Surveys (2019, 2021) Primary purpose: To analyze demographic participation gaps and editing activity distribution across languages and regions.

Dataset 3: Wikipedia Language and Geographic Coverage Statistics (2023)

This dataset presents aggregated statistics at the language edition level, representing Wikipedia’s overall content and contributor structure as of December 2023. The data was compiled from the Wikimedia Statistics Portal and Meta-Wiki language reports, which provide high-level metrics such as total number of articles, average article length, number of active editors, and editing intensity per language.

Each entry represents one Wikipedia language edition, capturing its global footprint and coverage balance. The column coverage_score is a composite index derived from article volume, diversity of covered topics, and proportional representation of countries and regions. underrepresented_regions indicates the number of global regions (out of ten defined by the UN geoscheme) that have low coverage or minimal article representation in that language edition. The dataset allows researchers to identify which language Wikipedias most effectively cover global topics and which remain regionally or linguistically constrained.
Population on 1st January by age, sex, type of projection and NUTS 3 region
ec.europa.eu
Updated Nov 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat (2024). Population on 1st January by age, sex, type of projection and NUTS 3 region [Dataset]. http://doi.org/10.2908/PROJ_19RP3
Explore at:
tsv, json, application/vnd.sdmx.data+csv;version=1.0.0, application/vnd.sdmx.data+xml;version=3.0.0, application/vnd.sdmx.data+csv;version=2.0.0, application/vnd.sdmx.genericdata+xml;version=2.1Available download formats
Unique identifier
https://doi.org/10.2908/PROJ_19RP3
Dataset updated
Nov 11, 2024
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2019 - 2100
Area covered
Zuidwest-Friesland (NUTS 2021), Kreisfreie Stadt, Halle (Saale), Pirkanmaa (NUTS 2021), Lanzarote, Rheinisch-Bergischer Kreis, Traunviertel, Catanzaro, Lot, Telšių apskritis, Mansfeld-Südharz
Description
EUROPOP2019 are the latest Eurostat population projections produced at national and subnational levels for 31 countries: all 27 European Union (EU) Member States and four European Free Trade Association (EFTA) countries, covering the time horizon from 2019 to 2100.

Population projections are 'what-if scenario' that aim to show the hypothetically developments of the population size and its structure based on a sets of assumptions for fertility, mortality and net migration; they are presented for a long time period that covers more than a half-century (50 years).

The datasets at national level are composed by the baseline population projections and five sensitivity tests, namely:

no migration – it is assumed that the net migration is set to zero in each year of the entire horizon of projections;

lower migration – it is assumed that the net migration is 33% lower than in the baseline assumptions, in each year of the entire horizon of projections;

higher migration – it is assumed that the net migration is 33% higher than in the baseline assumptions, in each year of the entire horizon of projections;

lower fertility - it is assumed that the fertility rates are lower 20% than in the baseline assumptions, in each year of the entire horizon of projections;

lower mortality - it is assumed that the mortality rates are decreased such that the life expectancy at birth will increase of about two years by 2070 when compared with the baseline assumptions.

Data are available by single year time interval, as follows:

Projected population on 1 January by age and sex;

Assumptions on future age-specific fertility rates, probabilities of dying and net migration levels;

Projected life expectancy by age (in completed years) and sex.

Moreover, the demographic balances and indicators are available for the baseline projections and the five sensitive variants:

Total numbers of the projected live births and deaths;

Projected population structure indicators: proportions of broad age groups in total population, age dependency ratios and median ages of the population (for each sex component).

The dataset at regional level is composed by the baseline population projections and covers all 1169 regions classified as NUTS level 3 corresponding to the NUTS-2016 classification (the Nomenclature of Territorial Units for Statistics) and the 47 Statistical Regions (SR) agreed between European Commission and EFTA countries. Statistical regions are defined according to principles similar to those used in the establishment of the NUTS classification.

For all 1216 regions NUTS-3 level, data are available by single year time interval as follows:

Projected population on 1 January by age and sex;

Assumptions on future age-specific fertility rates, probabilities of dying and net migration levels;

Projected deaths by age and sex;

Projected life expectancy by age (reached during the year) and sex, which is computed according to the method described in the https://ec.europa.eu/eurostat/cache/metadata/Annexes/proj_19n_esms_an_24.pdf" target="_self">Technical note - Alternative life table (with annex)

In addition to the baseline projections, datasets on projected population at regional level are available for two sensitivity tests:

no migration - it is assumed that migration is zero for both international and internal components in each year of the entire horizon of projections;

no inter-regional migration - it is assumed that only internal migration is zero in each year of the entire horizon of projections.

Moreover, the demographic balances and indicators are available for the baseline projections and the two sensitive variants:

Total numbers of the projected live births by sex and deaths;

Projected population structure indicators: proportions of broad age groups in total population, age dependency ratios and median ages of the population (for each sex component).

The additional dataset called ‘Short-term update of the projected population (2022-2032)’ [proj_stp22] was published on 28 September 2022. While EUROPOP2019 remain the main set of reference for population projections, this new dataset includes updates of baseline projections for the total population, population in the age group 15 to 74 years (considered as the population in the working-age group), and its share in the total population. In addition, two sensitivity tests are carried out – high and very high number of refugees – by introducing in the baseline projections a shock due to the mass-influx of refugees fleeing the war in Ukraine, and who have received temporary protection in the EU countries.

The updated EUROPOP2019 projections were constructed from cumulative sums of weighted averages of annual population changes of two series: the original EUROPOP2019 projection and a new short-term population projection computed from the latest available data over the period of 10 years.

The two sensitivity tests were built on the following assumptions:

High number of refugees sensitivity test – assumes that the influx of refugees occurs during 2022 only, and is followed by annual returns at a constant rate such that at the end of 2031 the remaining number of refugees is 10% of the total influx in 2022;

Very high number of refugees sensitivity test – assumes that the influx of refugees occurs during 2022 and 2023, and is followed by annual returns at a constant rate such that at the end of 2031 the remaining number of refugees is 15% of the cumulated influx in 2022 and 2023.
Short-term update of the projected population (2022-2032)
ec.europa.eu
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat, Short-term update of the projected population (2022-2032) [Dataset]. http://doi.org/10.2908/PROJ_STP22
Explore at:
application/vnd.sdmx.data+csv;version=2.0.0, application/vnd.sdmx.genericdata+xml;version=2.1, json, application/vnd.sdmx.data+xml;version=3.0.0, application/vnd.sdmx.data+csv;version=1.0.0, tsvAvailable download formats
Unique identifier
https://doi.org/10.2908/PROJ_STP22
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2019 - 2032
Area covered
Hungary, Malta, Italy, Czechia, Liechtenstein, Slovenia, Austria, Estonia, Belgium, Euro area - 19 countries (2015-2022)
Description
EUROPOP2019 are the latest Eurostat population projections produced at national and subnational levels for 31 countries: all 27 European Union (EU) Member States and four European Free Trade Association (EFTA) countries, covering the time horizon from 2019 to 2100.

Population projections are 'what-if scenario' that aim to show the hypothetically developments of the population size and its structure based on a sets of assumptions for fertility, mortality and net migration; they are presented for a long time period that covers more than a half-century (50 years).

The datasets at national level are composed by the baseline population projections and five sensitivity tests, namely:

no migration – it is assumed that the net migration is set to zero in each year of the entire horizon of projections;

lower migration – it is assumed that the net migration is 33% lower than in the baseline assumptions, in each year of the entire horizon of projections;

higher migration – it is assumed that the net migration is 33% higher than in the baseline assumptions, in each year of the entire horizon of projections;

lower fertility - it is assumed that the fertility rates are lower 20% than in the baseline assumptions, in each year of the entire horizon of projections;

lower mortality - it is assumed that the mortality rates are decreased such that the life expectancy at birth will increase of about two years by 2070 when compared with the baseline assumptions.

Data are available by single year time interval, as follows:

Projected population on 1 January by age and sex;

Assumptions on future age-specific fertility rates, probabilities of dying and net migration levels;

Projected life expectancy by age (in completed years) and sex.

Moreover, the demographic balances and indicators are available for the baseline projections and the five sensitive variants:

Total numbers of the projected live births and deaths;

Projected population structure indicators: proportions of broad age groups in total population, age dependency ratios and median ages of the population (for each sex component).

The dataset at regional level is composed by the baseline population projections and covers all 1169 regions classified as NUTS level 3 corresponding to the NUTS-2016 classification (the Nomenclature of Territorial Units for Statistics) and the 47 Statistical Regions (SR) agreed between European Commission and EFTA countries. Statistical regions are defined according to principles similar to those used in the establishment of the NUTS classification.

For all 1216 regions NUTS-3 level, data are available by single year time interval as follows:

Projected population on 1 January by age and sex;

Assumptions on future age-specific fertility rates, probabilities of dying and net migration levels;

Projected deaths by age and sex;

Projected life expectancy by age (reached during the year) and sex, which is computed according to the method described in the https://ec.europa.eu/eurostat/cache/metadata/Annexes/proj_19n_esms_an_24.pdf" target="_self">Technical note - Alternative life table (with annex)

In addition to the baseline projections, datasets on projected population at regional level are available for two sensitivity tests:

no migration - it is assumed that migration is zero for both international and internal components in each year of the entire horizon of projections;

no inter-regional migration - it is assumed that only internal migration is zero in each year of the entire horizon of projections.

Moreover, the demographic balances and indicators are available for the baseline projections and the two sensitive variants:

Total numbers of the projected live births by sex and deaths;

Projected population structure indicators: proportions of broad age groups in total population, age dependency ratios and median ages of the population (for each sex component).

The additional dataset called ‘Short-term update of the projected population (2022-2032)’ [proj_stp22] was published on 28 September 2022. While EUROPOP2019 remain the main set of reference for population projections, this new dataset includes updates of baseline projections for the total population, population in the age group 15 to 74 years (considered as the population in the working-age group), and its share in the total population. In addition, two sensitivity tests are carried out – high and very high number of refugees – by introducing in the baseline projections a shock due to the mass-influx of refugees fleeing the war in Ukraine, and who have received temporary protection in the EU countries.

The updated EUROPOP2019 projections were constructed from cumulative sums of weighted averages of annual population changes of two series: the original EUROPOP2019 projection and a new short-term population projection computed from the latest available data over the period of 10 years.

The two sensitivity tests were built on the following assumptions:

High number of refugees sensitivity test – assumes that the influx of refugees occurs during 2022 only, and is followed by annual returns at a constant rate such that at the end of 2031 the remaining number of refugees is 10% of the total influx in 2022;

Very high number of refugees sensitivity test – assumes that the influx of refugees occurs during 2022 and 2023, and is followed by annual returns at a constant rate such that at the end of 2031 the remaining number of refugees is 15% of the cumulated influx in 2022 and 2023.
m
Dataset on cloud-enabled storage services
data.mendeley.com
Updated May 8, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chi-Hoon Song (2023). Dataset on cloud-enabled storage services [Dataset]. http://doi.org/10.17632/y98rmtf2py.5
Explore at:
Unique identifier
https://doi.org/10.17632/y98rmtf2py.5
Dataset updated
May 8, 2023
Authors
Chi-Hoon Song
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is analytical proofs and raw data for research article, “The Role of Protection Motivation in the Adoption of Cloud-enabled Storage Service”. The original article aimed to investigate how the threat of data loss influences an individual’s intention to adopt cloud-enabled storage service as protection against data loss. This article includes analytical proofs, psychometric details of the measures and measurement items, analytic tables-related to the original article and raw data. Files included are as follows.

○ File 1 - Title: Details of prior studies (2009 to 2019) on the adoption of cloud-enabled storage at individual level - Description: This file presents a review of twenty-three studies (2009 to 2019) that focused on
the adoption of cloud-enabled storage service at the individual level. ○ File 2 - Title: Details of prior on applications of PMT in IS and IT areas - Description: This file presents a review of forty-seven studies (2009 to 2019) of PMT in IS/IT research areas. ○ File 3 - Title: Measurement items - Description: This file reports psychometric details of the measures and measurement items used in the original research article. ○ File 4 - Title: Sample characteristics - Description: This file reports the demographic characteristics of the respondents. ○ File 5 - Title: raw data for empirical analytics - Description: This file contains raw data for the original study: 392 samples were used for its final analysis. This data were collected through an online survey in South Korea.
Z
LAU1 dataset
data.niaid.nih.gov
zenodo.org
Updated Nov 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Páleník, Michal (2024). LAU1 dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6165135
Explore at:
Dataset updated
Nov 29, 2024
Dataset provided by
IZ Bratislava; Faculty of management, Comenius University in Bratislava
Authors
Páleník, Michal
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Statistical open data on LAU regions of Slovakia, Czech Republic, Poland, Hungary (and other countries in the future). LAU1 regions are called counties, okres, okresy, powiat, járás, járási, NUTS4, LAU, Local Administrative Units, ... and there are 733 of them in this V4 dataset. Overall, we cover 733 regions which are described by 137.828 observations (panel data rows) and more than 1.760.229 data points.

This LAU dataset contains panel data on population, on age structure of inhabitants, on number and on structure of registered unemployed. Dataset prepared by Michal Páleník. Output files are in json, shapefiles, xls, ods, json, topojson or CSV formats. Downloadable at zenodo.org.

This dataset consists of:

data on unemployment (by gender, education and duration of unemployment),

data on vacancies,

open data on population in Visegrad counties (by age and gender),

data on unemployment share.

Combined latest dataset

dataset of the latest available data on unemployment, vacancies and population

dataset includes map contours (shp, topojson or geojson format), relation id in OpenStreetMap, wikidata entry code,

it also includes NUTS4 code, LAU1 code used by national statistical office and abbreviation of the region (usually license plate),

source of map contours is OpenStreetMap, licensed under ODbL

no time series, only most recent data on population and unemployment combined in one output file

columns: period, lau, name, registered_unemployed, registered_unemployed_females, disponible_unemployed, low_educated, long_term, unemployment_inflow, unemployment_outflow, below_25, over_55, vacancies, pop_period, TOTAL, Y15-64, Y15-64-females, local_lau, osm_id, abbr, wikidata, population_density, area_square_km, way

Slovakia – SK: 79 LAU1 regions, data for 2024-10-01, 1.659 data,

Czech Republic – CZ: 77 LAU1 regions, data for 2024-10-01, 1.617 data,

Poland – PL: 380 LAU1 regions, data for 2024-09-01, 6.840 data,

Hungary – HU: 197 LAU1 regions, data for 2024-10-01, 2.955 data,

13.071 data in total.

column/number of observations description SK CZ PL HU

period period (month and year) the data is for 79 77 380 197

lau LAU code of the region 79 77 380 197

name name of the region in local language 79 77 380 197

registered_unemployed number of unemployed registered at labour offices 79 77 380 197

registered_unemployed_females number of unemployed women 79 77 380 197

disponible_unemployed unemployed able to accept job offer 79 77 0 0

low_educated unmployed without secondary school (ISCED 0 and 1) 79 77 380 197

long_term unemployed for longer than 1 year 79 77 380 0

unemployment_inflow inflow into unemployment 79 77 0 0

unemployment_outflow outflow from unemployment 79 77 0 0

below_25 number of unemployed below 25 years of age 79 77 380 197

over_55 unemployed older than 55 years 79 77 380 197

vacancies number of vacancies reported by labour offices 79 77 380 0

pop_period date of population data 79 77 380 197

TOTAL total population 79 77 380 197

Y15-64 number of people between 15 and 64 years of age, population in economically active age 79 77 380 197

Y15-64-females number of women between 15 and 64 years of age 79 77 380 197

local_lau region's code used by local labour offices 79 77 380 197

osm_id relation id in OpenStreetMap database 79 77 380 197

abbr abbreviation used for this region 79 77 380 0

wikidata wikidata identification code 79 77 380 197

population_density population density 79 77 380 197

area_square_km area of the region in square kilometres 79 77 380 197

way geometry, polygon of given region 79 77 380 197

Unemployment dataset

time series of unemployment data in Visegrad regions

by gender, duration of unemployment, education level, age groups, vacancies,

columns: period, lau, name, registered_unemployed, registered_unemployed_females, disponible_unemployed, low_educated, long_term, unemployment_inflow, unemployment_outflow, below_25, over_55, vacancies

Slovakia – SK: 79 LAU1 regions, data for 334 periods (1997-01-01 ... 2024-10-01), 202.082 data,

Czech Republic – CZ: 77 LAU1 regions, data for 244 periods (2004-07-01 ... 2024-10-01), 147.528 data,

Poland – PL: 380 LAU1 regions, data for 189 periods (2005-03-01 ... 2024-09-01), 314.100 data,

Hungary – HU: 197 LAU1 regions, data for 106 periods (2016-01-01 ... 2024-10-01), 104.408 data,

768.118 data in total.

column/number of observations description SK CZ PL HU

period period (month and year) the data is for 26 386 18 788 71 772 20 882

lau LAU code of the region 26 386 18 788 71 772 20 882

name name of the region in local language 26 386 18 788 71 772 20 882

registered_unemployed number of unemployed registered at labour offices 26 386 18 788 71 772 20 882

registered_unemployed_females number of unemployed women 26 386 18 788 62 676 20 882

disponible_unemployed unemployed able to accept job offer 25 438 18 788 0 0

low_educated unmployed without secondary school (ISCED 0 and 1) 11 771 9855 41 388 20 881

long_term unemployed for longer than 1 year 24 253 9855 41 388 0

unemployment_inflow inflow into unemployment 26 149 16 478 0 0

unemployment_outflow outflow from unemployment 26 149 16 478 0 0

below_25 number of unemployed below 25 years of age 11 929 9855 17 100 20 881

over_55 unemployed older than 55 years 11 929 9855 17 100 20 882

vacancies number of vacancies reported by labour offices 11 692 18 788 62 676 0

Population dataset

time series on population by gender and 5 year age groups in V4 counties

columns: period, lau, name, gender, TOTAL, Y00-04, Y05-09, Y10-14, Y15-19, Y20-24, Y25-29, Y30-34, Y35-39, Y40-44, Y45-49, Y50-54, Y55-59, Y60-64, Y65-69, Y70-74, Y75-79, Y80-84, Y85-89, Y90-94, Y_GE95, Y15-64

Slovakia – SK: 79 LAU1 regions, data for 28 periods (1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023), 152.628 data,

Czech Republic – CZ: 78 LAU1 regions, data for 24 periods (2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023), 125.862 data,

Poland – PL: 382 LAU1 regions, data for 29 periods (1995, 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023), 626.941 data,

Hungary – HU: 197 LAU1 regions, data for 11 periods (2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023), 86.680 data,

992.111 data in total.

column/number of observations description SK CZ PL HU

period period (month and year) the data is for 6636 5574 32 883 4334

lau LAU code of the region 6636 5574 32 883 4334

name name of the region in local language 6636 5574 32 883 4334

gender gender (male or female) 6636 5574 32 883 4334

TOTAL total population 6636 5574 32 503 4334

Y00-04 inhabitants between 00 to 04 years inclusive 6636 5574 32 503 4334

Y05-09 number of inhabitants between 05 to 09 years of age 6636 5574 32 503 4334

Y10-14 number of people between 10 to 14 years inclusive 6636 5574 32 503 4334

Y15-19 number of inhabitants between 15 to 19 years of age 6636 5574 32 503 4334

Y20-24 number of people between 20 to 24 years inclusive 6636 5574 32 503 4334

Y25-29 number of inhabitants between 25 to 29 years of age 6636 5574 32 503 4334

Y30-34 inhabitants between 30 to 34 years inclusive 6636 5574 32 503 4334

Y35-39 number of inhabitants between 35 to 39 years of age 6636 5574 32 503 4334

Y40-44 inhabitants between 40 to 44 years inclusive 6636 5574 32 503 4334

Y45-49 number of inhabitants younger than 49 and older than 45 years 6636 5574 32 503 4334

Y50-54 inhabitants between 50 to 54 years inclusive 6636 5574 32 503 4334

Y55-59 number of inhabitants between 55 to 59 years of age 6636 5574 32 503 4334

Y60-64 inhabitants between 60 to 64 years inclusive 6636 5574 32 503 4334

Y65-69 number of inhabitants younger than 69 and older than 65 years 6636 5574 32 503 4334

Y70-74 inhabitants between 70 to 74 years inclusive 6636 5574 24 670 4334

Y75-79 number of inhabitants between 75 to 79 years of age 6636 5574 24 670 4334

Y80-84 number of people between 80 to 84 years inclusive 6636 5574 24 670 4334

Y85-89 number of inhabitants younger than 89 and older than 85 years 6636 5574 0 0

Y90-94 inhabitants between 90 to 94 years inclusive 6636 5574 0 0

Y_GE95 number of people 95 years or older 6636 3234 0 0

Y15-64 number of people between 15 and 64 years of age, population in economically active age 6636 5574 32 503 4334

Notes

more examples at www.iz.sk

NUTS4 / LAU1 / LAU codes for HU and PL are created by me, so they can (and will) change in the future; CZ and SK NUTS4 codes are used by local statistical offices, so they should be more stable

NUTS4 codes are consistent with NUTS3 codes used by Eurostat

local_lau variable is an identifier used by local statistical office

abbr is abbreviation of region's name, used for map purposes (usually cars' license plate code; except for Hungary)

wikidata is code used by wikidata

osm_id is region's relation number in the OpenStreetMap database

Example outputs

you can download data in CSV, xml, ods, xlsx, shp, SQL, postgis, topojson, geojson or json format at 📥 doi:10.5281/zenodo.6165135

Counties of Slovakia – unemployment rate in Slovak LAU1 regions

Regions of the Slovak Republic

Unemployment of Czechia and Slovakia – unemployment share in LAU1 regions of Slovakia and Czechia

interactive map on unemployment in Slovakia

Slovakia – SK, Czech Republic – CZ, Hungary – HU, Poland – PL, NUTS3 regions of Slovakia

download at 📥 doi:10.5281/zenodo.6165135

suggested citation: Páleník, M. (2024). LAU1 dataset [Data set]. IZ Bratislava. https://doi.org/10.5281/zenodo.6165135
T
Population changes in different regions of Qinghai Province (1998-2010)
data.tpdc.ac.cn
tpdc.ac.cn
zip
Updated Mar 23, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Provincial Qinghai (2021). Population changes in different regions of Qinghai Province (1998-2010) [Dataset]. https://data.tpdc.ac.cn/en/data/045521b6-344f-4cd0-86c6-e416ca2e514a
Explore at:
zipAvailable download formats
Dataset updated
Mar 23, 2021
Dataset provided by
TPDC
Authors
Provincial Qinghai
Area covered

Description
The data set records the statistical data of population change in different regions of Qinghai Province from 1998 to 2010, which is divided by region, total number of households, total population, birth population and death population. The data are collected from the statistical yearbook of Qinghai Province issued by the Bureau of statistics of Qinghai Province. The data set contains 10 data tables with different structures. For example, the data table in 1999 has five fields: Field 1: Region Field 2: total number of households Field 3: total population Field 4: birth population Field 5: death population
n
Somali Health and Demographic Survey 2020 - Somalia
microdata.nbs.gov.so
Updated Jul 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Somali National Bureau of Statistics (2023). Somali Health and Demographic Survey 2020 - Somalia [Dataset]. https://microdata.nbs.gov.so/index.php/catalog/50
Explore at:
Dataset updated
Jul 21, 2023
Dataset authored and provided by
Somali National Bureau of Statistics
Time period covered
2018 - 2019
Area covered
Somalia
Description
Abstract

The SHDS is a national sample survey designed to provide information on population, birth spacing, reproductive health, nutrition, maternal and child health, child survival, HIV/AIDS and sexually transmitted infections (STIs), in Somalia.. The main objective of the SHDS was to provide evidence on the health and demographic characteristics of the Somali population that will guide the development of programmes and formulation of effective policies. This information would also help monitor and evaluate national, sub-national and sector development plans, including the Sustainable Development Goals (SDGs), both by the government and development partners. The target population for SHDS was the women between 15 and 49 years of age, and the children less than the age of 5 years

Geographic coverage

The SHDS 2020 was a nationally representative household survey.

Analysis unit

The unit analysis of this survey are households, women aged 15-49 and children aged 0-5

Universe

This sample survey covered Women aged 15-49 and Children aged 0-5 years.

Kind of data

Sample survey data [ssd]

Sampling procedure

Sample Design The sample for the SHDS was designed to provide estimates of key indicators for the country as a whole, for each of the eighteen pre-war geographical regions, which are the country's first-level administrative divisions, as well as separately for urban, rural and nomadic areas. With the exception of Banadir region, which is considered fully urban, each region was stratified into urban, rural and nomadic areas, yielding a total of 55 sampling strata. All three strata of Lower Shabelle and Middle Juba regions, as well as the rural and nomadic strata of Bay region, were completely excluded from the survey due to security reasons. A final total of 47 sampling strata formed the sampling frame. Through the use of up-to-date, high-resolution satellite imagery, as well as on-the-ground knowledge of staff from the respective ministries of planning, all dwelling structures were digitized in urban and rural areas. Enumeration Areas (EAs) were formed onscreen through a spatial count of dwelling structures in a Geographic Information System (GIS) software. Thereafter, a sample ground verification of the digitized structures was carried out for large urban and rural areas and necessary adjustments made to the frame.

Each EA created had a minimum of 50 and a maximum of 149 dwelling structures. A total of 10,525 EAs were digitized: 7,488 in urban areas and 3,037 in rural areas. However, because of security and accessibility constraints, not all digitized areas were included in the final sampling frame-9,136 EAs (7,308 in urban and 1,828 in rural) formed the final frame. The nomadic frame comprised an updated list of temporary nomadic settlements (TNS) obtained from the nomadic link workers who are tied to these settlements. A total of 2,521 TNS formed the SHDS nomadic sampling frame. The SHDS followed a three-stage stratified cluster sample design in urban and rural strata with a probability proportional to size, for the sampling of Primary Sampling Units (PSU) and Secondary Sampling Units (SSU) (respectively at the first and second stage), and systematic sampling of households at the third stage. For the nomadic stratum, a two-stage stratified cluster sample design was applied with a probability proportional to size for sampling of PSUs at the first stage and systematic sampling of households at the second stage. To ensure that the survey precision is comparable across regions, PSUs were allocated equally to all regions with slight adjustments in two regions. Within each stratum, a sample of 35 EAs was selected independently, with probability proportional to the number of digitized dwelling structures. In this first stage, a total of 1,433 EAs were allocated (to urban - 770 EAs, rural - 488 EAs, and nomadic - 175 EAs) representing about 16 percent of the total frame of EAs. In the urban and rural selected EAs, all households were listed and information on births and deaths was recorded through the maternal mortality questionnaire. The data collected in this first phase was cleaned and a summary of households listed per EA formed the sampling frames for the second phase. In the second stage, 10 EAs were sampled out of the possible 35 that were listed, using probability proportional to the number of households. All households in each of these 10 EAs were serialized based on their location in the EA and 30 of these households sampled for the survey. The serialization was done to ensure distribution of the households interviewed for the survey in the EA sampled. A total of 220 EAs and 150 EAs were allocated to urban and rural strata respectively, while in the third stage, an average of 30 households were selected from the listed households in every EA to yield a total of 16,360 households from 538 EAs covered (220 EAs in urban, 147 EAs in rural and 171 EAs in nomadic) out of the sampled 545 EAs. In nomadic areas, a sample of 10 EAs (in this case TNS) were selected from each nomadic stratum, with probability proportional to the number of estimated households. A complete listing of households was carried out in the selected TNS followed by the selection of 30 households for the main survey interview. In those TNS with less than 30 households, all households were interviewed for the main survey. All eligible ever-married women aged 12 to 49 and never-married women aged 15 to 49 were interviewed in the selected households, while the household questionnaire was administered to all households selected. The maternal mortality questionnaire was administered to all households in each sampled TNS.

Mode of data collection

Face-to-face [f2f]

Response rate

A total of 16,360 households were selected for the sample, of which 15,870 were occupied. Of the occupied households, 15,826 were successfully interviewed, yielding a response rate of 99.7 percent. The SHDS 2020 interviewed 16,486 women-11,876 ever-married women and 4,610 never-married women.

Sampling error estimates

Sampling errors are important data quality parameters which give measure of the precision of the survey estimates. They aid in determining the statistical reliability of survey estimates. The estimates from a sample survey are affected by two types of errors: non-sampling errors and sampling errors. Non-sampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the Somaliland Health and Demographic Survey ( SHDS 2020) to minimise this type of error, non-sampling errors are impossible to avoid and difficult to evaluate statistically. Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the SHDS 2020 is only one of many samples that could have been selected from the same population, using the same design and sample size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results. Sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95% of all possible samples of identical size and design. If the sample of respondents had been selected by simple random sampling, it would have been possible to use straightforward formulas for calculating sampling errors. However, the SHDS 2020 sample was the result of a multi-stage stratified design, and, consequently, it was necessary to use more complex formulas. The variance approximation procedure that account for the complex sample design used R program was estimated sampling errors in SHDS which is Taylor series linearization. The non-linear estimates are approximated by linear ones for estimating variance. The linear approximation is derived by taking the first-order Tylor series approximation. Standard variance estimation methods for linear statistics are then used to estimate the variance of the linearized estimator. The Taylor linearisation method treats any linear statistic such as a percentage or mean as a ratio estimate, r = y/x, where y represents the total sample value for variable y and x represents the total number of cases in the group or subgroup under consideration

Data appraisal

Household age distribution

Age distribution of eligible and interviewed women

Pregnancy- related mortality trends Note: See detailed data quality tables in APPENDIX C of the report.
T
Proportion of urban population in different regions of China (2010-2018)
data.tpdc.ac.cn
tpdc.ac.cn
zip
Updated Mar 25, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Provincial Qinghai (2021). Proportion of urban population in different regions of China (2010-2018) [Dataset]. https://data.tpdc.ac.cn/en/data/cb5e436b-99b3-420a-9d52-057347cfe975
Explore at:
zipAvailable download formats
Dataset updated
Mar 25, 2021
Dataset provided by
TPDC
Authors
Provincial Qinghai
Area covered

Description
This data set records the statistical data of the proportion of urban population in various regions of China (2010-2018), which is divided by year. The data are collected from the statistical yearbook of Qinghai Province issued by the Bureau of statistics of Qinghai Province. The data set consists of three data tables Proportion of urban population in different regions of China (2010-2016). Xls Proportion of urban population in different regions of China (2011-2017). Xls The proportion of urban population in all regions of China (2011-2018). XLS, the data table structure is the same. For example, the data table in 2018 has two fields: Field 1: year Field 2: Region
r
Population by age, citizenship, gender and governorate (2022)
opendata.rcrc.gov.sa
csv, excel, json
Updated May 6, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Population by age, citizenship, gender and governorate (2022) [Dataset]. https://opendata.rcrc.gov.sa/explore/dataset/population-by-age-citizenship-gender-and-governorate-2022/
Explore at:
csv, json, excelAvailable download formats
Dataset updated
May 6, 2025
License
https://open.data.gov.sa/odp-public/static/en/assets/Open_Data_License_En.pdfhttps://open.data.gov.sa/odp-public/static/en/assets/Open_Data_License_En.pdf
Description
Population (number of individuals) with a focus on Ar Riyadh Region, Ar Diriyah and Ar Riyadh governorates compared to the other governorates of Ar Riyadh Region, the other governorates in the Kingdom, and the Kingdom as a whole in the year 2022.1.MethodologyThe population data are taken from Gastat and aggregated at three levels:The Administrative Region,The Governorate,The City.The City level is not represented, as data is always aggregated at most to the Governorate level. This is due to the heterogeneity of availability of the information at the City level. Further, this dataset presents a breakdown for Ar Riyadh and Ad-Diriyah at the governorate level. Similarly, a breakdown exists at the Regional level, distinguishing Ar Riaydh and the other Regions. This presentation highlights the importance of Riyadh at both the governorate and region levels in the Kingdom.At the Administrative Region level,For Ar Riyadh Region, aggregation is provided at the Governorate Region for Ad Diriyah, Ar Riyadh, and the remaining governorates of Ar Riyadh Region combined under the label “Others (not Diri. And Riy.).For the regions other than Ar Riyadh Region, all governorates are aggregated together,Finally, the aggregate for the KSA "Total (KSA Govs.)" is provided in full for Ad Diriyah, Ar Riyadh, and all governorates other than Ad Diriyah and Ar Riyadh "Others (not Diri. and Riy.)", for all governorates in the Kingdom excluding Ar Riyadh Regions' governorates "Total (Govs. not Ar Riyadh Region)".2.Definition(s)Population: All individuals residing within the Kingdom's territory at a given date, including both Saudi citizens and permanent/temporary non-Saudi residents. Source: https://www.stats.gov.sa/en/term-details?id=2583312).Administrative Region: the 13 administrative regions of the Kingdom, which are administered by a government body directly affiliated with the Ministry of Interior (e.g. Ar Riyadh, Makkah). Every administrative region has a designated capital city. Governorate: The second level of administrative division within the Kingdom. Each region is subdivided into several governorates, varying in number from one region to another. Governorates are further divided into centres that report administratively to the governorate or emirate. For example, Al-Kharj Governorate within the Riyadh Region.3.Detailed breakdownRegion code: An identifier of the Region, not official (visible only in the downloadable version of the dataset).Region: The first level of administrative division within the Kingdom. After aggregation, ‘Ar Riaydh’, ‘Others (not Ar Riyadh Region)’ (aggregating every case where region is not Riyadh), Total (KSA Regions) (aggregating all regions) remains.Governorate: The second level of administrative division within the Kingdom. After aggregation, Ad Diriyah, Ar Riyadh (as a Governorate) and Others (not Diri. and Riy.) (for all the other governorates in Ar Riyadh Region) remains. The three together are forming Ar Riyadh Region. Outside Ar Riyadh Region, all governorates in a given Region have been aggregated, providing a sub-total which is finally collected in Total (Govs. not Ar Riyadh Region). Finally, Total (KSA Govs.) aggregating all governorates within the Kingdom.Citizenship: Saudi and Non-Saudi population.Gender: Male and femaleAge in 10-year groups: Age categorised into 10-year intervals, ranging from 0 to 90 years old and above.Age in 5-year groups: Age categorised into 5-year intervals, ranging from 0 to 100 years old and above.Age: Age recorded in individual year.Comments: The Open Data Team comments on the metadata publishedDsetIdx: >>182.
d
Dataset for: Psychosis Proneness: A Neglected Personality Correlate of...
demo-b2find.dkrz.de
Updated Sep 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Dataset for: Psychosis Proneness: A Neglected Personality Correlate of Right-Wing Authoritarianism and Prejudice - Dataset - B2FIND [Dataset]. http://demo-b2find.dkrz.de/dataset/d9461896-8977-5828-a517-fa713c240fec
Explore at:
Dataset updated
Sep 22, 2025
Description
The goal of the study is to investigate the relationship between the HEXACO personality model and Disintegration—representing a broad spectrum of psychotic-like experiences and behavioral tendencies (Perceptual Distortions, General Executive/Cognitive Impairment, Enhanced Awareness, Paranoia, Mania, Flattened Affect, Apathy/Depression, Somatoform Dysregulation, and Magical Thinking) that are reconceptualized as a personality trait. In this preregistered study, we predicted that the Disintegration factor would separate from HEXACO. The replicability of the factorial structures of HEXACO and Disintegration subcomponents is investigated across the three national samples (UK, Germany, and Serbia), matched on key socio-demographic variables. Exploratory Structural Equation Modeling (ESEM) is used to study the invariance of the hypothesized seven-factor structure (six HEXACO plus Disintegration). Support for the metric invariance of the seven-factor structure based on HEXACO and Disintegration subcomponents/facets across the three nations was found. The Disintegration factor lied outside the HEXACO personality space with each of its nine subcomponents. The Disintegration factor appeared to be among the most coherent and replicable of the seven across the samples and units of measurement (facets and items). A broad spectrum of psychotic-like experiences/behavioral tendencies relevant in understanding and explaining many aspects of everyday and long-term (mal)adaptations is not captured by the HEXACO model. Dataset for: Knežević, G., Lazarević, L. B., Bosnjak, M., & Keller, J. (2022). Proneness to psychotic-like experiences as a basic personality trait complementing the HEXACO model—A preregistered cross-national study. Personality and Mental Health, 1– 19. https://doi.org/10.1002/pmh.1537
The table shows the possible transitions out of a given state (kab, ka, kb,...
plos.figshare.com
bin
Updated Jun 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gautam Upadhya; Matthias Steinrücken (2023). The table shows the possible transitions out of a given state (kab, ka, kb, κ) and their respective rates. [Dataset]. http://doi.org/10.1371/journal.pcbi.1010419.t001
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1010419.t001
Dataset updated
Jun 16, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Gautam Upadhya; Matthias Steinrücken
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The first row gives the rate for coalescence between two lineages that are ancestral to both loci. The second row gives rate for two types of events, coalescences between two lineages ancestral to only locus a, and coalescences of a lineage ancestral only to a with a lineage ancestral to both. The third row reflects similar events for locus b. The last row gives the rate of recombination events. Note that these rates are defined to permit a maximum of 1 ancestral recombination event occurring between locus a and b.
r
Female population in childbearing age by age group and governorate (2022)
opendata.rcrc.gov.sa
csv, excel, json
Updated May 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Female population in childbearing age by age group and governorate (2022) [Dataset]. https://opendata.rcrc.gov.sa/explore/dataset/female-population-in-childbearing-age-by-age-group-for-1000-women-2022/
Explore at:
csv, json, excelAvailable download formats
Dataset updated
May 6, 2025
License
https://open.data.gov.sa/odp-public/static/en/assets/Open_Data_License_En.pdfhttps://open.data.gov.sa/odp-public/static/en/assets/Open_Data_License_En.pdf
Description
Number of women of reproductive age (15–49 years) categorised by birth status -whether they gave birth or not- during the 12 months preceding the census, with a focus on Ar Riyadh Region, Ar Diriyah and Ar Riyadh governorates compared to the other governorates of Ar Riyadh Region, the other governorates in the Kingdom, and the Kingdom as a whole in the year 2022.1.MethodologyData are taken from Gastat and aggregated at three levels:The Administrative Region,The Governorate,The City.The City level is not represented, as data is always aggregated at most to the Governorate level. This is due to the heterogeneity of availability of the information at the City level. Further, this dataset presents a breakdown for Ar-Riyadh and Ad-Diriyah at the governorate level. Similarly, a breakdown exists at the Regional level, distinguishing Ar-Riaydh and the other Regions. This presentation highlights the importance of Riyadh at both the governorate and region levelst in the Kingdom.At the Administrative Region level,For Ar-Riyadh Region, aggregation is provided at the Governorate Region for Ad Diriyah, Ar Riyadh, and the remaining governorates of Ar Riyadh Region combined under the label “Others (not Diri. And Riy.).For the regions other than Ar-Riyadh Region, all governorates are aggregated together,Finally, the aggregate for the KSA "Total (KSA Govs.)" is provided in full for Ad Diriyah, Ar Riyadh, and all governorates other than Ad Diriyah and Ar Riyadh "Others (not Diri. and Riy.)", for all governorates in the Kingdom excluding Ar Riyadh Regions' governorates "Total (Govs. not Ar Riyadh Region)".2.Definition(s)Population: All individuals residing within the Kingdom's territory at a given date, including both Saudi citizens and permanent/temporary non-Saudi residents. Source: https://www.stats.gov.sa/en/term-details?id=2583312).Administrative Region: the 13 administrative regions of the Kingdom, which are administered by a government body directly affiliated with the Ministry of Interior (e.g. Ar Riyadh, Makkah). Every administrative region has a designated capital city. Governorate: The second level of administrative division within the Kingdom. Each region is subdivided into several governorates, varying in number from one region to another. Governorates are further divided into centres that report administratively to the governorate or emirate. For example, Al-Kharj Governorate within the Riyadh Region.3.Detailed breakdownRegion code: An identifier of the Region, not official (visible only in the downloadable version of the dataset).Region: The first level of administrative division within the Kingdom. After aggregation, ‘Ar Riaydh’, ‘Others (not Ar Riyadh Region)’ (aggregating every case where region is not Riyadh), Total (KSA Regions) (aggregating all regions) remains.Governorate: The second level of administrative division within the Kingdom. After aggregation, Ad Diriyah, Ar Riyadh (as a Governorate) and Others (not Diri. and Riy.) (for all the other governorates in Ar Riyadh Region) remains. The three together are forming Ar-Riyadh Region. Outside Ar Riyadh Region, all governorates in a given Region have been aggregated, providing a sub-total which is finally collected in Total (Govs. not Ar Riyadh Region). Finally, Total (KSA Govs.) aggregating all governorates within the Kingdom.Citizenship: Saudi and Non-Saudi population.Mother's age group: A demographic category that segments mothers by age at the time of birth, limited to the reproductive age range of 15 to 49 years. The age groups are divided into 5-year intervals.Gave birth in the last 12 months: Indicates the birth status during the 12 months preceding the census (Y- gave birth or N- did not give birth).Comments: The Open Data Team comments on the metadata publishedDsetIdx: >>186.
Summary of the performance and features of the different methods compared in...
plos.figshare.com
bin
Updated Jun 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gautam Upadhya; Matthias Steinrücken (2023). Summary of the performance and features of the different methods compared in our simulation study. [Dataset]. http://doi.org/10.1371/journal.pcbi.1010419.t004
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1010419.t004
Dataset updated
Jun 16, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Gautam Upadhya; Matthias Steinrücken
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The ranges are given in generations before present.
f
Table_1_Resting Energy Expenditure Prediction Equations in the Pediatric...
datasetcatalog.nlm.nih.gov
Updated Dec 6, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fuentes-Servín, Jimena; Medina-Vera, Isabel; Avila-Nava, Azalia; Del Carmen Servín-Rodas, María; Pérez-González, Oscar A.; González-Salazar, Luis E.; Guevara-Cruz, Martha; Serralde-Zuñiga, Aurora E. (2021). Table_1_Resting Energy Expenditure Prediction Equations in the Pediatric Population: A Systematic Review.docx [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000739244
Explore at:
Dataset updated
Dec 6, 2021
Authors
Fuentes-Servín, Jimena; Medina-Vera, Isabel; Avila-Nava, Azalia; Del Carmen Servín-Rodas, María; Pérez-González, Oscar A.; González-Salazar, Luis E.; Guevara-Cruz, Martha; Serralde-Zuñiga, Aurora E.
Description
Background and Aims: The determination of energy requirements is necessary to promote adequate growth and nutritional status in pediatric populations. Currently, several predictive equations have been designed and modified to estimate energy expenditure at rest. Our objectives were (1) to identify the equations designed for energy expenditure prediction and (2) to identify the anthropometric and demographic variables used in the design of the equations for pediatric patients who are healthy and have illness.Methods: A systematic search in the Medline/PubMed, EMBASE and LILACS databases for observational studies published up to January 2021 that reported the design of predictive equations to estimate basal or resting energy expenditure in pediatric populations was carried out. Studies were excluded if the study population included athletes, adult patients, or any patients taking medications that altered energy expenditure. Risk of bias was assessed using the Quality Assessment Tool for Observational Cohort and Cross-Sectional Studies.Results: Of the 769 studies identified in the search, 39 met the inclusion criteria and were analyzed. Predictive equations were established for three pediatric populations: those who were healthy (n = 8), those who had overweight or obesity (n = 17), and those with a specific clinical situation (n = 14). In the healthy pediatric population, the FAO/WHO and Schofield equations had the highest R2 values, while in the population with obesity, the Molnár and Dietz equations had the highest R2 values for both boys and girls.Conclusions: Many different predictive equations for energy expenditure in pediatric patients have been published. This review is a compendium of most of these equations; this information will enable clinicians to critically evaluate their use in clinical practice.Systematic Review Registration:https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=226270, PROSPERO [CRD42021226270].
Data from: Reforming Public Child Welfare in Indiana, 2007-2009
catalog.data.gov
icpsr.umich.edu
Updated Nov 14, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office of Juvenile Justice and Delinquency Prevention (2025). Reforming Public Child Welfare in Indiana, 2007-2009 [Dataset]. https://catalog.data.gov/dataset/reforming-public-child-welfare-in-indiana-2007-2009-1b759
Explore at:
Dataset updated
Nov 14, 2025
Dataset provided by
Office of Juvenile Justice and Delinquency Preventionhttp://ojjdp.gov/
Area covered
Indiana
Description
The study of Indiana's Child Welfare reform was designed to identify community professionals' perceptions of the Department of Child Services (DCS) following the release of a pilot program to reform child welfare in the state of Indiana. In December, 2005, the pilot project was officially rolled out in three regions of the state. The three chosen regions of the state included 11 county agencies with both urban and rural population centers. Together these regions represented 28% of the state's CHINS (Child In Need of Service) population and 20% of the child fatalities for 2004. This study represents data collected to identify perceptions of the DCS by sending a survey to professionals in the 11 pilot and 12 comparison counties. The survey questions were arranged by categories of safety, permanency, well-being, DCS goals, the reform, team meetings, and demographics. Nine separate instruments were developed and disseminated for each community group. The community professionals surveyed included: Court Appointed Special Advocates (CASAs), foster parents, judges, Law Enforcement Agencies (LEAs), medical and public health professionals, schools, social service professionals, and mental health professionals. Survey instruments were tailored to each audience, with questions that were derived from the DCS "Framework for Individualized Needs-Based Child Welfare Service Provisions," which outlined the agency's core practice values and principles.

Facebook

Twitter

Click to copy link

Link copied

Cite

Office for National Statistics (2025). Population estimates time series dataset [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/populationandmigration/populationestimates/datasets/populationestimatestimeseriesdataset

Population estimates time series dataset

Explore at:

124 scholarly articles cite this dataset (View in Google Scholar)

csv, xlsxAvailable download formats

Dataset updated

Nov 27, 2025

Dataset provided by

Office for National Statisticshttp://www.ons.gov.uk/

License

Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically

Description

The mid-year estimates refer to the population on 30 June of the reference year and are produced in line with the standard United Nations (UN) definition for population estimates. They are the official set of population estimates for the UK and its constituent countries, the regions and counties of England, and local authorities and their equivalents.

Clear search

Close search

Google apps

Main menu

Population estimates time series dataset

Demographic balances and indicators by type of projection and NUTS 3 region

New_Zealand_Births_and_Deaths_by_Region

Dataset Overview

Content

Columns

Potential Use Cases

Related Work

Data Source

Inspiration

Assumptions for probability of dying by age, sex and type of projection

World Development Indicators

Content & Context:

It is ideal for:

Example Usage:

Notes:

Columns:

Wikipedia Dataset

Population on 1st January by age, sex, type of projection and NUTS 3 region

Short-term update of the projected population (2022-2032)

Dataset on cloud-enabled storage services

LAU1 dataset

Population changes in different regions of Qinghai Province (1998-2010)

Somali Health and Demographic Survey 2020 - Somalia

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Response rate

Sampling error estimates

Data appraisal

Proportion of urban population in different regions of China (2010-2018)

Population by age, citizenship, gender and governorate (2022)

Dataset for: Psychosis Proneness: A Neglected Personality Correlate of...

The table shows the possible transitions out of a given state (kab, ka, kb,...

Female population in childbearing age by age group and governorate (2022)

Summary of the performance and features of the different methods compared in...

Table_1_Resting Energy Expenditure Prediction Equations in the Pediatric...

Data from: Reforming Public Child Welfare in Indiana, 2007-2009

Population estimates time series dataset