100+ datasets found

World population by age and region 2024
statista.com
Updated Mar 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). World population by age and region 2024 [Dataset]. https://www.statista.com/statistics/265759/world-population-by-age-and-region/
Explore at:
Dataset updated
Mar 11, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
World
Description
Globally, about 25 percent of the population is under 15 years of age and 10 percent is over 65 years of age. Africa has the youngest population worldwide. In Sub-Saharan Africa, more than 40 percent of the population is below 15 years, and only three percent are above 65, indicating the low life expectancy in several of the countries. In Europe, on the other hand, a higher share of the population is above 65 years than the population under 15 years. Fertility rates The high share of children and youth in Africa is connected to the high fertility rates on the continent. For instance, South Sudan and Niger have the highest population growth rates globally. However, about 50 percent of the world’s population live in countries with low fertility, where women have less than 2.1 children. Some countries in Europe, like Latvia and Lithuania, have experienced a population decline of one percent, and in the Cook Islands, it is even above two percent. In Europe, the majority of the population was previously working-aged adults with few dependents, but this trend is expected to reverse soon, and it is predicted that by 2050, the older population will outnumber the young in many developed countries. Growing global population As of 2025, there are 8.1 billion people living on the planet, and this is expected to reach more than nine billion before 2040. Moreover, the global population is expected to reach 10 billions around 2060, before slowing and then even falling slightly by 2100. As the population growth rates indicate, a significant share of the population increase will happen in Africa.
Amount of data created, consumed, and stored 2010-2023, with forecasts to...
statista.com
Updated Nov 21, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Amount of data created, consumed, and stored 2010-2023, with forecasts to 2028 [Dataset]. https://www.statista.com/statistics/871513/worldwide-data-created/
Explore at:
Dataset updated
Nov 21, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
May 2024
Area covered
Worldwide
Description
The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly, reaching 149 zettabytes in 2024. Over the next five years up to 2028, global data creation is projected to grow to more than 394 zettabytes. In 2020, the amount of data created and replicated reached a new high. The growth was higher than previously expected, caused by the increased demand due to the COVID-19 pandemic, as more people worked and learned from home and used home entertainment options more often. Storage capacity also growing Only a small percentage of this newly created data is kept though, as just two percent of the data produced and consumed in 2020 was saved and retained into 2021. In line with the strong growth of the data volume, the installed base of storage capacity is forecast to increase, growing at a compound annual growth rate of 19.2 percent over the forecast period from 2020 to 2025. In 2020, the installed base of storage capacity reached 6.7 zettabytes.
a
Indicator 17.19.2: Proportion of countries with birth registration data that...
sdgs.amerigeoss.org
arc-gis-hub-home-arcgishub.hub.arcgis.com
+1more
Updated Aug 17, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UN DESA Statistics Division (2020). Indicator 17.19.2: Proportion of countries with birth registration data that are at least 90 percent complete (percent) [Dataset]. https://sdgs.amerigeoss.org/datasets/eb124a11783d464ebf99b4ba0f44d2f6
Explore at:
Dataset updated
Aug 17, 2020
Dataset authored and provided by
UN DESA Statistics Division
Area covered
Description
Series Name: Proportion of countries with birth registration data that are at least 90 percent complete (percent)Series Code: SG_REG_BRTH90Release Version: 2020.Q2.G.03 This dataset is the part of the Global SDG Indicator Database compiled through the UN System in preparation for the Secretary-General's annual report on Progress towards the Sustainable Development Goals.Indicator 17.19.2: Proportion of countries that (a) have conducted at least one population and housing census in the last 10 years; and (b) have achieved 100 per cent birth registration and 80 per cent death registrationTarget 17.19: By 2030, build on existing initiatives to develop measurements of progress on sustainable development that complement gross domestic product, and support statistical capacity-building in developing countriesGoal 17: Strengthen the means of implementation and revitalize the Global Partnership for Sustainable DevelopmentFor more information on the compilation methodology of this dataset, see https://unstats.un.org/sdgs/metadata/
Enterprise Survey 2009-2014, Panel Data - Malawi
microdata.worldbank.org
catalog.ihsn.org
Updated Oct 7, 2015
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Bank (2015). Enterprise Survey 2009-2014, Panel Data - Malawi [Dataset]. https://microdata.worldbank.org/index.php/catalog/2360
Explore at:
Dataset updated
Oct 7, 2015
Dataset authored and provided by
World Bankhttp://worldbank.org/
Time period covered
2009 - 2014
Area covered
Malawi
Description
Abstract

The documented dataset covers Enterprise Survey (ES) panel data collected in Malawi in 2009 and 2014, as part of Africa Enterprise Surveys roll-out, an initiative of the World Bank.

New Enterprise Surveys target a sample consisting of longitudinal (panel) observations and new cross-sectional data. Panel firms are prioritized in the sample selection, comprising up to 50% of the sample in the current wave. For all panel firms, regardless of the sample, current eligibility or operating status is determined and included in panel datasets.

Malawi ES 2014 was conducted between April 2014 and February 2015, Malawi ES 2009 was carried out in May - July 2009. The objective of the Enterprise Survey is to obtain feedback from enterprises on the state of the private sector as well as to help in building a panel of enterprise data that will make it possible to track changes in the business environment over time, thus allowing, for example, impact assessments of reforms. Through interviews with firms in the manufacturing and services sectors, the survey assesses the constraints to private sector growth and creates statistically significant business environment indicators that are comparable across countries.

Stratified random sampling was used to select the surveyed businesses. The data was collected using face-to-face interviews.

Data from 673 establishments was analyzed: 436 businesses were from 2014 ES only, 63 - from 2009 ES only, and 174 firms were from both 2009 and 2014 panels.

The standard Enterprise Survey topics include firm characteristics, gender participation, access to finance, annual sales, costs of inputs and labor, workforce composition, bribery, licensing, infrastructure, trade, crime, competition, capacity utilization, land and permits, taxation, informality, business-government relations, innovation and technology, and performance measures. Over 90 percent of the questions objectively measure characteristics of a country’s business environment. The remaining questions assess the survey respondents’ opinions on what are the obstacles to firm growth and performance.

Geographic coverage

National

Analysis unit

The primary sampling unit of the study is an establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

Universe

The whole population, or the universe, covered in the Enterprise Surveys is the non-agricultural private economy. It comprises: all manufacturing sectors according to the ISIC Revision 3.1 group classification (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this population definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except sub-sector 72, IT, which was added to the population under study), and all public or utilities sectors. Companies with 100% government ownership are not eligible to participate in the Enterprise Surveys.

Kind of data

Sample survey data [ssd]

Sampling procedure

For the Malawi ES, multiple sample frames were used: a sample frame was built using data compiled from local and municipal business registries. Due to the fact that the previous round of surveys utilized different stratification criteria in the 2009 survey sample, the presence of panel firms was limited to a maximum of 50% of the achieved interviews in each stratum. That sample is referred to as the panel.

Mode of data collection

Face-to-face [f2f]

Research instrument

The following survey instruments were used for Malawi ES 2009 and 2014: - Manufacturing Module Questionnaire - Services Module Questionnaire

The survey is fielded via manufacturing or services questionnaires in order not to ask questions that are irrelevant to specific types of firms, e.g. a question that relates to production and nonproduction workers should not be asked of a retail firm. In addition to questions that are asked across countries, all surveys are customized and contain country-specific questions. An example of customization would be including tourism-related questions that are asked in certain countries when tourism is an existing or potential sector of economic growth. There is a skip pattern in the Service Module Questionnaire for questions that apply only to retail firms.

Cleaning operations

Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.

Response rate

Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether whereas the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.

Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect "Refusal to respond" (-8) as a different option from "Don't know" (-9). b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary.

Survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals.
N
Blue Earth County, MN Population Pyramid Dataset: Age Groups, Male and...
neilsberg.com
csv, json
Updated Jul 24, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2024). Blue Earth County, MN Population Pyramid Dataset: Age Groups, Male and Female Population, and Total Population for Demographics Analysis // 2024 Edition [Dataset]. https://www.neilsberg.com/research/datasets/f0113b0d-4983-11ef-ae5d-3860777c1fe6/
Explore at:
json, csvAvailable download formats
Dataset updated
Jul 24, 2024
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Minnesota, Blue Earth County
Variables measured
Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Total Population for Age Groups, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, and 9 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates. To measure the three variables, namely (a) male population, (b) female population and (b) total population, we initially analyzed and categorized the data for each of the age groups. For age groups we divided it into roughly a 5 year bucket for ages between 0 and 85. For over 85, we aggregated data into a single group for all ages. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the data for the Blue Earth County, MN population pyramid, which represents the Blue Earth County population distribution across age and gender, using estimates from the U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates. It lists the male and female population for each age group, along with the total population for those age groups. Higher numbers at the bottom of the table suggest population growth, whereas higher numbers at the top indicate declining birth rates. Furthermore, the dataset can be utilized to understand the youth dependency ratio, old-age dependency ratio, total dependency ratio, and potential support ratio.

Key observations

Youth dependency ratio, which is the number of children aged 0-14 per 100 persons aged 15-64, for Blue Earth County, MN, is 23.6.

Old-age dependency ratio, which is the number of persons aged 65 or over per 100 persons aged 15-64, for Blue Earth County, MN, is 20.9.

Total dependency ratio for Blue Earth County, MN is 44.5.

Potential support ratio, which is the number of youth (working age population) per elderly, for Blue Earth County, MN is 4.8.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2018-2022 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Variables / Data Columns

Age Group: This column displays the age group for the Blue Earth County population analysis. Total expected values are 18 and are define above in the age groups section.

Population (Male): The male population in the Blue Earth County for the selected age group is shown in the following column.

Population (Female): The female population in the Blue Earth County for the selected age group is shown in the following column.

Total Population: The total population of the Blue Earth County for the selected age group is shown in the following column.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Blue Earth County Population by Age. You can refer the same here
e
World - Regulatory Indicators for Sustainable Energy - Dataset -...
energydata.info
Updated Sep 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). World - Regulatory Indicators for Sustainable Energy - Dataset - ENERGYDATA.INFO [Dataset]. https://energydata.info/dataset/world-regulatory-indicators-sustainable-energy-2016
Explore at:
Dataset updated
Sep 30, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
World
Description
Regulatory Indicators for Sustainable Energy (RISE) is a comprehensive policy scorecard assessing the investment climate for sustainable energy and focusing on three key areas: energy access, energy efficiency and renewable energy. RISE covers 111 countries across the developed and developing worlds, which together represent over 90% of global population, GDP and energy consumption. With 28 indicators, 85 sub-indicators and 158 data points per country, RISE helps policy makers to understand how they are doing, compare across countries, learn from peer groups, and identify priority actions for the future. The source data and documents for 111 countries are available at http://rise.worldbank.org/library To learn more, please visit http://rise.worldbank.org/
Enterprise Survey 2009-2017, Panel Data - Liberia
microdata.worldbank.org
catalog.ihsn.org
+1more
Updated Jun 15, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Bank (2018). Enterprise Survey 2009-2017, Panel Data - Liberia [Dataset]. https://microdata.worldbank.org/index.php/catalog/3027
Explore at:
Dataset updated
Jun 15, 2018
Dataset provided by
World Bankhttp://worldbank.org/
Liberia Institute for Statistics and Geo-Information Services
Time period covered
2009 - 2017
Area covered
Liberia
Description
Abstract

The documented dataset covers Enterprise Survey (ES) panel data collected in Liberia in 2009 and 2017, as part of the Enterprise Survey initiative of the World Bank. An Indicator Survey is similar to an Enterprise Survey; it is implemented for smaller economies where the sampling strategies inherent in an Enterprise Survey are often not applicable due to the limited universe of firms.

The objective of the 2009-2017 Enterprise Survey is to obtain feedback from enterprises in client countries on the state of the private sector as well as to build a panel of enterprise data that will make it possible to track changes in the business environment over time and allow, for example, impact assessments of reforms. Through interviews with firms in the manufacturing and services sectors, the Indicator Survey data provides information on the constraints to private sector growth and is used to create statistically significant business environment indicators that are comparable across countries.

As part of its strategic goal of building a climate for investment, job creation, and sustainable growth, the World Bank has promoted improving the business environment as a key strategy for development, which has led to a systematic effort in collecting enterprise data across countries. The Enterprise Surveys (ES) are an ongoing World Bank project in collecting both objective data based on firms' experiences and enterprises' perception of the environment in which they operate.

Geographic coverage

National

Analysis unit

The primary sampling unit of the study is the establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

Universe

The whole population, or the universe, covered in the Enterprise Surveys is the non-agricultural economy. It comprises: all manufacturing sectors according to the ISIC Revision 3.1 group classification (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this population definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except sub-sector 72, IT, which was added to the population under study), and all public or utilities sectors.

Kind of data

Sample survey data [ssd]

Sampling procedure

The sample for the 2009-2017 Liberia Enterprise Survey (ES) was selected using stratified random sampling, following the methodology explained in the Sampling Note. Stratified random was preferred over simple random sampling for several reasons: - To obtain unbiased estimates for different subdivisions of the population with some known level of precision. - To obtain unbiased estimates for the whole population. The whole population, or universe of the study, is the non-agricultural economy. It comprises: all manufacturing sectors according to the group classification of ISIC Revision 3.1: (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except subsector 72, IT, which was added to the population under study), and all public or utilities sectors.

To make sure that the final total sample includes establishments from all different sectors and that it is not concentrated in one or two of industries/sizes/regions.

To exploit the benefits of stratified sampling where population estimates, in most cases, will be more precise than using a simple random sampling method (i.e., lower standard errors, other things being equal.)

Stratification may produce a smaller bound on the error of estimation than would be produced by a simple random sample of the same size. This result is particularly true if measurements within strata are homogeneous.

The cost per observation in the survey may be reduced by stratification of the population elements into convenient groupings.

Three levels of stratification were used in this country: industry, establishment size, and region. Industry stratification was designed as follows: the universe was stratified as into manufacturing and services industries. Manufacturing (ISIC Rev. 3.1 codes 15 - 37), and Services (ISIC codes 45, 50-52, 55, 60-64, and 72). For the Liberia ES, size stratification was defined as follows: small (5 to 19 employees), medium (20 to 99 employees), and large (100 or more employees).
Regional stratification for the Liberia ES was done across three regions: Montserrado, Margibi, and Nimba.

Mode of data collection

Face-to-face [f2f]

Research instrument

The current survey instruments are available: - Services and Manufacturing Questionnaire - Screener Questionnaire.

The standard Enterprise Survey topics include firm characteristics, gender participation, access to finance, annual sales, costs of inputs/labor, workforce composition, bribery, licensing, infrastructure, trade, crime, competition, capacity utilization, land and permits, taxation, informality, business-government relations, innovation and technology, and performance measures. Over 90% of the questions objectively ascertain characteristics of a country's business environment. The remaining questions assess the survey respondents' opinions on what are the obstacles to firm growth and performance.

Cleaning operations

Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.

Response rate

There was a high response rate especially as a result of positive attitude towards the international community in collaboration with the government in their reconstruction efforts after a period of civil strife.There was also very positive attitude towards World Bank initiatives.
k
Penn World Table 10.01
datasource.kapsarc.org
data.kapsarc.org
Updated Oct 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Penn World Table 10.01 [Dataset]. https://datasource.kapsarc.org/explore/dataset/penn-world-table-90/
Explore at:
Dataset updated
Oct 29, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Explore the Penn World Table dataset featuring key economic indicators like real GDP, population, human capital index, and more. Access detailed information and analysis for various countries.

Expenditure, GDP, PPP, output, Population, working hours, Index, Household, Consumption, Capital , IRR, prices

Albania, Algeria, Angola, Antigua and Barbuda, Argentina, Armenia, Australia, Austria, Azerbaijan, Bahamas, Bahrain, Bangladesh, Barbados, Belarus, Belgium, Belize, Benin, Bhutan, Bolivia, Bosnia and Herzegovina, Botswana, Brazil, Brunei, Bulgaria, Burkina Faso, Burundi, CÃƒÂ´te d'Ivoire, Cabo Verde, Cambodia, Cameroon, Canada, Central African Republic, Chad, Chile, China, Colombia, Comoros, Congo, Costa Rica, Croatia, Cyprus, Denmark, Djibouti, Dominica, Dominican Republic, Ecuador, Egypt, El Salvador, Equatorial Guinea, Estonia, Eswatini, Ethiopia, Fiji, Finland, France, Gabon, Gambia, Georgia, Germany, Ghana, Greece, Grenada, Guatemala, Guinea, Guinea-Bissau, Guyana, Haiti, Honduras, Hungary, Iceland, India, Indonesia, Iran, Iraq, Ireland, Israel, Italy, Jamaica, Japan, Jordan, Kazakhstan, Kenya, Kuwait, Kyrgyzstan, Latvia, Lebanon, Lesotho, Liberia, Lithuania, Luxembourg, Madagascar, Malawi, Malaysia, Maldives, Mali, Malta, Mauritania, Mauritius, Mexico, Moldova, Mongolia, Montenegro, Morocco, Mozambique, Myanmar, Namibia, Nepal, Netherlands, New Zealand, Nicaragua, Niger, Nigeria, North Macedonia, Norway, Oman, Pakistan, Panama, Paraguay, Peru, Philippines, Poland, Portugal, Qatar, Romania, Russia, Rwanda, Saint Kitts and Nevis, Saint Lucia, Sao Tome and Principe, Saudi Arabia, Senegal, Serbia, Seychelles, Sierra Leone, Singapore, Slovakia, Slovenia, South Africa, Spain, Sri Lanka, Sudan, Suriname, Sweden, Switzerland, Syria, Tajikistan, Tanzania, Thailand, Togo, Trinidad and Tobago, Tunisia, Turkey, Turkmenistan, Uganda, Ukraine, United Arab Emirates, United Kingdom, Uruguay, Uzbekistan, Venezuela, Yemen, Zambia, Zimbabwe, World Follow data.kapsarc.org for timely data to advance energy economics research. When using these data, please refer to the following paper:Feenstra, Robert C., Robert Inklaar and Marcel P. Timmer (2015), "The Next Generation of the Penn World Table" American Economic Review, 105(10), 3150-3182, available for download at www.ggdc.net/pwt
Enterprise Survey 2010 - Mexico
microdata.worldbank.org
dev.ihsn.org
+1more
Updated Sep 26, 2013
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Enterprise Survey 2010 - Mexico [Dataset]. https://microdata.worldbank.org/index.php/catalog/870
Explore at:
Dataset updated
Sep 26, 2013
Dataset authored and provided by
World Bankhttp://worldbank.org/
Time period covered
2010 - 2011
Area covered
Mexico
Description
Abstract

This research was conducted in Mexico between August 2010 and June 2011 as part of the Latin America and Caribbean (LAC) Enterprise Survey 2010, an initiative of the World Bank. Data from 1480 establishments was analyzed. Stratified random sampling was used to select the surveyed businesses.

The objective of the study is to obtain feedback from enterprises in client countries on the state of the private sector as well as to help in building a panel of enterprise data that will make it possible to track changes in the business environment over time, thus allowing, for example, impact assessments of reforms. Through face-to-face interviews with firms in the manufacturing and services sectors, the survey assesses the constraints to private sector growth and creates statistically significant business environment indicators that are comparable across countries.

The standard Enterprise Survey topics include firm characteristics, gender participation, access to finance, annual sales, costs of inputs/labor, workforce composition, bribery, licensing, infrastructure, trade, crime, competition, capacity utilization, land and permits, taxation, informality, business-government relations, innovation and technology, and performance measures. Over 90% of the questions objectively ascertain characteristics of a country’s business environment. The remaining questions assess the survey respondents’ opinions on what are the obstacles to firm growth and performance.

Geographic coverage

National

Analysis unit

The primary sampling unit of the study is the establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

Universe

The whole population, or the universe, covered in the Enterprise Surveys is the non-agricultural economy. It comprises: all manufacturing sectors according to the ISIC Revision 3.1 group classification (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this population definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except sub-sector 72, IT, which was added to the population under study), and all public or utilities sectors.

Kind of data

Sample survey data [ssd]

Sampling procedure

The study was conducted using stratified random sampling. Three levels of stratification were used in the sample: firm sector, firm size, and geographic region.

Industry stratification was designed in the following way: the universe was stratified into seven manufacturing industries and one "other" manufacturing category, - two services categories, retail and IT, and one "other" services category. Each of the manufacturing categories had a target of 160 interviews; the "other" manufacturing category and the three services categories all had targets of 120 interviews.

Size stratification was defined following the standardized definition for the Enterprise Surveys: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not a common practice, except in the sectors of construction and agriculture.

Regional stratification was defined in eight locations (city and the surrounding business area): Mexico City, Estado de Mexico (MAMC), Guadalajara, Monterrey, Puebla, Monclova, Veracruz, and Leon.

Ciudad Juarez and Coahuila, which were included in the 2006 round of the Enterprise Surveys, were omitted in 2010 due to security concerns.

For Mexico, two sample frames were used. The first was supplied by the World Bank and consists of enterprises interviewed in Mexico 2006. The World Bank required that attempts should be made to re-interview establishments responding to the Mexico 2006 survey where they were within the selected geographical locations and met eligibility criteria. That sample is referred to as the Panel. The second sample frame was produced from the 2009 Economic Census of INEGI (Instituto Nacional de Geografía e Informática), the national bureau of statistics.

INEGI's database uses the SCIAN 2007 classification for economic activities while the Enterprise Surveys are based on the ISIC classification. Therefore, a conversion between the two classifications was made.

The two sample frames were then used for the selection of a sample with the aim of obtaining interviews with 1,600 establishments with five or more employees.

The quality of the frame was assessed at the outset of the project through visits to a random subset of firms and local contractor knowledge. The sample frame was not immune from the typical problems found in establishment surveys: positive rates of non-eligibility, repetition, non-existent units, etc. In addition, the sample frame contained no telephone/fax numbers so the local contractor had to screen the contacts by visiting them. Due to response rate and ineligibility issues, additional sample had to be extracted by the World Bank in order to obtain enough eligible contacts and meet the sample targets.

Given the impact that non-eligible units included in the sample universe may have on the results, adjustments may be needed when computing the appropriate weights for individual observations. The percentage of confirmed non-eligible units as a proportion of the total number of sampled establishments contacted for the survey was 12.55% (1079 out of 8600).

Mode of data collection

Face-to-face [f2f]

Research instrument

The current survey instruments are available: - Core Questionnaire [ISIC Rev.3.1: 45, 50, 51, 52, 55, 60-64, 72]; - Core Questionnaire + Manufacturing Module [ISIC Rev.3.1: 15-37]; - Core Questionnaire + Retail Module [ISIC Rev.3.1: 52]; - Screener Questionnaire.

The "Core Questionnaire" is the heart of the Enterprise Survey and contains the survey questions asked of all firms across the world. There are also two other survey instruments - the "Core Questionnaire + Manufacturing Module" and the "Core Questionnaire + Retail Module." The survey is fielded via three instruments in order to not ask questions that are irrelevant to specific types of firms, e.g. a question that relates to production and nonproduction workers should not be asked of a retail firm. In addition to questions that are asked across countries, all surveys are customized and contain country-specific questions. An example of customization would be including tourism-related questions that are asked in certain countries when tourism is an existing or potential sector of economic growth.

The standard Enterprise Survey topics include firm characteristics, gender participation, access to finance, annual sales, costs of inputs/labor, workforce composition, bribery, licensing, infrastructure, trade, crime, competition, capacity utilization, land and permits, taxation, informality, business-government relations, innovation and technology, and performance measures. The questionnaire also assesses the survey respondents' opinions on what are the obstacles to firm growth and performance.

Cleaning operations

Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.

Response rate

The number of realized interviews per contacted establishment was 0.17. The estimate is based on the total number of firms contacted including ineligible establishments. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 0.29.

Complete information regarding the sampling methodology, sample frame, weights, response rates, and implementation can be found in "Description of Mexico ES 2010 Implementation" in external resources.
n
Coronavirus (Covid-19) Data in the United States
nytimes.com
openicpsr.org
+3more
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
New York Times, Coronavirus (Covid-19) Data in the United States [Dataset]. https://www.nytimes.com/interactive/2020/us/coronavirus-us-cases.html
Explore at:
Dataset provided by
New York Times
Description
The New York Times is releasing a series of data files with cumulative counts of coronavirus cases in the United States, at the state and county level, over time. We are compiling this time series data from state and local governments and health departments in an attempt to provide a complete record of the ongoing outbreak.
Since late January, The Times has tracked cases of coronavirus in real time as they were identified after testing. Because of the widespread shortage of testing, however, the data is necessarily limited in the picture it presents of the outbreak.
We have used this data to power our maps and reporting tracking the outbreak, and it is now being made available to the public in response to requests from researchers, scientists and government officials who would like access to the data to better understand the outbreak.
The data begins with the first reported coronavirus case in Washington State on Jan. 21, 2020. We will publish regular updates to the data in this repository.
e
Global coastal flood hazard - Dataset - ENERGYDATA.INFO
energydata.info
Updated Nov 28, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Global coastal flood hazard - Dataset - ENERGYDATA.INFO [Dataset]. https://energydata.info/dataset/global-coastal-flood-hazard-0
Explore at:
Dataset updated
Nov 28, 2023
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The tropical cyclonic strong wind and storm surge model use information from 2594 historical tropical cyclones, topography, terrain roughness, and bathymetry. The historical tropical cyclones used in GAR15 cyclone wind and storm surge model are from five different oceanic basins: Northeast Pacific, Northwest Pacific, South Pacific, North Indian, South Indian and North Atlantic and the tracks were obtained from the IBTrACS database (Knapp et al. 2010). This database represents the repository of information associated with tropical cyclones that is the most up to date. Topography was taken from the Shuttle Radar Topography Mission (SRTM) of NASA, which provides terrain elevation grids at a 90 meters resolution, delivered by quadrants over the world. To account for surface roughness, polygons of urban areas worldwide were obtained from the Socioeconomic Data and Applications Centre, SEDAC (CIESIN et al., 2011). This was considered a good proxy of the spatial variation of surface roughness. A digital bathymetry model is employed with a spatial resolution of 30 arc-seconds, taken from the GEBCO_08 (General Bathymetric Chart of the Oceans) Grid Database of the British Oceanographic Data Centre (2009). Bathymetry is the information about the underwater floor of the ocean having direct influence on the formation of the storm surge. More information about the cyclone wind and storm surge hazard can be found in CIMNE et al., 2015a. Hazard analysis was performed using the software CAPRA Team Tropical Cyclones Hazard Modeler (Bernal, 2014). The vulnerability models used in the risk calculation for GAR correlate loss to the wind speed for 3-seconds gusts. For GAR15, the risk was calculated with the CAPRA-GIS platform which is risk modelling tool of the CAPRA suite (www.ecapra.org). The risk assessment was also conducted by CIMNE and Ingeniar to produced AAL and PML values for cyclone risk.
Global Land Cover 1992-2020
cacgeoportal.com
climate.esri.ca
+5more
Updated Apr 2, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Esri (2020). Global Land Cover 1992-2020 [Dataset]. https://www.cacgeoportal.com/datasets/1453082255024699af55c960bc3dc1fe
Explore at:
Dataset updated
Apr 2, 2020
Dataset authored and provided by
Esrihttp://esri.com/
Area covered
Description
This layer is a time series of the annual ESA CCI (Climate Change Initiative) land cover maps of the world. ESA has produced land cover maps for the years 1992-2020. These are available at the European Space Agency Climate Change Initiative website.Time Extent: 1992-2020Cell Size: 300 meter Source Type: ThematicPixel Type: 8 Bit UnsignedData Projection: GCS WGS84Mosaic Projection: Web Mercator Auxiliary Sphere Extent: GlobalSource: ESA Climate Change InitiativeUpdate Cycle: Annual until 2020, no updates thereafterWhat can you do with this layer? This layer may be added to ArcGIS Online maps and applications and shown in a time series to watch a "time lapse" view of land cover change since 1992 for any part of the world. The same behavior exists when the layer is added to ArcGIS Pro. In addition to displaying all layers in a series, this layer may be queried so that only one year is displayed in a map. This layer can be used in analysis. For example, the layer may be added to ArcGIS Pro with a query set to display just one year. Then, an area count of land cover types may be produced for a feature dataset using the zonal statistics tool. Statistics may be compared with the statistics from other years to show a trend. To sum up area by land cover using this service, or any other analysis, be sure to use an equal area projection, such as Albers or Equal Earth. Different Classifications Available to Map Five processing templates are included in this layer. The processing templates may be used to display a smaller set of land cover classes.Cartographic Renderer (Default Template)Displays all ESA CCI land cover classes.*Forested lands TemplateThe forested lands template shows only forested lands (classes 50-90).Urban Lands TemplateThe urban lands template shows only urban areas (class 190).Converted Lands TemplateThe converted lands template shows only urban lands and lands converted to agriculture (classes 10-40 and 190).Simplified RendererDisplays the map in ten simple classes which match the ten simplified classes used in 2050 Land Cover projections from Clark University.Any of these variables can be displayed or analyzed by selecting their processing template. In ArcGIS Online, select the Image Display Options on the layer. Then pull down the list of variables from the Renderer options. Click Apply and Close. In ArcGIS Pro, go into the Layer Properties. Select Processing Templates from the left hand menu. From the Processing Template pull down menu, select the variable to display. Using Time By default, the map will display as a time series animation, one year per frame. A time slider will appear when you add this layer to your map. To see the most current data, move the time slider until you see the most current year. In addition to displaying the past quarter century of land cover maps as an animation, this time series can also display just one year of data by use of a definition query. For a step by step example using ArcGIS Pro on how to display just one year of this layer, as well as to compare one year to another, see the blog called Calculating Impervious Surface Change. Hierarchical ClassificationLand cover types are defined using the land cover classification (LCCS) developed by the United Nations, FAO. It is designed to be as compatible as possible with other products, namely GLCC2000, GlobCover 2005 and 2009. This is a heirarchical classification system. For example, class 60 means "closed to open" canopy broadleaved deciduous tree cover. But in some places a more specific type of broadleaved deciduous tree cover may be available. In that case, a more specific code 61 or 62 may be used which specifies "open" (61) or "closed" (62) cover. Land Cover Processing To provide consistency over time, these maps are produced from baseline land cover maps, and are revised for changes each year depending on the best available satellite data from each period in time. These revisions were made from AVHRR 1km time series from 1992 to 1999, SPOT-VGT time series between 1999 and 2013, and PROBA-V data for years 2013, 2014 and 2015. When MERIS FR or PROBA-V time series are available, changes detected at 1 km are re-mapped at 300 m. The last step consists in back- and up-dating the 10-year baseline LC map to produce the 24 annual LC maps from 1992 to 2015. Source data The datasets behind this layer were extracted from NetCDF files and TIFF files produced by ESA. Years 1992-2015 were acquired from ESA CCI LC version 2.0.7 in TIFF format, and years 2016-2018 were acquired from version 2.1.1 in NetCDF format. These are downloadable from ESA with an account, after agreeing to their terms of use. https://maps.elie.ucl.ac.be/CCI/viewer/download.php CitationESA. Land Cover CCI Product User Guide Version 2. Tech. Rep. (2017). Available at: maps.elie.ucl.ac.be/CCI/viewer/download/ESACCI-LC-Ph2-PUGv2_2.0.pdfMore technical documentation on the source datasets is available here:https://cds.climate.copernicus.eu/cdsapp#!/dataset/satellite-land-cover?tab=doc*Index of all classes in this layer:10 Cropland, rainfed11 Herbaceous cover12 Tree or shrub cover20 Cropland, irrigated or post-flooding30 Mosaic cropland (>50%) / natural vegetation (tree, shrub, herbaceous cover) (<50%)40 Mosaic natural vegetation (tree, shrub, herbaceous cover) (>50%) / cropland (<50%) 50 Tree cover, broadleaved, evergreen, closed to open (>15%)60 Tree cover, broadleaved, deciduous, closed to open (>15%)61 Tree cover, broadleaved, deciduous, closed (>40%)62 Tree cover, broadleaved, deciduous, open (15-40%)70 Tree cover, needleleaved, evergreen, closed to open (>15%)71 Tree cover, needleleaved, evergreen, closed (>40%)72 Tree cover, needleleaved, evergreen, open (15-40%)80 Tree cover, needleleaved, deciduous, closed to open (>15%)81 Tree cover, needleleaved, deciduous, closed (>40%)82 Tree cover, needleleaved, deciduous, open (15-40%)90 Tree cover, mixed leaf type (broadleaved and needleleaved)100 Mosaic tree and shrub (>50%) / herbaceous cover (<50%)110 Mosaic herbaceous cover (>50%) / tree and shrub (<50%)120 Shrubland121 Shrubland evergreen122 Shrubland deciduous130 Grassland140 Lichens and mosses150 Sparse vegetation (tree, shrub, herbaceous cover) (<15%)151 Sparse tree (<15%)152 Sparse shrub (<15%)153 Sparse herbaceous cover (<15%)160 Tree cover, flooded, fresh or brakish water170 Tree cover, flooded, saline water180 Shrub or herbaceous cover, flooded, fresh/saline/brakish water190 Urban areas200 Bare areas201 Consolidated bare areas202 Unconsolidated bare areas210 Water bodies
u
Data from: Global subnational Gini coefficient (income inequality) and gross...
iro.uiowa.edu
zenodo.org
Updated Nov 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Matti Kummu; Venla Niva; Daniel Chrisendo; Juan Carlos Rocha; Roman Hoffmann; Vilma Sandström; Frederick Solt; Sina Masoumzadeh Sayyar (2024). Global subnational Gini coefficient (income inequality) and gross national income (GNI) per capita PPP datasets for 1990-2021 [Dataset]. https://iro.uiowa.edu/esploro/outputs/dataset/Global-subnational-Gini-coefficient-income-inequality/9984757687502771
Explore at:
Dataset updated
Nov 29, 2024
Dataset provided by
Zenodo
Authors
Matti Kummu; Venla Niva; Daniel Chrisendo; Juan Carlos Rocha; Roman Hoffmann; Vilma Sandström; Frederick Solt; Sina Masoumzadeh Sayyar
Time period covered
Nov 29, 2024
Description
This dataset provides a gridded subnational datasets for Income inequality (Gini coefficient) at admin 1 level Gross national income (GNI) per capita PPP at admin 1 level The datasets are based on reported subnational admin data and spans three decades from 1990 to 2021. The dataset is presented in details in the following publication. Please cite this paper when using data. Chrisendo D, Niva V, Hoffman R, Sayyar SM, Rocha J, Sandström V, Solt F, Kummu M. 2024. Income inequality has increased for over two-thirds of the global population. Preprint. doi: https://doi.org/10.21203/rs.3.rs-5548291/v1 Code is available at following repositories: Gini coefficient data creation: https://github.com/mattikummu/subnatGini GNI per capita data creation: https://github.com/mattikummu/subnatGNI analyses for the article: https://github.com/mattikummu/gini_gni_analyses The following data is given (formats in brackets) Income inequality (Gini coefficient) at admin 0 level (national) (GeoTIFF, gpkg, csv) Income inequality (Gini coefficient) at admin 1 level (subnational) (GeoTIFF, gpkg, csv) Gross national income (GNI) per capita PPP at admin 0 level (national) (GeoTIFF, gpkg, csv) Gross national income (GNI) per capita PPP at admin 1 level (subnational) (GeoTIFF, gpkg, csv) Slope for Gini coefficient at admin 1 level (GeoTIFF; slope is given also in gpk and csv files) Slope for GNI per capita at admin 1 level (GeoTIFF; slope is given also in gpk and csv files) Input data for the script that was used to generate the Gini coefficient (input_data_gini.zip) Input data for the script that was used to generate the GNI per capita PPP (input_data_GNI.zip) Files are named as followsFormat: raster data (GeoTIFF) starts with rast_*, polygon data (gpkg) with polyg_*, and tabulated with tabulated_*. Admin levels: adm0 for admin 0 level, adm1 for admin 1 levelProduct type: _gini_disp_ for gini coefficient based on disposable income _gni_perCapita_ for GNI per capita PPP Metadata Grids Resolution: 5 arc-min (0.083333333 degrees) Spatial extent: Lon: -180, 180; -90, 90 (xmin, xmax, ymin, ymax) Coordinate ref system: EPSG:4326 - WGS 84 Format: Multiband geotiff; one band for each year over 1990-2021 Unit: no unit for Gini coefficient and PPP USD in 2017 international dollars for GNI per capita Geospatial polygon (gpkg) files: Spatial extent: -180, 180; -90, 83.67 (xmin, xmax, ymin, ymax) Temporal extent: annual over 1990-2021 Coordinate ref system: EPSG:4326 - WGS 84 Format: gkpk Unit: no unit for Gini coefficient and PPP USD in 2017 international dollars for GNI per capita
C
Canada Population: 100 Years & Over
ceicdata.com
Updated Jan 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com (2025). Canada Population: 100 Years & Over [Dataset]. https://www.ceicdata.com/en/canada/population/population-100-years--over
Explore at:
Dataset updated
Jan 15, 2025
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jun 1, 2013 - Jun 1, 2024
Area covered
Canada
Variables measured
Population
Description
Canada Population: 100 Years & Over data was reported at 11.672 Person th in 2024. This records an increase from the previous number of 11.493 Person th for 2023. Canada Population: 100 Years & Over data is updated yearly, averaging 6.603 Person th from Jun 2000 (Median) to 2024, with 25 observations. The data reached an all-time high of 11.672 Person th in 2024 and a record low of 3.393 Person th in 2000. Canada Population: 100 Years & Over data remains active status in CEIC and is reported by Statistics Canada. The data is categorized under Global Database’s Canada – Table CA.G001: Population.
Data from: GeoCoV19: A Dataset of Hundreds of Millions of Multilingual...
zenodo.org
data.niaid.nih.gov
zip
Updated Jun 16, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Umair Qazi; Muhammad Imran; Muhammad Imran; Ferda Ofli; Ferda Ofli; Umair Qazi (2020). GeoCoV19: A Dataset of Hundreds of Millions of Multilingual COVID-19 Tweets with Location Information [Dataset]. http://doi.org/10.5281/zenodo.3878599
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3878599
Dataset updated
Jun 16, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Umair Qazi; Muhammad Imran; Muhammad Imran; Ferda Ofli; Ferda Ofli; Umair Qazi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We present GeoCoV19, a large-scale Twitter dataset related to the ongoing COVID-19 pandemic. The dataset has been collected over a period of 90 days from February 1 to May 1, 2020 and consists of more than 524 million multilingual tweets. As the geolocation information is essential for many tasks such as disease tracking and surveillance, we employed a gazetteer-based approach to extract toponyms from user location and tweet content to derive their geolocation information using the Nominatim (Open Street Maps) data at different geolocation granularity levels. In terms of geographical coverage, the dataset spans over 218 countries and 47K cities in the world. The tweets in the dataset are from more than 43 million Twitter users, including around 209K verified accounts. These users posted tweets in 62 different languages.
Data from: OSDG Community Dataset (OSDG-CD)
data.niaid.nih.gov
explore.openaire.eu
Updated Jun 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
PPMI (2024). OSDG Community Dataset (OSDG-CD) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5550237
Explore at:
Dataset updated
Jun 3, 2024
Dataset provided by
United Nations Development Programmehttp://www.undp.org/
OSDG
PPMI
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, which were validated by over 1,400 OSDG Community Platform (OSDG-CP) citizen scientists from over 140 countries, with respect to the Sustainable Development Goals (SDGs).

Dataset Information

In support of the global effort to achieve the Sustainable Development Goals (SDGs), OSDG is realising a series of SDG-labelled text datasets. The OSDG Community Dataset (OSDG-CD) is the direct result of the work of more than 1,400 volunteers from over 130 countries who have contributed to our understanding of SDGs via the OSDG Community Platform (OSDG-CP). The dataset contains tens of thousands of text excerpts (henceforth: texts) which were validated by the Community volunteers with respect to SDGs. The data can be used to derive insights into the nature of SDGs using either ontology-based or machine learning approaches.

📘 The file contains 43,0210 (+390) text excerpts and a total of 310,328 (+3,733) assigned labels.

To learn more about the project, please visit the OSDG website and the official GitHub page. Explore a detailed overview of the OSDG methodology in our recent paper "OSDG 2.0: a multilingual tool for classifying text data by UN Sustainable Development Goals (SDGs)".

Source Data

The dataset consists of paragraph-length text excerpts derived from publicly available documents, including reports, policy documents and publication abstracts. A significant number of documents (more than 3,000) originate from UN-related sources such as SDG-Pathfinder and SDG Library. These sources often contain documents that already have SDG labels associated with them. Each text is comprised of 3 to 6 sentences and is about 90 words on average.

Methodology

All the texts are evaluated by volunteers on the OSDG-CP. The platform is an ambitious attempt to bring together researchers, subject-matter experts and SDG advocates from all around the world to create a large and accurate source of textual information on the SDGs. The Community volunteers use the platform to participate in labelling exercises where they validate each text's relevance to SDGs based on their background knowledge.

In each exercise, the volunteer is shown a text together with an SDG label associated with it – this usually comes from the source – and asked to either accept or reject the suggested label.

There are 3 types of exercises:

All volunteers start with the mandatory introductory exercise that consists of 10 pre-selected texts. Each volunteer must complete this exercise before they can access 2 other exercise types. Upon completion, the volunteer reviews the exercise by comparing their answers with the answers of the rest of the Community using aggregated statistics we provide, i.e., the share of those who accepted and rejected the suggested SDG label for each of the 10 texts. This helps the volunteer to get a feel for the platform.

SDG-specific exercises where the volunteer validates texts with respect to a single SDG, e.g., SDG 1 No Poverty.

All SDGs exercise where the volunteer validates a random sequence of texts where each text can have any SDG as its associated label.

After finishing the introductory exercise, the volunteer is free to select either SDG-specific or All SDGs exercises. Each exercise, regardless of its type, consists of 100 texts. Once the exercise is finished, the volunteer can either label more texts or exit the platform. Of course, the volunteer can finish the exercise early. All progress is saved and recorded still.

To ensure quality, each text is validated by up to 9 different volunteers and all texts included in the public release of the data have been validated by at least 3 different volunteers.

It is worth keeping in mind that all exercises present the volunteers with a binary decision problem, i.e., either accept or reject a suggested label. The volunteers are never asked to select one or more SDGs that a certain text might relate to. The rationale behind this set-up is that asking a volunteer to select from 17 SDGs is extremely inefficient. Currently, all texts are validated against only one associated SDG label.

Column Description

doi - Digital Object Identifier of the original document

text_id - unique text identifier

text - text excerpt from the document

sdg - the SDG the text is validated against

labels_negative - the number of volunteers who rejected the suggested SDG label

labels_positive - the number of volunteers who accepted the suggested SDG label

agreement - agreement score based on the formula (agreement = \frac{|labels_{positive} - labels_{negative}|}{labels_{positive} + labels_{negative}})

Further Information

Do not hesitate to share with us your outputs, be it a research paper, a machine learning model, a blog post, or just an interesting observation. All queries can be directed to community@osdg.ai.
d
Geolytica POIData.xyz Points of Interest (POI) Geo Data - Brazil
datarade.ai
.csv
Updated Mar 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Geolytica (2021). Geolytica POIData.xyz Points of Interest (POI) Geo Data - Brazil [Dataset]. https://datarade.ai/data-products/geolytica-poidata-xyz-points-of-interest-poi-geo-data-brazil-geolytica
Explore at:
.csvAvailable download formats
Dataset updated
Mar 16, 2021
Dataset authored and provided by
Geolytica
Area covered
Brazil
Description
https://store.poidata.xyz/br

Point-of-interest (POI) is defined as a physical entity (such as a business) in a geo location (point) which may be (of interest).

We strive to provide the most accurate, complete and up to date point of interest datasets for all countries of the world. The Brazil POI Dataset is one of our worldwide POI datasets with over 90% coverage.

This is our process flow:

Our machine learning systems continuously crawl for new POI data Our geoparsing and geocoding calculates their geo locations Our categorization systems cleanup and standardize the datasets Our data pipeline API publishes the datasets on our data store

POI Data is in a constant flux - especially so during times of drastic change such as the Covid-19 pandemic.

Every minute worldwide on an average day over 200 businesses will move, over 600 new businesses will open their doors and over 400 businesses will cease to exist.

In today's interconnected world, of the approximately 200 million POIs worldwide, over 94% have a public online presence. As a new POI comes into existence its information will appear very quickly in location based social networks (LBSNs), other social media, pictures, websites, blogs, press releases. Soon after that, our state-of-the-art POI Information retrieval system will pick it up.

We offer our customers perpetual data licenses for any dataset representing this ever changing information, downloaded at any given point in time. This makes our company's licensing model unique in the current Data as a Service - DaaS Industry. Our customers don't have to delete our data after the expiration of a certain "Term", regardless of whether the data was purchased as a one time snapshot, or via a recurring payment plan on our data update pipeline.

The main differentiators between us vs the competition are our flexible licensing terms and our data freshness.

The main attribute coverage is as follows:

Poi Field Data Coverage (%) poi_name 100 brand 6 poi_tel 53 formatted_address 100 main_category 100 latitude 100 longitude 100 neighborhood 1 source_url 30 email 5 opening_hours 58

A data sample may be downloaded at https://store.poidata.xyz/datafiles/br_sample.csv and the data may be previewed on a map at https://store.poidata.xyz/br
Enterprise Survey 2009-2016, Panel Data - Lesotho
microdata.worldbank.org
datacatalog.ihsn.org
+1more
Updated May 11, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Bank (2017). Enterprise Survey 2009-2016, Panel Data - Lesotho [Dataset]. https://microdata.worldbank.org/index.php/catalog/2835
Explore at:
Dataset updated
May 11, 2017
Dataset authored and provided by
World Bankhttp://worldbank.org/
Time period covered
2008 - 2016
Area covered
Lesotho
Description
Abstract

The documented dataset covers Enterprise Survey (ES) panel data collected in Lesotho in 2009 and 2016, as part of Africa Enterprise Surveys rollout, an initiative of the World Bank. The objective of the Enterprise Survey is to obtain feedback from enterprises on the state of the private sector as well as to help in building a panel of enterprise data that will make it possible to track changes in the business environment over time, thus allowing, for example, impact assessments of reforms.

Enterprise Surveys target a sample consisting of longitudinal (panel) observations and new cross-sectional data. Panel firms are prioritized in the sample selection, comprising up to 50% of the sample in the current wave. For all panel firms, regardless of the sample, current eligibility or operating status is determined and included in panel datasets.

Lesotho ES 2009 was conducted from September 2008 to February 2009, Lesotho ES 2016 was carried out in June - August 2016. Stratified random sampling was used to select the surveyed businesses. Data was collected using face-to-face interviews.

Data from 301 establishments was analyzed: 90 businesses were from 2009 only, 89 - from 2016 only, and 122 firms were from 2009 and 2016.

The standard Enterprise Survey topics include firm characteristics, gender participation, access to finance, annual sales, costs of inputs and labor, workforce composition, bribery, licensing, infrastructure, trade, crime, competition, capacity utilization, land and permits, taxation, informality, business-government relations, innovation and technology, and performance measures. Over 90 percent of the questions objectively measure characteristics of a country’s business environment. The remaining questions assess the survey respondents’ opinions on what are the obstacles to firm growth and performance.

Geographic coverage

National

Analysis unit

The primary sampling unit of the study is an establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

Universe

The whole population, or the universe, covered in the Enterprise Surveys is the non-agricultural private economy. It comprises: all manufacturing sectors according to the ISIC Revision 3.1 group classification (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this population definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except sub-sector 72, IT, which was added to the population under study), and all public or utilities sectors. Companies with 100% government ownership are not eligible to participate in the Enterprise Surveys.

Kind of data

Sample survey data [ssd]

Sampling procedure

Two levels of stratification were used in this country: industry and establishment size.

Industry stratification was designed as follows: the universe was stratified as into manufacturing and services industries - Manufacturing (ISIC Rev. 3.1 codes 15 - 37), and Services (ISIC codes 45, 50-52, 55, 60-64, and 72).

For the Lesotho ES, size stratification was defined as follows: small (5 to 19 employees), medium (20 to 99 employees), and large (100 or more employees). Regional stratification did not take place for the Lesotho ES.

In 2009, it was not possible to obtain a single usable frame for Lesotho. Instead frames were obtained from two government branches: the Chamber of Commerce and the Ministry of Trade, Industry, Cooperatives and Marketing. Those frames were merged and duplicates removed to provide the frame used for the survey.

In 2016 ES, the sample frame consisted of listings of firms from two sources: for panel firms the list of 151 firms from the Lesotho 2009 ES was used and for fresh firms (i.e., firms not covered in 2009) firm data from Lesotho Bureau of Statistics Business Register, published in August 2015, was used.

Mode of data collection

Face-to-face [f2f]

Research instrument

The following survey instruments were used for Lesotho ES: - Manufacturing Module Questionnaire - Services Module Questionnaire

The survey is fielded via manufacturing or services questionnaires in order not to ask questions that are irrelevant to specific types of firms, e.g. a question that relates to production and nonproduction workers should not be asked of a retail firm. In addition to questions that are asked across countries, all surveys are customized and contain country-specific questions. An example of customization would be including tourism-related questions that are asked in certain countries when tourism is an existing or potential sector of economic growth. There is a skip pattern in the Service Module Questionnaire for questions that apply only to retail firms.

Cleaning operations

Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.

Response rate

Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether whereas the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.

Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect "Refusal to respond" (-8) as a different option from "Don't know" (-9). b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary.

Survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals.
N
Income Bracket Analysis by Age Group Dataset: Age-Wise Distribution of White...
neilsberg.com
csv, json
Updated Feb 25, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Income Bracket Analysis by Age Group Dataset: Age-Wise Distribution of White Earth Township, Minnesota Household Incomes Across 16 Income Brackets // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/f377f170-f353-11ef-8577-3860777c1fe6/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Feb 25, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
White Earth Township, Minnesota
Variables measured
Number of households with income $200,000 or more, Number of households with income less than $10,000, Number of households with income between $15,000 - $19,999, Number of households with income between $20,000 - $24,999, Number of households with income between $25,000 - $29,999, Number of households with income between $30,000 - $34,999, Number of households with income between $35,000 - $39,999, Number of households with income between $40,000 - $44,999, Number of households with income between $45,000 - $49,999, Number of households with income between $50,000 - $59,999, and 6 more
Measurement technique
The data presented in this dataset is derived from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. It delineates income distributions across 16 income brackets (mentioned above) following an initial analysis and categorization. Using this dataset, you can find out the total number of households within a specific income bracket along with how many households with that income bracket for each of the 4 age cohorts (Under 25 years, 25-44 years, 45-64 years and 65 years and over). For additional information about these estimations, please contact us via email at research@neilsberg.com
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset presents the the household distribution across 16 income brackets among four distinct age groups in White Earth township: Under 25 years, 25-44 years, 45-64 years, and over 65 years. The dataset highlights the variation in household income, offering valuable insights into economic trends and disparities within different age categories, aiding in data analysis and decision-making..

Key observations

Upon closer examination of the distribution of households among age brackets, it reveals that there are 14(4.24%) households where the householder is under 25 years old, 147(44.55%) households with a householder aged between 25 and 44 years, 90(27.27%) households with a householder aged between 45 and 64 years, and 79(23.94%) households where the householder is over 65 years old.

The age group of 45 to 64 years exhibits the highest median household income, while the largest number of households falls within the 25 to 44 years bracket. This distribution hints at economic disparities within the township of White Earth township, showcasing varying income levels among different age demographics.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Income brackets:

Less than $10,000

$10,000 to $14,999

$15,000 to $19,999

$20,000 to $24,999

$25,000 to $29,999

$30,000 to $34,999

$35,000 to $39,999

$40,000 to $44,999

$45,000 to $49,999

$50,000 to $59,999

$60,000 to $74,999

$75,000 to $99,999

$100,000 to $124,999

$125,000 to $149,999

$150,000 to $199,999

$200,000 or more

Variables / Data Columns

Household Income: This column showcases 16 income brackets ranging from Under $10,000 to $200,000+ ( As mentioned above).

Under 25 years: The count of households led by a head of household under 25 years old with income within a specified income bracket.

25 to 44 years: The count of households led by a head of household 25 to 44 years old with income within a specified income bracket.

45 to 64 years: The count of households led by a head of household 45 to 64 years old with income within a specified income bracket.

65 years and over: The count of households led by a head of household 65 years and over old with income within a specified income bracket.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for White Earth township median household income by age. You can refer the same here
d
Geolytica POIData.xyz Points of Interest (POI) Geo Data - Chile
datarade.ai
.csv
Updated Mar 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Geolytica (2021). Geolytica POIData.xyz Points of Interest (POI) Geo Data - Chile [Dataset]. https://datarade.ai/data-products/geolytica-poidata-xyz-points-of-interest-poi-geo-data-chile-geolytica
Explore at:
.csvAvailable download formats
Dataset updated
Mar 16, 2021
Dataset authored and provided by
Geolytica
Area covered
Chile
Description
https://store.poidata.xyz/cl

Point-of-interest (POI) is defined as a physical entity (such as a business) in a geo location (point) which may be (of interest).

We strive to provide the most accurate, complete and up to date point of interest datasets for all countries of the world. The Chile POI Dataset is one of our worldwide POI datasets with over 90% coverage.

This is our process flow:

Our machine learning systems continuously crawl for new POI data Our geoparsing and geocoding calculates their geo locations Our categorization systems cleanup and standardize the datasets Our data pipeline API publishes the datasets on our data store

POI Data is in a constant flux - especially so during times of drastic change such as the Covid-19 pandemic.

Every minute worldwide on an average day over 200 businesses will move, over 600 new businesses will open their doors and over 400 businesses will cease to exist.

In today's interconnected world, of the approximately 200 million POIs worldwide, over 94% have a public online presence. As a new POI comes into existence its information will appear very quickly in location based social networks (LBSNs), other social media, pictures, websites, blogs, press releases. Soon after that, our state-of-the-art POI Information retrieval system will pick it up.

We offer our customers perpetual data licenses for any dataset representing this ever changing information, downloaded at any given point in time. This makes our company's licensing model unique in the current Data as a Service - DaaS Industry. Our customers don't have to delete our data after the expiration of a certain "Term", regardless of whether the data was purchased as a one time snapshot, or via a recurring payment plan on our data update pipeline.

The main differentiators between us vs the competition are our flexible licensing terms and our data freshness.

The main attribute coverage is as follows:

Poi Field Data Coverage (%) poi_name 100 brand 4 poi_tel 43 formatted_address 100 main_category 93 latitude 100 longitude 100 neighborhood 6 source_url 37 email 5 opening_hours 38

A data sample may be downloaded at https://store.poidata.xyz/datafiles/cl_sample.csv and the data may be previewed on a map at https://store.poidata.xyz/cl

Facebook

Twitter

Click to copy link

Link copied

Cite

Statista (2025). World population by age and region 2024 [Dataset]. https://www.statista.com/statistics/265759/world-population-by-age-and-region/

World population by age and region 2024

Explore at:

71 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Mar 11, 2025

Dataset authored and provided by

Statistahttp://statista.com/

Area covered

World

Description

Globally, about 25 percent of the population is under 15 years of age and 10 percent is over 65 years of age. Africa has the youngest population worldwide. In Sub-Saharan Africa, more than 40 percent of the population is below 15 years, and only three percent are above 65, indicating the low life expectancy in several of the countries. In Europe, on the other hand, a higher share of the population is above 65 years than the population under 15 years. Fertility rates The high share of children and youth in Africa is connected to the high fertility rates on the continent. For instance, South Sudan and Niger have the highest population growth rates globally. However, about 50 percent of the world’s population live in countries with low fertility, where women have less than 2.1 children. Some countries in Europe, like Latvia and Lithuania, have experienced a population decline of one percent, and in the Cook Islands, it is even above two percent. In Europe, the majority of the population was previously working-aged adults with few dependents, but this trend is expected to reverse soon, and it is predicted that by 2050, the older population will outnumber the young in many developed countries. Growing global population As of 2025, there are 8.1 billion people living on the planet, and this is expected to reach more than nine billion before 2040. Moreover, the global population is expected to reach 10 billions around 2060, before slowing and then even falling slightly by 2100. As the population growth rates indicate, a significant share of the population increase will happen in Africa.

Clear search

Close search

Google apps

Main menu

World population by age and region 2024

Amount of data created, consumed, and stored 2010-2023, with forecasts to...

Indicator 17.19.2: Proportion of countries with birth registration data that...

Enterprise Survey 2009-2014, Panel Data - Malawi

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Blue Earth County, MN Population Pyramid Dataset: Age Groups, Male and...

About this dataset

Content

Inspiration

Recommended for further research

World - Regulatory Indicators for Sustainable Energy - Dataset -...

Enterprise Survey 2009-2017, Panel Data - Liberia

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Penn World Table 10.01

Enterprise Survey 2010 - Mexico

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Coronavirus (Covid-19) Data in the United States

Global coastal flood hazard - Dataset - ENERGYDATA.INFO

Global Land Cover 1992-2020

Data from: Global subnational Gini coefficient (income inequality) and gross...

Canada Population: 100 Years & Over

Data from: GeoCoV19: A Dataset of Hundreds of Millions of Multilingual...

Data from: OSDG Community Dataset (OSDG-CD)

Geolytica POIData.xyz Points of Interest (POI) Geo Data - Brazil

Enterprise Survey 2009-2016, Panel Data - Lesotho

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

Income Bracket Analysis by Age Group Dataset: Age-Wise Distribution of White...

About this dataset

Content

Inspiration

Recommended for further research

Geolytica POIData.xyz Points of Interest (POI) Geo Data - Chile

World population by age and region 2024