Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Population, female (% of total population) in World was reported at 49.72 % in 2024, according to the World Bank collection of development indicators, compiled from officially recognized sources. World - Population, female (% of total) - actual values, historical data, forecasts and projections were sourced from the World Bank on October of 2025.
Facebook
TwitterAccording to a WIN World Survey, 72% respondents over the world say that gender equality at home has definitely or to some extent been achieved in their country. This is a series of polls being released in honor of International Women’s Day, celebrated on the 8th of March every year. A sample of 29,368 men and women from 40 countries across the globe was asked, “Would you say gender equality has been definitely/to some extent/not really/not at all achieved in your country at home?” 72% of respondents in participating countries say that gender equality at home has definitely or to some extent been achieved in their country, while 24% say that it has not really, or not at all been achieved. 4% did not know or did not respond. Globally, the net index for gender equality at home is 48%. Results for Pakistan similar to rest of the world: Respondents from Pakistan had similar views, with 75% saying gender equality is definitely or to some achieved, while 24% disagreed. Net index (% Definitely achieved + To some extent achieved) – (% Not really achieved + Not at all achieved) for Pakistan is 51%. Global gender breakdown: Analysis on the basis of gender shows that 75% males, and 70% females were of the opinion that gender equality at home has been achieved. Country wise Analysis: Philippines ranks the highest Of the 40 countries surveyed, all except Japan have a positive net index for gender equality at home. Philippines ranks the highest with a net index of 81%, followed by Vietnam at 79%. In contrast, Japan has the lowest index at -9%.
Facebook
TwitterFacebook’s Survey on Gender Equality at Home generates a global snapshot of women and men’s access to resources, their time spent on unpaid care work, and their attitudes about equality. This survey covers topics about gender dynamics and norms, unpaid caregiving, and life during the COVID-19 pandemic. Aggregated data is available publicly on Humanitarian Data Exchange (HDX). De-identified microdata is also available to eligible nonprofits and universities through Facebook’s Data for Good (DFG) program. For more information, please email dataforgood@fb.com.
This survey is fielded once a year in over 200 countries and 60 languages. The data can help researchers track trends in gender equality and progress on the Sustainable Development Goals.
The survey was fielded to active Facebook users.
Sample survey data [ssd]
Respondents were sampled across seven regions: - East Asia and Pacific; Europe and Central Asia - Latin America and Caribbean - Middle East and North Africa - North America - Sub-Saharan Africa - South Asia
For the purposes of this report, responses have been aggregated up to the regional level; these regional estimates form the basis of this report and its associated products (Regional Briefs). In order to ensure respondent confidentiality, these estimates are based on responses where a sufficient number of people responded to each question and thus where confidentiality can be assured. This results in a sample of 461,748 respondents.
The sampling frame for this survey is the global database of Facebook users who were active on the platform at least once over the past 28 days, which offers a number of advantages: It allows for the design, implementation, and launch of a survey in a timely manner. Large sample sizes allow for more questions to be asked through random assignment of modules, avoiding respondent fatigue. Samples may be drawn from diverse segments of the online population. Knowledge of the overall sampling frame allowed for more rigorous probabilistic sampling techniques and non-response adjustments than is typical for online and phone surveys
Internet [int]
The survey includes a total of 75 questions, split across into the following sections: - Basic demographics and gender norms - Decision making and resource allocation across household members - Unpaid caregiving - Additional household demographics and COVID-19 impact - Optional questions for special groups (e.g. students, business owners, the employed, and the unemployed)
Questions were developed collaboratively by a team of economists and gender experts from the World Bank, UN Women, Equal Measures 2030, and Ladysmith. Some of the questions have been borrowed from other surveys that employ alternative modes of administration (e.g., face-to-face, telephone surveys, etc.); this allows for comparability and identification of potential gaps and biases inherent to Facebook and other online survey platforms. As such, the survey also generates methodological insights that are useful to researchers undertaking alternative modes of data collection during the COVID-19 era.
In order to avoid “survey fatigue,” wherein respondents begin to disengage from the survey content and responses become less reliable, each respondent was only asked to answer a subset of questions. Specifically, each respondent saw a maximum of 30 questions, comprising demographics (asked of all respondents) and a set of additional questions randomly and purposely allocated to them.
Response rates to online surveys vary widely depending on a number of factors including survey length, region, strength of the relationship with invitees, incentive mechanisms, invite copy, interest of respondents in the topic and survey design.
Any survey data is prone to several forms of error and biases that need to be considered to understand how closely the results reflect the intended population. In particular, the following components of the total survey error are noteworthy:
Sampling error is a natural characteristic of every survey based on samples and reflects the uncertainty in any survey result that is attributable to the fact that not the whole population is surveyed.
Other factors beyond sampling error that contribute to such potential differences are frame or coverage error and nonresponse error.
Survey Limitations The survey only captures respondents who: (1) have access to the Internet (2) are Facebook users (3) opt to take this survey through the Facebook platform. Knowledge of the overall demographics of the online population in each region allows for calibration such that estimates are representative at this level. However, this means the results only tell us something about the online population in each region, not the overall population. As such, the survey cannot generate global estimates or meaningful comparisons across countries and regions, given the heterogeneity in internet connectivity across countries. Estimates have only been generated for respondents who gave their gender as male or female. The survey included an “other” option but very few respondents selected it, making it impossible to generate meaningful estimates for non-binary populations. It is important to note that the survey was not designed to paint a comprehensive picture of household dynamics but rather to shed light on respondents’ reported experiences and roles within households
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Originally, the dataset come from the CDC and is a major part of the Behavioral Risk Factor Surveillance System (BRFSS), which conducts annual telephone surveys to gather data on the health status of U.S. residents. As the CDC describes: "Established in 1984 with 15 states, BRFSS now collects data in all 50 states as well as the District of Columbia and three U.S. territories. BRFSS completes more than 400,000 adult interviews each year, making it the largest continuously conducted health survey system in the world.". The most recent dataset (as of February 15, 2022) includes data from 2020. It consists of 401,958 rows and 279 columns. The vast majority of columns are questions asked to respondents about their health status, such as "Do you have serious difficulty walking or climbing stairs?" or "Have you smoked at least 100 cigarettes in your entire life? [Note: 5 packs = 100 cigarettes]".
To improve the efficiency and relevance of our analysis, we removed certain attributes from the original BRFSS dataset. Many of the 279 original attributes included administrative codes, metadata, or survey-specific variables that do not contribute meaningfully to heart disease prediction—such as respondent IDs, timestamps, state-level identifiers, and detailed lifestyle questions unrelated to cardiovascular health. By focusing on a carefully selected subset of 18 attributes directly linked to medical, behavioral, and demographic factors known to influence heart health, we streamlined the dataset. This not only reduced computational complexity but also improved model interpretability and performance by eliminating noise and irrelevant information. All predicting variables could be divided into 4 broad categories:
Demographic factors: sex, age category (14 levels), race, BMI (Body Mass Index)
Diseases: weather respondent ever had such diseases as asthma, skin cancer, diabetes, stroke or kidney disease (not including kidney stones, bladder infection or incontinence)
Unhealthy habits:
General Health:
Below is a description of the features collected for each patient:
|
S. No. |
Original Variable/Attribute |
Coded Variable/Attribute |
Interpretation |
|
1. |
CVDINFR4 |
HeartDisease |
Those who have ever had CHD or myocardial infarction |
|
2. |
_BMI5CAT |
BMI |
Body Mass Index |
|
3. |
_SMOKER3 |
Smoking |
Have you ever smoked more than 100 cigarettes in your life? (The answer is either yes or no) |
|
4. |
_RFDRHV7 |
AlcoholDrinking |
Adult men who drink more than 14 drinks per week and adult women who consume more than 7 drinks per week are considered heavy drinkers |
|
5. |
CVDSTRK3 |
Stroke |
(Ever told) (you had) a stroke? |
|
6. |
PHYSHLTH |
PhysicalHealth |
It includes physical illness and injury during the past 30 days |
|
7. |
MENTHLTH |
MentalHealth |
How many days in the last 30 days have you had poor mental health? |
|
8. |
DIFFWALK |
DiffWalking |
Are you having trouble walking or climbing stairs? |
|
9. |
SEXVAR |
Sex |
Are you male or female? |
|
10. |
_AGE_G |
AgeCategory |
Out of given fourteen age groups, which group do you fall into? |
Facebook
TwitterThe global number of Facebook users was forecast to continuously increase between 2023 and 2027 by in total 391 million users (+14.36 percent). After the fourth consecutive increasing year, the Facebook user base is estimated to reach 3.1 billion users and therefore a new peak in 2027. Notably, the number of Facebook users was continuously increasing over the past years. User figures, shown here regarding the platform Facebook, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts by persons only once.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Key information about United States Employed Persons
Facebook
TwitterWorldPop produces different types of gridded population count datasets, depending on the methods used and end application.
Please make sure you have read our Mapping Populations overview page before choosing and downloading a dataset.
A description of the modelling methods used for age and gender structures can be found in
"https://pophealthmetrics.biomedcentral.com/articles/10.1186/1478-7954-11-11" target="_blank">
Tatem et al and
Pezzulo et al. Details of the input population count datasets used can be found here, and age/gender structure proportion datasets here.
Both top-down 'unconstrained' and 'constrained' versions of the datasets are available, and the differences between the two methods are outlined
here. The datasets represent the outputs from a project focused on construction of consistent 100m resolution population count datasets for all countries of the World structured by male/female and 5-year age classes (plus a <1 year class). These efforts necessarily involved some shortcuts for consistency. The unconstrained datasets are available for each year from 2000 to 2020.
The constrained datasets are only available for 2020 at present, given the time periods represented by the building footprint and built settlement datasets used in the mapping.
Data for earlier dates is available directly from WorldPop.
WorldPop (www.worldpop.org - School of Geography and Environmental Science, University of Southampton; Department of Geography and Geosciences, University of Louisville; Departement de Geographie, Universite de Namur) and Center for International Earth Science Information Network (CIESIN), Columbia University (2018). Global High Resolution Population Denominators Project - Funded by The Bill and Melinda Gates Foundation (OPP1134076). https://dx.doi.org/10.5258/SOTON/WP00646
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
School enrollment, primary (gross), gender parity index (GPI) in Norway was reported at 1.0031 % in 2020, according to the World Bank collection of development indicators, compiled from officially recognized sources. Norway - Ratio of female to male primary enrollment - actual values, historical data, forecasts and projections were sourced from the World Bank on October of 2025.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Key information about Syria Employed Persons
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
School enrollment, secondary (gross), gender parity index (GPI) in World was reported at 0.98868 % in 2020, according to the World Bank collection of development indicators, compiled from officially recognized sources. World - Ratio of female to male secondary enrollment - actual values, historical data, forecasts and projections were sourced from the World Bank on October of 2025.
Facebook
TwitterAs of April 2024, around 16.5 percent of global active Instagram users were men between the ages of 18 and 24 years. More than half of the global Instagram population worldwide was aged 34 years or younger.
Teens and social media
As one of the biggest social networks worldwide, Instagram is especially popular with teenagers. As of fall 2020, the photo-sharing app ranked third in terms of preferred social network among teenagers in the United States, second to Snapchat and TikTok. Instagram was one of the most influential advertising channels among female Gen Z users when making purchasing decisions. Teens report feeling more confident, popular, and better about themselves when using social media, and less lonely, depressed and anxious.
Social media can have negative effects on teens, which is also much more pronounced on those with low emotional well-being. It was found that 35 percent of teenagers with low social-emotional well-being reported to have experienced cyber bullying when using social media, while in comparison only five percent of teenagers with high social-emotional well-being stated the same. As such, social media can have a big impact on already fragile states of mind.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Long jump results, women and men, for all Olympics between 2008 and 2024: 2008, 2012, 2016, 2020, and 2024.
The dataset is ideal for those interested in sports analytics, performance trends, track and fied, athletics or long jump statistics, as it offers comprehensivelong jump data across multiple Olympic Games.
Dataset Highlights - Results from multiple Olympic Games (2008–2024) - Detailed jump-by-jump performance data for athletes - Separate records for preliminary and final rounds - Data from both men's and women's long jump events
Competition format - In the preliminary round, all athletes get three jumps. - The top athletes in the preliminary round proceed to the final round. This is typically the top 12 athletes from the preliminary round. - In the final round, all athletes get three jumps. The top eight athletes get an additional three jumps for a total of six jumps in the final round. - The winner is determined by the longest distance during the final round. Note that the preliminary round does not count.
Original source - The original source of this data is Wikipedia. - Here is an example page: Wikipedia 2008 Olympic Women's Long Jump Results
Column Descriptions - Rank: Athlete’s rank after the prelim round which consists of three jumps. Note that this is not the final ranking. - Group: Qualifying group (A or B) the athlete competed in during the preliminaries. - Name: Name of the athlete. - Country: Country the athlete represents. - Jump_1_Prelim: Distance (in meters) of the athlete’s first jump in the preliminary round. - Jump_2_Prelim: Distance of the athlete’s second jump in the preliminary round. - - Jump_3_Prelim: Distance of the athlete’s third jump in the preliminary round. - Jump_1_Final: Distance of the athlete’s first jump in the final round. - Jump_2_Final: Distance of the athlete’s second jump in the final round. - Jump_3_Final: Distance of the athlete’s third jump in the final round. - Jump_4_Final: Distance of the athlete’s fourth jump in the final round (if applicable). - Jump_5_Final: Distance of the athlete’s fifth jump in the final round (if applicable). - Jump_6_Final: Distance of the athlete’s sixth jump in the final round (if applicable). "- - Year: Year of the Olympic Games (e.g., 2024). - Gender: Gender of the athlete (Men or Women).
Usage Ideas - Analyze performance trends across multiple Olympic Games. - Compare the performance of male and female athletes in long jump. - Study jump-by-jump performance for individual athletes or countries. - Investigate correlations between jump performance in preliminary and final rounds. - Whether you are a sports enthusiast, data analyst, or machine learning practitioner, this dataset offers a rich source of information for understanding Olympic long jump performances over time
Sample Python notebook: https://www.kaggle.com/code/michaeldelamaza/find-long-jump-results-of-a-particular-athlete/edit
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Key information about Vietnam Employed Persons
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Project Tycho datasets contain case counts for reported disease conditions for countries around the world. The Project Tycho data curation team extracts these case counts from various reputable sources, typically from national or international health authorities, such as the US Centers for Disease Control or the World Health Organization. These original data sources include both open- and restricted-access sources. For restricted-access sources, the Project Tycho team has obtained permission for redistribution from data contributors. All datasets contain case count data that are identical to counts published in the original source and no counts have been modified in any way by the Project Tycho team, except for aggregation of individual case count data into daily counts when that was the best data available for a disease and location. The Project Tycho team has pre-processed datasets by adding new variables, such as standard disease and location identifiers, that improve data interpretability. We also formatted the data into a standard data format. All geographic locations at the country and admin1 level have been represented at the same geographic level as in the data source, provided an ISO code or codes could be identified, unless the data source specifies that the location is listed at an inaccurate geographical level. For more information about decisions made by the curation team, recommended data processing steps, and the data sources used, please see the README that is included in the dataset download ZIP file.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Chad: Ratio of female to male students in tertiary level education: The latest value from 2020 is 0.39 percent, an increase from 0.29 percent in 2015. In comparison, the world average is 1.15 percent, based on data from 131 countries. Historically, the average for Chad from 1972 to 2020 is 0.14 percent. The minimum value, 0 percent, was reached in 1972 while the maximum of 0.39 percent was recorded in 2020.
Facebook
Twitterhttps://www.ine.es/aviso_legalhttps://www.ine.es/aviso_legal
Women and Men in Spain: Female part time work according to period. Percentage of all women employed. Spain, UE-27 and UE-28. Annual. National. Nota: UE27_2020: 27 países (desde 2020). UE-28: 28 países (2013-2020).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Laos: Ratio of female to male students in secondary school: The latest value from 2021 is 0.95 percent, unchanged from 0.95 percent in 2020. In comparison, the world average is 1.02 percent, based on data from 52 countries. Historically, the average for Laos from 1971 to 2021 is 0.73 percent. The minimum value, 0.37 percent, was reached in 1971 while the maximum of 0.95 percent was recorded in 2020.
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This starter data kit collects extracts from global, open datasets relating to climate hazards and infrastructure systems.
These extracts are derived from global datasets which have been clipped to the national scale (or subnational, in cases where national boundaries have been split, generally to separate outlying islands or non-contiguous regions), using Natural Earth (2023) boundaries, and is not meant to express an opinion about borders, territory or sovereignty.
Human-induced climate change is increasing the frequency and severity of climate and weather extremes. This is causing widespread, adverse impacts to societies, economies and infrastructures. Climate risk analysis is essential to inform policy decisions aimed at reducing risk. Yet, access to data is often a barrier, particularly in low and middle-income countries. Data are often scattered, hard to find, in formats that are difficult to use or requiring considerable technical expertise. Nevertheless, there are global, open datasets which provide some information about climate hazards, society, infrastructure and the economy. This "data starter kit" aims to kickstart the process and act as a starting point for further model development and scenario analysis.
Hazards:
Exposure:
Contextual information:
The spatial intersection of hazard and exposure datasets is a first step to analyse vulnerability and risk to infrastructure and people.
To learn more about related concepts, there is a free short course available through the Open University on Infrastructure and Climate Resilience. This overview of the course has more details.
These Python libraries may be a useful place to start analysis of the data in the packages produced by this workflow:
snkit helps clean network data
nismod-snail is designed to help implement infrastructure
exposure, damage and risk calculations
The open-gira
repository contains a larger workflow for global-scale open-data infrastructure risk and resilience analysis.
For a more developed example, some of these datasets were key inputs to a regional climate risk assessment of current and future flooding risks to transport networks in East Africa, which has a related online visualisation tool at https://east-africa.infrastructureresilience.org/ and is described in detail in Hickford et al (2023).
References
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
School age population, secondary education, male (number) in Iraq was reported at 2625435 Persons in 2020, according to the World Bank collection of development indicators, compiled from officially recognized sources. Iraq - Population of the official age for secondary education, male - actual values, historical data, forecasts and projections were sourced from the World Bank on September of 2025.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Armed forces personnel, total in World was reported at 27406000 in 2020, according to the World Bank collection of development indicators, compiled from officially recognized sources. World - Armed forces personnel, total - actual values, historical data, forecasts and projections were sourced from the World Bank on September of 2025.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Population, female (% of total population) in World was reported at 49.72 % in 2024, according to the World Bank collection of development indicators, compiled from officially recognized sources. World - Population, female (% of total) - actual values, historical data, forecasts and projections were sourced from the World Bank on October of 2025.