Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Have you ever wondered how the population landscape of our planet looks in 2025? This dataset brings together the latest population statistics for 233 countries and territories, carefully collected from Worldometers.info — one of the most trusted global data sources.
📊 It reveals how countries are growing, shrinking, and evolving demographically. From population density to fertility rate, from migration trends to urbanization, every number tells a story about humanity’s future.
🌆 You can explore which nations are rapidly expanding, which are aging, and how urban populations are transforming global living patterns. This dataset includes key metrics like yearly population change, net migration, land area, fertility rate, and each country’s share of the world population.
🧠 Ideal for data analysis, visualization, and machine learning, it can be used to study global trends, forecast population growth, or build engaging dashboards in Python, R, or Tableau. It’s also perfect for students and researchers exploring geography, demographics, or development studies.
📈 Whether you’re analyzing Asia’s population boom, Europe’s aging curve, or Africa’s youthful surge — this dataset gives you a complete view of the world’s demographic balance in 2025. 🌎 With 233 rows and 12 insightful columns, it’s ready for your next EDA, visualization, or predictive modeling project.
🚀 Dive in, explore the data, and uncover what the world looks like — one country at a time.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Actual value and historical data chart for World Population Female Percent Of Total
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The total population in the United States was estimated at 341.2 million people in 2024, according to the latest census figures and projections from Trading Economics. This dataset provides - United States Population - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Actual value and historical data chart for World Population Total
Facebook
TwitterWhen evaluating the real-world treatment effect, the analysis based on randomized clinical trials (RCTs) often introduces generalizability bias due to the difference in risk factors between the trial participants and the real-world patient population. This problem of lack of generalizability associated with the RCT-only analysis can be addressed by leveraging observational studies with large sample sizes that are representative of the real-world population. A set of novel statistical methods, termed “genRCT”, for improving the generalizability of the trial has been developed using calibration weighting, which enforces the covariates balance between the RCT and observational study. This paper aims to review statistical methods for generalizing the RCT findings by harnessing information from large observational studies that represent real-world patients. Specifically, we discuss the choices of data sources and variables to meet key theoretical assumptions and principles. We introduce and compare estimation methods for continuous, binary, and survival endpoints. We showcase the use of the R package genRCT through a case study that estimates the average treatment effect of adjuvant chemotherapy for the stage 1B non-small cell lung patients represented by a large cancer registry.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Actual value and historical data chart for World Population Ages 15 64 Percent Of Total
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This Dataset provides comprehensive demographic information on global populations from 1950 to the present. It offers insights into various aspects of population dynamics, including population counts, gender ratios, birth and death rates, life expectancy, and migration patterns.
SortOrder: Numeric identifier for sorting.
LocID: Location identifier.
Notes: Additional notes or comments (blank in this dataset).
ISO3_code: ISO 3-character country code.
ISO2_code: ISO 2-character country code.
SDMX_code: Statistical Data and Metadata Exchange code.
LocTypeID: Location type identifier.
LocTypeName: Location type name.
ParentID: Identifier for the parent location.
Location: Name of the location.
VarID: Identifier for the variant.
Variant: Type of population variant.
Time: Year or time period.
TPopulation1Jan: Total population on January 1st.
TPopulation1July: Total population on July 1st.
TPopulationMale1July: Total male population on July 1st.
TPopulationFemale1July: Total female population on July 1st.
PopDensity: Population density (people per square kilometer).
PopSexRatio: Population sex ratio (male/female).
MedianAgePop: Median age of the population.
NatChange: Natural change in population.
NatChangeRT: Natural change rate (per 1,000 people).
PopChange: Population change.
PopGrowthRate: Population growth rate (percentage).
DoublingTime: Time for population to double (in years).
Births: Total number of births.
Births1519: Births to mothers aged 15-19.
CBR: Crude birth rate (per 1,000 people).
TFR: Total fertility rate (average number of children per woman).
NRR: Net reproduction rate.
MAC: Mean age at childbearing.
SRB: Sex ratio at birth (male/female).
Deaths: Total number of deaths.
DeathsMale: Total male deaths.
DeathsFemale: Total female deaths.
CDR: Crude death rate (per 1,000 people).
LEx: Life expectancy at birth.
LExMale: Life expectancy for males at birth.
LExFemale: Life expectancy for females at birth.
LE15: Life expectancy at age 15.
LE15Male: Life expectancy for males at age 15.
LE15Female: Life expectancy for females at age 15.
LE65: Life expectancy at age 65.
LE65Male: Life expectancy for males at age 65.
LE65Female: Life expectancy for females at age 65.
LE80: Life expectancy at age 80.
LE80Male: Life expectancy for males at age 80.
LE80Female: Life expectancy for females at age 80.
InfantDeaths: Number of infant deaths.
IMR: Infant mortality rate (per 1,000 live births).
LBsurvivingAge1: Children surviving to age 1.
Under5Deaths: Number of deaths under age 5.
NetMigrations: Net migration rate (per 1,000 people).
CNMR: Crude net migration rate.
Please upvote and show your support if you find this dataset valuable for your research or analysis. Your feedback and contributions help make this dataset more accessible to the Kaggle community. Thank you!
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Actual value and historical data chart for World Population Male Percent Of Total
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Actual value and historical data chart for World Population Ages 0 14 Percent Of Total
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
This Dataset contains details of World Population by country. According to the worldometer, the current population of the world is 8.2 billion people. Highest populated country is India followed by China and USA.
Attribute Information
Acknowledgements
https://www.worldometers.info/world-population/population-by-country/
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Population density (people per sq. km of land area) in World was reported at 61.6 sq. Km in 2022, according to the World Bank collection of development indicators, compiled from officially recognized sources. World - Population density (people per sq. km) - actual values, historical data, forecasts and projections were sourced from the World Bank on November of 2025.
Facebook
TwitterObjectives: Characterize patterns of weight change among subjects with obesity. Methods: A retrospective observational longitudinal study of subjects with obesity was conducted using the General Electric Centricity electronic medical record database. Subjects who were ≥18 years old with BMI ≥30 kg/m2 (first defining index BMI), had no medical conditions associated with unintentional weight loss, and had ≥4 BMI measurements/year for ≥2.5 years were included and categorized into groups (stable weight: within <5% of index BMI; modest weight loss: ≥5 to <10% of index BMI lost; moderate weight loss: ≥10 to <15% of index BMI lost; and high weight loss: ≥15% of index BMI lost) based on weight change during 6 months following index. No interventions were considered. Patterns of weight change were then assessed for 2 years. Results: A total of 177,743 subjects were included: 85.1% of subjects were in the stable weight, 9.3% in the modest, 2.3% in the moderate, and 3.3% in the high weight loss groups. The proportion of subjects who maintained or continued to lose weight decreased over the 2 year observation period; 11% of those with high weight loss continued to lose weight and 19% maintained their weight loss. This group had the lowest percentage of subjects who regained ≥50% of lost weight and the lowest proportion of subjects with weight cycling (defined as not continuously losing, gaining, or maintaining weight throughout the 2 year observation period relative to its beginning). This trend persisted in subgroups with class II–III obesity, pre-diabetes, and type 2 diabetes. Conclusion: Weight cycling and regain were commonly observed. Subjects losing the most weight during the initial period were more likely to continue losing weight.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Summary
This metadata record provides details of the data supporting the claims of the related article: “Genomic context of NTRK1/2/3 fusion-positive tumours from a large real-world population”.
The related study aimed to interrogate a large real-world database of comprehensive genomic profiling data to describe the genomic landscape and prevalence of neurotrophic tropomyosin receptor kinase (NTRK) gene fusions.
Subject of data: Homo sapiens
Sample size: Data from 295,676 de-identified, consented-for-research cases between January 2013 and December 2019 from 75 different solid tumour types were profiled. Sample size for the clinical trials population was the efficacy-evaluable population, i.e., all patients who had received at least one dose of entrectinib and had at least 6 months of follow up.
Recruitment: This is a secondary analysis of data from the clinical trials listed below. Full methods have been published previously in: https://doi.org/10.1016/s1470-2045(19)30691-6
Trial registration number: ALKA-372-001 [EudraCT 2012-000148-88], STARTRK-1 [NCT02097810], STARTRK-2 [NCT02568267]
Data access
The data were generated and analysed under the auspices of Roche, which is a member of the Vivli Center for global clinical research data. Data access conditions are described at https://vivli.org/ourmember/roche/. To request access to individual patient-level data from the clinical trials, first locate the clinical trial in Vivli (https://search.vivli.org/ requires sign up and log in) using the trial registration number (given above), then click the ‘Request Study’ button and follow the instructions. In the event that you cannot see a specific study in the Roche list, an Enquiry Form can be submitted to confirm the availability of the specific study. To request access to related clinical study documents (eg: protocols, CSR, safety reports), please use Roche’s Clinical study documents request form: https://www.roche.com/research_and_development/who_we_are_how_we_work/research_and_clinical_trials/our_commitment_to_data_sharing/clinical_study_documents_request_form.htm.Patient-level data which were derived from the Foundation Research dataset and used in the related study cannot be shared as they contain patient genomic information that, depending on the prevalence of the identified alterations, could be used to identify individuals.
To maximise transparency and provide the most thorough information without compromising patients’ personal information, the authors have created a large number of supplementary files and made them openly available as part of this figshare data record. Data underlying Supplementary Figure 2 are in the file ‘data_underlying_supplementary_figure_2.xlsx’. Data underlying Supplementary Tables 1–4, 6–12, and 14 are in the file ‘FMI NTRK manuscript_Supplementary Tables_17May2021.xlsx’.
Corresponding author(s) for this study
Dr C. Benedikt Westphalen, Comprehensive Cancer Center Munich & Department of Medicine III, University Hospital, LMU Munich, Marchioninistr. 15, 81377, Munich, Germany. Tel: +49 (089) 4400-75250; E-mail: cwestpha@med.lmu.de
Study approval
Approval was obtained from the Western Institutional Review Board (Protocol No. 20152817). Written consent was obtained to use the de-identified patient samples for research.
Facebook
TwitterBackgroundHeart failure (HF) is frequent and its prevalence is increasing. We aimed to evaluate the epidemiologic features of HF patients, the 1-year follow-up outcomes and the independent predictors of those outcomes at a population level.Methods and resultsPopulation-based longitudinal study including all prevalent HF cases in Catalonia (Spain) on December 31st, 2012. Patients were divided in 3 groups: patients without a previous HF hospitalization, patients with a remote (>1 year) HF hospitalization and patients with a recent (<1 year) HF admission. We analyzed 1year all-cause and HF hospitalizations, and all-cause mortality. Logistic regression was used to identify the independent predictors of each of those outcomes. A total of 88,195 patients were included. Mean age was 77 years, 55% were women. Comorbidities were frequent. Fourteen percent of patients had never been hospitalized, 71% had a remote HF hospitalization and 15% a recent hospitalization. At 1-year follow-up, all-cause and HF hospitalization were 53% and 8.8%, respectively. One-year all-cause mortality rate was 14%, and was higher in patients with a recent HF hospitalization (24%). The presence of diabetes mellitus, atrial fibrillation or chronic kidney disease was independently associated with all-cause and HF hospitalization and all-cause mortality. Hospital admissions and emergency department visits the previous year were also found to be independently associated with the three study outcomes.ConclusionsOutcomes are different depending on the HF population studied. Some comorbidity, an all-cause hospitalization or emergency department visit the previous year were associated with a worse outcome.
Facebook
TwitterObjective: Case-control study designs are commonly used in retrospective analyses of Real-World Evidence (RWE). Due to the increasingly wide availability of RWE, it can be difficult to determine whether findings are robust or the result of testing multiple hypotheses.
Materials and Methods: We investigate the potential effects of modifying cohort definitions in a case-control association study between depression and Type 2 Diabetes Mellitus (T2D). We used a large (>75 million individuals) de-identified administrative claims database to observe the effects of minor changes to the requirements of glucose and hemoglobin A1c tests in the control group.
Results: We found that small permutations to the criteria used to define the control population result in significant shifts in both the demographic structure of the identified cohort as well as the odds ratio of association. These differences remain present when testing against age and sex-matched controls.
Discussion: Analyses of RWE ...
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Clinical studies, especially randomized controlled trials, are essential for generating evidence for clinical practice. However, generalizability is a long-standing concern when applying trial results to real-world patients. Generalizability assessment is thus important, nevertheless, not consistently practiced. We performed a systematic scoping review to understand the practice of generalizability assessment. We identified 187 relevant papers and systematically organized these studies in a taxonomy with three dimensions: (1) data availability (i.e., before or after trial [a priori vs a posteriori generalizability]), (2) result outputs (i.e., score vs non-score), and (3) populations of interest. We further reported disease areas, underrepresented subgroups, and types of data used to profile target populations. We observed an increasing trend of generalizability assessments, but less than 30% of studies reported positive generalizability results. As a priori generalizability can be assessed using only study design information (primarily eligibility criteria), it gives investigators a golden opportunity to adjust the study design before the trial starts. Nevertheless, less than 40% of the studies in our review assessed a priori generalizability. With the wide adoption of electronic health records systems, rich real-world patient databases are increasingly available for generalizability assessment; however, informatics tools are lacking to support the adoption of generalizability assessment practice.
Methods We performed the literature search over the following 4 databases: MEDLINE, Cochrane, PychINFO, and CINAHL. Following the Institute of Medicine’s standards for systematic review and Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA), we conducted the scoping review in the following six steps: 1) gaining an initial understanding about clinical trial generalizability assessment, population representativeness, internal validity, and external validity, 2) identifying relevant keywords, 3) formulating four search queries to identify relevant articles in the 4 databases, 4) screening the articles by reviewing titles and abstracts, 5) reviewing articles’ full-text to further filter out irrelevant ones based on inclusion and exclusion criteria, and 6) coding the articles for data extraction.
Study selection and screening process
We used an iterative process to identify and refine the search keywords and search strategies. We identified 5,352 articles as of February 2019 from MEDLINE, CINAHL, PychINFO, and Cochrane. After removing duplicates, 3,569 records were assessed for relevancy by two researchers (ZH and XT) through reviewing the titles and abstracts against the inclusion and exclusion criteria. Conflicts were resolved with a third reviewer (JB). During the screening process, we also iteratively refined the inclusion and exclusion criteria. Out of the 3,569 articles, 3,275 were excluded through the title and abstract screening process. Subsequently, we reviewed the full texts of 294 articles, among which 106 articles were further excluded based on the exclusion criteria. The inter-rater reliability of the full-text review between the two annotators is 0.901 (i.e., Cohen’s kappa, p < .001). 187 articles were included in the final scoping review.
Data extraction and reporting
We coded and extracted data from the 187 eligible articles according to the following aspects: (1) whether the study performed an a priori generalizability assessment or a posteriori generalizability assessment or both; (2) the compared populations and the conclusions of the assessment; (3) the outputs of the results (e.g., generalizability scores, descriptive comparison); (4) whether the study focused on a specific disease. If so, we extracted the disease and disease category; (5) whether the study focused on a particular population subgroup (e.g., elderly). If so, we extracted the specific population subgroup; (6) the type(s) of the real-world patient data used to profile the target population (i.e., trial data, hospital data, regional data, national data, and international data). Note that trial data can also be regional, national, or even international, depending on the scale of the trial. Regardless, we considered them in the category of “trial data” as the study population of a trial is typically small compared to observational cohorts or real-world data. For observational cohorts or real-world data (e.g., EHRs), we extracted the specific scale of the database (i.e., regional, national, and international). For the studies that compared the characteristics of different populations to indicate generalizability issues, we further coded the populations that were compared (e.g., enrolled patients, eligible patients, general population, ineligible patients), and the types of characteristics that were compared (i.e., demographic information, clinical attributes and comorbidities, treatment outcomes, and adverse events). We then used Fisher’s exact test to assess whether there is a difference in the types of characteristics compared between a priori and a posteriori generalizability assessment studies.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Actual value and historical data chart for World Population Growth Annual Percent
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Time series are a critical component of ecological analysis, used to track changes in biotic and abiotic variables. Information can be extracted from the properties of time series for tasks such as classification (e.g. assigning species to individual bird calls); clustering (e.g. clustering similar responses in population dynamics to abrupt changes in the environment or management interventions); prediction (e.g. accuracy of model predictions to original time series data); and anomaly detection (e.g. detecting possible catastrophic events from population time series). These common tasks in ecological research rely on the notion of (dis-) similarity, which can be determined using distance measures. A plethora of distance measures have been described, predominantly in the computer and information sciences, but many have not been introduced to ecologists. Furthermore, little is known about how to select appropriate distance measures for time-series-related tasks. Therefore, many potential applications remain unexplored. Here we describe 16 properties of distance measures that are likely to be of importance to a variety of ecological questions involving time series. We then test 42 distance measures for each property and use the results to develop an objective method to select appropriate distance measures for any task and ecological dataset. We demonstrate our selection method by applying it to a set of real-world data on breeding bird populations in the UK and discuss other potential applications for distance measures, along with associated technical issues common in ecology. Our real-world population trends exhibit a common challenge for time series comparisons: a high level of stochasticity. We demonstrate two different ways of overcoming this challenge, first by selecting distance measures with properties that make them well-suited to comparing noisy time series, and second by applying a smoothing algorithm before selecting appropriate distance measures. In both cases, the distance measures chosen through our selection method are not only fit-for-purpose but are consistent in their rankings of the population trends. The results of our study should lead to an improved understanding of, and greater scope for, the use of distance measures for comparing ecological time series, and help us answer new ecological questions. Methods Distance measure test results were produced using R and can be replicated using scripts available on GitHub at https://github.com/shawndove/Trend_compare. Detailed information on wading bird trends can be found in Jellesmark et al. (2021) below. Jellesmark, S., Ausden, M., Blackburn, T. M., Gregory, R. D., Hoffmann, M., Massimino, D., McRae, L., & Visconti, P. (2021). A counterfactual approach to measure the impact of wet grassland conservation on U.K. breeding bird populations. Conservation Biology, 35(5), 1575–1585. https://doi.org/10.1111/cobi.13692
Facebook
TwitterThe worldwide Jewish population experienced a marked decline during the 20th century due to the murder of six million Jewish people during the Holocaust, the genocide perpetrated by Nazi Germany and its allies during World War II. While there were almost 17 million Jewish people alive before the Holocaust, or the Shoah as it is known in Hebrew, after the war this was only around 11.5 million people. By using several different fertility scenarios, demographers have been able to reconstruct what the Jewish population would be in modern times if the genocide of Jewish people had not happened. In scenarios where there was a low or very low fertility rate, the Jewish population in 2000 would be in the range of 26 to 33 million people, double what it was in reality. In a scenario where the population growth rate was the same as that observed in the Jewish population after WWII, which was extremely low, the global Jewish population would have risen to over 20 million people.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Have you ever wondered how the population landscape of our planet looks in 2025? This dataset brings together the latest population statistics for 233 countries and territories, carefully collected from Worldometers.info — one of the most trusted global data sources.
📊 It reveals how countries are growing, shrinking, and evolving demographically. From population density to fertility rate, from migration trends to urbanization, every number tells a story about humanity’s future.
🌆 You can explore which nations are rapidly expanding, which are aging, and how urban populations are transforming global living patterns. This dataset includes key metrics like yearly population change, net migration, land area, fertility rate, and each country’s share of the world population.
🧠 Ideal for data analysis, visualization, and machine learning, it can be used to study global trends, forecast population growth, or build engaging dashboards in Python, R, or Tableau. It’s also perfect for students and researchers exploring geography, demographics, or development studies.
📈 Whether you’re analyzing Asia’s population boom, Europe’s aging curve, or Africa’s youthful surge — this dataset gives you a complete view of the world’s demographic balance in 2025. 🌎 With 233 rows and 12 insightful columns, it’s ready for your next EDA, visualization, or predictive modeling project.
🚀 Dive in, explore the data, and uncover what the world looks like — one country at a time.