60 datasets found
  1. o

    Synthetic Colorectal Cancer Global Dataset

    • opendatabay.com
    .undefined
    Updated Jun 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Opendatabay Labs (2025). Synthetic Colorectal Cancer Global Dataset [Dataset]. https://www.opendatabay.com/data/synthetic/ae2aba99-491d-45a1-a99e-7be14927f4af
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jun 28, 2025
    Dataset authored and provided by
    Opendatabay Labs
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Patient Health Records & Digital Health
    Description

    The Synthetic Colorectal Cancer Global Dataset is a fully anonymised, high-dimensional synthetic dataset designed for global cancer research, predictive modelling, and educational use. It encompasses demographic, clinical, lifestyle, genetic, and healthcare access factors relevant to colorectal cancer incidence, outcomes, and survivability.

    Dataset Features

    • Patient_ID: Unique identifier for each patient.
    • Country: Patient's country of residence.
    • Age: Age at diagnosis (in years).
    • Gender: Biological sex of the patient (Male/Female/Other).
    • Cancer_Stage: Stage of colorectal cancer at diagnosis (e.g., Stage I–IV).
    • Tumor_Size_mm: Size of the tumor in millimeters.
    • Family_History: Presence of colorectal cancer in family history (True/False).
    • Smoking_History: Smoking behavior or history (e.g., Current, Former, Never).
    • Alcohol_Consumption: Level of alcohol consumption (e.g., High, Moderate, None).
    • Obesity_BMI: BMI classification related to obesity.
    • Diet_Risk: Diet-related cancer risk (e.g., High Fat, Low Fiber).
    • Physical_Activity: Level of physical activity (e.g., Sedentary, Active).
    • Diabetes: Diabetes diagnosis (True/False).
    • Inflammatory_Bowel_Disease: Presence of IBD (True/False).
    • Genetic_Mutation: Genetic mutations relevant to colorectal cancer (e.g., APC, KRAS).
    • Screening_History: History of cancer screenings (True/False).
    • Early_Detection: Whether cancer was detected early (True/False).
    • Treatment_Type: Primary treatment type (e.g., Surgery, Chemotherapy, Radiation).
    • Survival_5_years: 5-year survival status (True/False).
    • Mortality: Mortality outcome (Alive/Deceased).
    • Healthcare_Costs: Estimated treatment costs (in USD).
    • Incidence_Rate_per_100K: Country-level incidence rate per 100,000 people.
    • Mortality_Rate_per_100K: Country-level mortality rate per 100,000 people.
    • Urban_or_Rural: Patient's living area (Urban/Rural).
    • Economic_Classification: Country's economic level (e.g., Low, Middle, High income).
    • Healthcare_Access: Access level to healthcare services (e.g., Good, Limited).
    • Insurance_Status: Insurance coverage status (Insured/Uninsured).
    • Survival_Prediction: Model-derived survival prediction (probability or binary).

    Distribution

    https://storage.googleapis.com/opendatabay_public/ae2aba99-491d-45a1-a99e-7be14927f4af/299af3fa2502_patient_analysis_plots.png" alt="Synthetic Colorectal Cancer Global Data Distribution.png">

    Usage

    This dataset can be used for:

    • Global Cancer Research: Analyze how clinical, lifestyle, and socioeconomic factors affect colorectal cancer outcomes worldwide.
    • Predictive Modeling: Develop models to estimate survival probability or treatment outcomes.
    • Healthcare Policy Analysis: Study disparities in healthcare access and outcomes across countries.
    • Educational Use: Support training in epidemiology, oncology, public health, and machine learning.

    Coverage

    The dataset includes 100% synthetic yet clinically plausible records from diverse countries and demographic groups. It is anonymized and modeled to reflect real-world variability in risk factors, diagnosis stages, treatment, and survival without compromising patient privacy.

    License

    CC0 (Public Domain)

    Who Can Use It

    • Epidemiologists and Medical Researchers: To explore global patterns in colorectal cancer.
    • Public Health Experts and Policymakers: For assessing equity in healthcare access and cancer outcomes.
    • Data Scientists and Educators: As a rich dataset for teaching data analysis, classification, regression, and health informatics.
  2. Breast Cancer Prediction Dataset

    • kaggle.com
    Updated Sep 26, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Merishna Singh Suwal (2018). Breast Cancer Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/merishnasuwal/breast-cancer-prediction-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 26, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Merishna Singh Suwal
    Description

    Worldwide, breast cancer is the most common type of cancer in women and the second highest in terms of mortality rates.Diagnosis of breast cancer is performed when an abnormal lump is found (from self-examination or x-ray) or a tiny speck of calcium is seen (on an x-ray). After a suspicious lump is found, the doctor will conduct a diagnosis to determine whether it is cancerous and, if so, whether it has spread to other parts of the body.

    This breast cancer dataset was obtained from the University of Wisconsin Hospitals, Madison from Dr. William H. Wolberg.

  3. Colorectal Cancer Global Dataset & Predictions

    • kaggle.com
    Updated Feb 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ankush Panday (2025). Colorectal Cancer Global Dataset & Predictions [Dataset]. http://doi.org/10.34740/kaggle/dsv/10873495
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 27, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ankush Panday
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset contains real-world information about colorectal cancer cases from different countries. It includes patient demographics, lifestyle risks, medical history, cancer stage, treatment types, survival chances, and healthcare costs. The dataset follows global trends in colorectal cancer incidence, mortality, and prevention.

    Use this dataset to build models for cancer prediction, survival analysis, healthcare cost estimation, and disease risk factors.

    Dataset Structure Each row represents an individual case, and the columns include:

    Patient_ID (Unique identifier) Country (Based on incidence distribution) Age (Following colorectal cancer age trends) Gender (M/F, considering men have 30-40% higher risk) Cancer_Stage (Localized, Regional, Metastatic) Tumor_Size_mm (Randomized within medical limits) Family_History (Yes/No) Smoking_History (Yes/No) Alcohol_Consumption (Yes/No) Obesity_BMI (Normal/Overweight/Obese) Diet_Risk (Low/Moderate/High) Physical_Activity (Low/Moderate/High) Diabetes (Yes/No) Inflammatory_Bowel_Disease (Yes/No) Genetic_Mutation (Yes/No) Screening_History (Regular/Irregular/Never) Early_Detection (Yes/No) Treatment_Type (Surgery/Chemotherapy/Radiotherapy/Combination) Survival_5_years (Yes/No) Mortality (Yes/No) Healthcare_Costs (Country-dependent, $25K-$100K+) Incidence_Rate_per_100K (Country-level prevalence) Mortality_Rate_per_100K (Country-level mortality) Urban_or_Rural (Urban/Rural) Economic_Classification (Developed/Developing) Healthcare_Access (Low/Moderate/High) Insurance_Status (Insured/Uninsured) Survival_Prediction (Yes/No, based on factors)

  4. p

    Cervical Cancer Risk Classification - Dataset - CKAN

    • data.poltekkes-smg.ac.id
    Updated Oct 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Cervical Cancer Risk Classification - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/cervical-cancer-risk-classification
    Explore at:
    Dataset updated
    Oct 7, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Cervical Cancer Risk Factors for Biopsy: This Dataset is Obtained from UCI Repository and kindly acknowledged! This file contains a List of Risk Factors for Cervical Cancer leading to a Biopsy Examination! About 11,000 new cases of invasive cervical cancer are diagnosed each year in the U.S. However, the number of new cervical cancer cases has been declining steadily over the past decades. Although it is the most preventable type of cancer, each year cervical cancer kills about 4,000 women in the U.S. and about 300,000 women worldwide. In the United States, cervical cancer mortality rates plunged by 74% from 1955 - 1992 thanks to increased screening and early detection with the Pap test. AGE Fifty percent of cervical cancer diagnoses occur in women ages 35 - 54, and about 20% occur in women over 65 years of age. The median age of diagnosis is 48 years. About 15% of women develop cervical cancer between the ages of 20 - 30. Cervical cancer is extremely rare in women younger than age 20. However, many young women become infected with multiple types of human papilloma virus, which then can increase their risk of getting cervical cancer in the future. Young women with early abnormal changes who do not have regular examinations are at high risk for localized cancer by the time they are age 40, and for invasive cancer by age 50. SOCIOECONOMIC AND ETHNIC FACTORS Although the rate of cervical cancer has declined among both Caucasian and African-American women over the past decades, it remains much more prevalent in African-Americans -- whose death rates are twice as high as Caucasian women. Hispanic American women have more than twice the risk of invasive cervical cancer as Caucasian women, also due to a lower rate of screening. These differences, however, are almost certainly due to social and economic differences. Numerous studies report that high poverty levels are linked with low screening rates. In addition, lack of health insurance, limited transportation, and language difficulties hinder a poor woman’s access to screening services. HIGH SEXUAL ACTIVITY Human papilloma virus (HPV) is the main risk factor for cervical cancer. In adults, the most important risk factor for HPV is sexual activity with an infected person. Women most at risk for cervical cancer are those with a history of multiple sexual partners, sexual intercourse at age 17 years or younger, or both. A woman who has never been sexually active has a very low risk for developing cervical cancer. Sexual activity with multiple partners increases the likelihood of many other sexually transmitted infections (chlamydia, gonorrhea, syphilis).Studies have found an association between chlamydia and cervical cancer risk, including the possibility that chlamydia may prolong HPV infection. FAMILY HISTORY Women have a higher risk of cervical cancer if they have a first-degree relative (mother, sister) who has had cervical cancer. USE OF ORAL CONTRACEPTIVES Studies have reported a strong association between cervical cancer and long-term use of oral contraception (OC). Women who take birth control pills for more than 5 - 10 years appear to have a much higher risk HPV infection (up to four times higher) than those who do not use OCs. (Women taking OCs for fewer than 5 years do not have a significantly higher risk.) The reasons for this risk from OC use are not entirely clear. Women who use OCs may be less likely to use a diaphragm, condoms, or other methods that offer some protection against sexual transmitted diseases, including HPV. Some research also suggests that the hormones in OCs might help the virus enter the genetic material of cervical cells. HAVING MANY CHILDREN Studies indicate that having many children increases the risk for developing cervical cancer, particularly in women infected with HPV. SMOKING Smoking is associated with a higher risk for precancerous changes (dysplasia) in the cervix and for progression to invasive cervical cancer, especially for women infected with HPV. IMMUNOSUPPRESSION Women with weak immune systems, (such as those with HIV / AIDS), are more susceptible to acquiring HPV. Immunocompromised patients are also at higher risk for having cervical precancer develop rapidly into invasive cancer. DIETHYLSTILBESTROL (DES) From 1938 - 1971, diethylstilbestrol (DES), an estrogen-related drug, was widely prescribed to pregnant women to help prevent miscarriages. The daughters of these women face a higher risk for cervical cancer. DES is no longer prsecribed.

  5. f

    Table 1_Revising cancer incidence in a Central European country: a Hungarian...

    • figshare.com
    • frontiersin.figshare.com
    xlsx
    Updated Oct 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zoltán Kiss; Tamás G. Szabó; Csaba Polgár; Zsolt Horváth; Péter Nagy; Ibolya Fábián; Valéria Kovács; György Surján; Zsófia Barcza; István Kenessey; András Wéber; István Wittmann; Gergő Attila Molnár; Eszter Gyöngyösi; Angéla Benedek; Eugenia Karamousouli; Zsolt Abonyi-Tóth; Renáta Bertókné Tamás; Diána Viktória Fürtős; Krisztina Bogos; Judit Moldvay; Gabriella Gálffy; Lilla Tamási; Veronika Müller; Zoárd Tibor Krasznai; Gyula Ostoros; Zsolt Pápai-Székely; Anikó Maráz; Gabriella Branyiczkiné Géczy; Lászlóné Hilbert; Láśzló Tamás Berki; György Rokszin; Zoltán Vokó (2024). Table 1_Revising cancer incidence in a Central European country: a Hungarian nationwide study between 2011–2019 based on a health insurance fund database.xlsx [Dataset]. http://doi.org/10.3389/fonc.2024.1393132.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 1, 2024
    Dataset provided by
    Frontiers
    Authors
    Zoltán Kiss; Tamás G. Szabó; Csaba Polgár; Zsolt Horváth; Péter Nagy; Ibolya Fábián; Valéria Kovács; György Surján; Zsófia Barcza; István Kenessey; András Wéber; István Wittmann; Gergő Attila Molnár; Eszter Gyöngyösi; Angéla Benedek; Eugenia Karamousouli; Zsolt Abonyi-Tóth; Renáta Bertókné Tamás; Diána Viktória Fürtős; Krisztina Bogos; Judit Moldvay; Gabriella Gálffy; Lilla Tamási; Veronika Müller; Zoárd Tibor Krasznai; Gyula Ostoros; Zsolt Pápai-Székely; Anikó Maráz; Gabriella Branyiczkiné Géczy; Lászlóné Hilbert; Láśzló Tamás Berki; György Rokszin; Zoltán Vokó
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Hungary, Central Europe
    Description

    BackgroundThe nationwide HUN-CANCER EPI study examined cancer incidence and mortality rates in Hungary from 2011 to 2019.MethodsUsing data from the National Health Insurance Fund (NHIF) and Hungarian Central Statistical Office (HCSO), our retrospective study analyzed newly diagnosed malignancies between Jan 1, 2011, and Dec 31, 2019. Age-standardized incidence and mortality rates were calculated for all and for different tumor types using both the 1976 and 2013 European Standard Populations (ESP).FindingsThe number of newly diagnosed cancer cases decreased from 60,554 to 56,675 between 2011–2019. Age-standardized incidence rates were much lower in 2018, than previously estimated (475.5 vs. 580.5/100,000 person-years [PYs] in males and 383.6 vs. 438.5/100,000 PYs in females; ESP 1976). All-site cancer incidence showed a mean annual decrease of 1.9% (95% CI: 2.4%-1.4%) in men and 1.0% (95% CI:1.42%-0.66%) in women, parallel to mortality trends (-1.6% in males and -0.6% in females; ESP 2013). In 2018, the highest age-standardized incidence rates were found for lung (88.3), colorectal (82.2), and prostate cancer (62.3) in men, and breast (104.6), lung (47.7), and colorectal cancer (45.8) in women. The most significant decreases in incidence rates were observed for stomach (4.7%), laryngeal (4.4%), and gallbladder cancers (3.5%), with parallel decreases in mortality rates (3.9%, 2.7% and 3.2%, respectively).InterpretationWe found a lower incidence of newly diagnosed cancer cases for Hungary compared to previous estimates, and decreasing trends in cancer incidence and mortality, in line with global findings and the declining prevalence of smoking.

  6. p

    Breast Cancer Prediction Dataset - Dataset - CKAN

    • data.poltekkes-smg.ac.id
    Updated Oct 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Breast Cancer Prediction Dataset - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/breast-cancer-prediction-dataset
    Explore at:
    Dataset updated
    Oct 7, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Worldwide, breast cancer is the most common type of cancer in women and the second highest in terms of mortality rates.Diagnosis of breast cancer is performed when an abnormal lump is found (from self-examination or x-ray) or a tiny speck of calcium is seen (on an x-ray). After a suspicious lump is found, the doctor will conduct a diagnosis to determine whether it is cancerous and, if so, whether it has spread to other parts of the body. This breast cancer dataset was obtained from the University of Wisconsin Hospitals, Madison from Dr. William H. Wolberg.

  7. Oral Cancer Prediction Dataset

    • kaggle.com
    Updated Mar 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ankush Panday (2025). Oral Cancer Prediction Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/10942559
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 6, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ankush Panday
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset provides a detailed and structured overview of oral cancer cases worldwide. It includes key risk factors, symptoms, cancer staging, survival rates, treatment approaches, and economic burden to facilitate research and prediction modeling. The dataset is based on real-world oral cancer statistics, aligning with global health reports and studies.

    Key Highlights: Covers high-incidence regions (India, Pakistan, Sri Lanka, Taiwan) and emerging trends in Western nations. Includes tobacco, alcohol, HPV infection, betel quid use, and dietary factors as primary risk factors. Captures economic burden (treatment costs, workdays lost) to assess the financial impact of oral cancer. Provides cancer staging, survival rates, and early diagnosis indicators for better treatment predictions. This dataset is valuable for medical professionals, researchers, data scientists, and policymakers aiming to develop early detection models, assess regional disparities, and improve cancer prevention strategies.

    Columns Overview ID – Unique identifier Country – Country name Age – Age of the individual Gender – Male/Female Tobacco Use – Yes/No Alcohol Consumption – Yes/No HPV Infection – Yes/No Betel Quid Use – Yes/No Chronic Sun Exposure – Yes/No Poor Oral Hygiene – Yes/No Diet (Fruits & Vegetables Intake) – Low/Moderate/High Family History of Cancer – Yes/No Compromised Immune System – Yes/No Oral Lesions – Yes/No Unexplained Bleeding – Yes/No Difficulty Swallowing – Yes/No White or Red Patches in Mouth – Yes/No Tumor Size (cm) – Numerical value Cancer Stage – 0 (No Cancer), 1, 2, 3, 4 Treatment Type – Surgery/Radiation/Chemotherapy/Targeted Therapy/No Treatment Survival Rate (5-Year, %) Cost of Treatment (USD) Economic Burden (Lost Workdays per Year) Early Diagnosis (Yes/No) Oral Cancer (Diagnosis) – Yes/No (Target Variable)

  8. Cancer types causing Death

    • kaggle.com
    Updated Apr 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shuvo Kumar Basak-4004.o (2025). Cancer types causing Death [Dataset]. http://doi.org/10.34740/kaggle/dsv/11587862
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 27, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Shuvo Kumar Basak-4004.o
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Source: https://ourworldindata.org/cancer

    The dataset titled "Cancer Types Causing Death," sourced from Our World in Data, provides a comprehensive overview of global cancer mortality trends. According to the dataset, lung cancer leads as the most fatal cancer worldwide, with approximately 1.8 million deaths in 2022, accounting for 18.7% of all cancer-related fatalities . Following lung cancer, colorectal cancer ranks second, causing about 900,000 deaths (9.3%), while liver cancer and breast cancer account for 760,000 (7.8%) and 670,000 (6.9%) deaths, respectively. Stomach cancer also remains a significant cause of death, with 660,000 fatalities (6.8%) .

    The dataset highlights that lung cancer's prevalence is closely linked to tobacco use, particularly in regions like Asia. In contrast, breast cancer predominantly affects women, while colorectal cancer impacts both genders equally. Notably, the dataset indicates a decline in age-standardized death rates for certain cancers, such as stomach cancer, due to improved hygiene, sanitation, and antibiotic treatments targeting Helicobacter pylori infections . Our World in Data

    Additionally, the dataset underscores the global disparity in cancer mortality, with approximately 70% of cancer deaths occurring in low- and middle-income countries . This disparity is attributed to factors like limited access to early detection, treatment, and preventive measures. The dataset serves as a valuable resource for understanding the global burden of cancer and the need for targeted public health interventions. World Health Organization

  9. D

    Lung Cancer Diagnostic Tests Market Report | Global Forecast From 2025 To...

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Lung Cancer Diagnostic Tests Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-lung-cancer-diagnostic-tests-market
    Explore at:
    pdf, pptx, csvAvailable download formats
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Lung Cancer Diagnostic Tests Market Outlook



    The lung cancer diagnostic tests market size was valued at USD 2.5 billion in 2023 and is projected to reach USD 6.1 billion by 2032, growing at a Compound Annual Growth Rate (CAGR) of 10.5% during the forecast period. This substantial growth can be attributed to the rising prevalence of lung cancer globally, advancements in diagnostic technologies, and increasing awareness regarding early detection and treatment of lung cancer. The growing aging population and the high incidence of smoking, which is a leading cause of lung cancer, further propel the demand for diagnostic tests.



    The increasing prevalence of lung cancer is one of the primary drivers of market growth. Lung cancer remains the leading cause of cancer-related deaths worldwide, necessitating the development of more accurate and early diagnostic methods. With advancements in medical technology, such as molecular diagnostics and non-invasive imaging techniques, the accuracy and efficiency of lung cancer diagnosis have significantly improved. These innovations not only enhance the detection rate but also facilitate personalized treatment plans, thereby improving patient outcomes.



    Furthermore, government initiatives and funding for cancer research play a crucial role in market expansion. Many countries are investing heavily in cancer research, leading to the development of new diagnostic tools and techniques. For instance, organizations such as the National Cancer Institute (NCI) in the United States provide substantial grants for lung cancer research, fostering innovations in diagnostics. In addition, public awareness campaigns and screening programs conducted by healthcare organizations and governments encourage early diagnosis, which is vital for successful treatment and survival rates.



    The integration of artificial intelligence (AI) and machine learning in diagnostic tools is another significant factor contributing to market growth. AI algorithms can analyze medical images with high precision, aiding radiologists in identifying lung cancer at earlier stages. Moreover, AI-driven software can evaluate large datasets from genetic and molecular tests, providing insights into the most effective treatment options based on individual patient profiles. This technological advancement not only enhances the accuracy of diagnostics but also reduces the time required for analysis, thereby increasing the efficiency of healthcare services.



    The EGFR Mutation Test is a pivotal advancement in the realm of lung cancer diagnostics, offering a more personalized approach to treatment. This test specifically identifies mutations in the Epidermal Growth Factor Receptor (EGFR) gene, which are often present in non-small cell lung cancer (NSCLC) patients. By detecting these mutations, healthcare providers can tailor therapies that target the specific genetic alterations, thereby improving treatment efficacy and patient outcomes. The growing adoption of EGFR Mutation Tests underscores the shift towards precision medicine, where treatments are increasingly customized based on individual genetic profiles. This approach not only enhances the effectiveness of therapies but also minimizes adverse effects, as treatments are more accurately aligned with the patient's unique genetic makeup.



    Regionally, North America holds the largest share of the lung cancer diagnostic tests market, followed by Europe and Asia Pacific. The dominance of North America can be attributed to the presence of advanced healthcare infrastructure, high healthcare expenditure, and a robust research landscape. The Asia Pacific region, however, is expected to witness the highest growth rate during the forecast period, driven by increasing healthcare investments, growing awareness about lung cancer, and rising incidences of the disease in countries like China and India. The growing middle-class population and improving healthcare access in these countries further support market growth.



    Test Type Analysis



    The lung cancer diagnostic tests market is segmented by test type into imaging tests, sputum cytology, tissue biopsy, molecular tests, and others. Imaging tests are one of the most commonly used diagnostic methods for lung cancer detection. Techniques such as X-rays, CT scans, and PET scans provide detailed visuals of the lungs, helping in identifying abnormal growths or tumors. The non-invasive nature of these tests and their ability to provide quick results make them a preferred choice among healthcare

  10. H

    Data from: Cancer Mondial

    • data.niaid.nih.gov
    Updated Jul 13, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2011). Cancer Mondial [Dataset]. http://doi.org/10.7910/DVN/W4YJIK
    Explore at:
    Dataset updated
    Jul 13, 2011
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Users can access data about cancer statistics, specifically incidence and mortality worldwide for the 27 major types of cancer. Background Cancer Mondial is maintained by the Section of Cancer Information (CIN) of International Agency for Research on Cancer by the World Health Organization. Users can access CIN databases including GLOBOCAN, CI5(Cancer Incidence in Five Continents), WHO, ACCIS(Automated Childhood Cancer Information System), ECO (European Cancer Observatory), NORDCAN and Survcan. User functionality Users can access a variety of databases. CIN Databases: GLOBOCAN provides acces s to the most recent estimates (for 2008) of the incidence of 27 major cancers and mortality from 27 major cancers worldwide. CI5 (Cancer Incidence in Five Continents) provides access to detailed information on the incidence of cancer recorded by cancer registries (regional or national) worldwide. WHO presents long time series of selected cancer mortality recorded in selected countries of the world. Collaborative projects: ACCIS (Automated Childhood Cancer Information System) provides access to data on cancer incidence and survival of children collected by European cancer registries. ECO (European Cancer Observatory) provides access to the estimates (for 2008) of the incidence of, and mortality f rom 25 major cancers in the countries of the European Union (EU-27). NORDCAN presents up-to-date long time series of cancer incidence, mortality, prevalence and survival from 40 cancers recorded by the Nordic countries. SurvCan presents cancer survival data from cancer registries in low and middle income regions of the world. Data Notes Data is available in different formats depending on which type of data is accessed. Some data is available in table, PDF, and html formats. Detailed information about the data is available.

  11. f

    Data from: Critical review of cancer mortality using hospital records and...

    • scielo.figshare.com
    jpeg
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carolina Panis; Aedra Carla Bufalo Kawasaki; Claudicéia Risso Pascotto; Eglea Yamamoto Della Justina; Geraldo Emílio Vicentini; Léia Carolina Lucio; Rosebel Trindade Cunha Prates (2023). Critical review of cancer mortality using hospital records and potential years of life lost [Dataset]. http://doi.org/10.6084/m9.figshare.6179639.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    SciELO journals
    Authors
    Carolina Panis; Aedra Carla Bufalo Kawasaki; Claudicéia Risso Pascotto; Eglea Yamamoto Della Justina; Geraldo Emílio Vicentini; Léia Carolina Lucio; Rosebel Trindade Cunha Prates
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ABSTRACT Objective To determine and discuss cancer mortality rates in southern Brazil between 1988 and 2012. Methods This was a critical review of literature based on analysis of data concerning incidence and mortality of prostate cancer, breast cancer, bronchial and lung cancer, and uterine and ovarian cancer. Data were collected from the online database of the Brazil Instituto Nacional de Câncer José Alencar Gomes da Silva. Results The southern Brazil is the leading region of cancer incidence and mortality. Data on the cancer profile of this population are scarce especially in the States of Santa Catarina and Paraná. We observed inconsistency between data from hospital registers and death recorded. Conclusion Both cancer incidence and the mortality are high in Brazil. In addition, Brazil has great numbers of registers and deaths for cancer compared to worldwide rates. Regional risk factors might explain the high cancer rates.

  12. Lung Cancer Risk in 25 Countries

    • kaggle.com
    Updated Feb 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aiza Zeeshan (2025). Lung Cancer Risk in 25 Countries [Dataset]. https://www.kaggle.com/datasets/aizahzeeshan/lung-cancer-risk-in-25-countries/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 8, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aiza Zeeshan
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset contains information on lung cancer risk factors across various countries, focusing on demographic details, smoking behaviors, and family history. Researchers and public health professionals can use this data to study patterns of lung cancer incidence, identify trends related to smoking and passive smoking exposure, and assess the impact of family history on lung cancer risk.

    What You Can Perform with This Data:

    Risk Factor Analysis: Analyze how smoking habits, exposure to secondhand smoke, and family history correlate with lung cancer risk. Comparative Study: Compare lung cancer risk factors across different countries and regions. Demographic Insights: Explore how age and gender impact the prevalence of lung cancer risk factors. Statistical Modeling: Build models to predict lung cancer risk based on various factors such as smoking history, exposure to passive smoke, and genetic predisposition. Public Health Research: Identify populations with high-risk behaviors and suggest interventions or preventive measures.

  13. Breast Cancer India Statewise 2016-2021

    • kaggle.com
    Updated Apr 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NITISH SINGHAL (2022). Breast Cancer India Statewise 2016-2021 [Dataset]. https://www.kaggle.com/datasets/nitishsinghal/breast-cancer-india-statewise-20162021
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 26, 2022
    Dataset provided by
    Kaggle
    Authors
    NITISH SINGHAL
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    India
    Description

    Breast cancer is the most frequently diagnosed cancer and the most frequent cause for cancer-related deaths in women worldwide. Globally, breast cancer accounted for 2.08 million out of 18.08 million new cancer cases (incidence rate of 11.6%) and 626,679 out of 9.55 million cancer-related deaths (6.6% of all cancer-related deaths) in 2018. 1,2 In India, breast cancer has surpassed cancers of the cervix and the oral cavity to be the most common cancer and the leading cause of cancer deaths. In 2018, 159,500 new cases of breast cancer were diagnosed, representing 27.7% of all new cancers among Indian women and 11.1% of all cancer deaths.

    In india breast cancer cases reporting and diagnotics have increased 10 times in past 3 years . All thanks to the various cancer awareness initiatives by both private and govt. organisations.

  14. S

    A biomarker-based database system for early diagnosis of nasopharyngeal...

    • scidb.cn
    Updated Feb 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chen Peng; Ma Xin (2025). A biomarker-based database system for early diagnosis of nasopharyngeal carcinoma (NPC-BM) [Dataset]. http://doi.org/10.57760/sciencedb.21419
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 27, 2025
    Dataset provided by
    Science Data Bank
    Authors
    Chen Peng; Ma Xin
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    According to the World Cancer Report 2020 published by the World Health Organization's Institute for Research on Cancer (IARC), there will be 19.29 million new cancer cases and 9.96 million deaths globally in 2020, of which 4.569 million new cases and 3.003 million deaths will occur in China, accounting for 23.7% and 30.2% of the global new cases and deaths, respectively. Among them, China had 4.569 million new cancer cases and 3.003 million deaths, accounting for 23.7% and 30.2% of the global new cases and deaths respectively. China has become the largest country in the world in terms of new cancer cases and deaths.Nasopharyngeal cancer is a kind of malignant tumor with a very high clinical incidence rate, and it is at the top of the list of malignant tumors in otorhinolaryngology. Due to the deep and hidden nasopharyngeal part, the complex relationship with the surrounding area, and the differences in clinical manifestations, early diagnosis is very difficult, and it is very easy to miss the optimal time of treatment due to missed or misdiagnosis. Due to the unique anatomical location and tumor biological behavior of nasopharyngeal cancer, simultaneous radiotherapy has been the main treatment for nasopharyngeal cancer, followed by radiotherapy, chemotherapy, targeted therapy, surgery, and traditional Chinese medicine.Early tumor diagnosis refers to the use of rapid and easy methods to screen out a very small number of tumor high-risk groups from a large number of target populations that appear healthy and have not yet developed symptoms, which can detect tumors early and reduce the risk of morbidity, especially for cancer types with high morbidity and mortality rates and a long developmental cycle, such as lung, gastric, and colorectal cancers. From a global perspective, China's cancer incidence and mortality rates are at a high level, and there are multiple reasons for this phenomenon - medical technology needs to be improved, the quality of the living environment is poor, the routine of life is irregular, and living habits are poor. Compared with chronic diseases such as cardiovascular disease and diabetes, tumor is a "fatal disease" that requires early diagnosis and treatment, and the earlier the diagnosis, the greater the hope of cure. To integrate the data resources and results of early diagnosis of nasopharyngeal cancer and to promote related research, a literature review and information extraction analysis were carried out, and a biomarker-based early diagnosis database of nasopharyngeal cancer was constructed to assist the early diagnosis of nasopharyngeal cancer. The database covers the types of biomarkers, name, specificity, sensitivity, AUC, cell lines used, sample type, sample size, references, and their links. The database contains many types of biomarkers and is a powerful tool for early screening and diagnosis of nasopharyngeal cancer.

  15. S

    Primary Liver Cancer CECT Imaging Dataset

    • scidb.cn
    Updated Aug 25, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiawei Luo; Shixin Huang; Xixi Nie; Xiaoyu Wan (2024). Primary Liver Cancer CECT Imaging Dataset [Dataset]. http://doi.org/10.57760/sciencedb.12207
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 25, 2024
    Dataset provided by
    Science Data Bank
    Authors
    Jiawei Luo; Shixin Huang; Xixi Nie; Xiaoyu Wan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Primary liver cancer is a significant global health issue, characterized by high incidence and mortality rates worldwide. Accurate diagnosis and classification of subtypes are essential for selecting appropriate treatment options and enhancing patient outcomes. Contrast-enhanced computed tomography (CECT) has proven highly sensitive and specific in diagnosing liver cancer. Currently, publicly available datasets of liver cancer CECT scans are limited and often do not comprehensively cover liver cancer subtypes or include complete phasing of CT scans. We hypothesize that utilizing full-phase 3D CECT images, including the Plain, Arterial, Venous, and Delayed phases, can improve the diagnostic classification performance for liver cancer. To test this hypothesis, we have collected a large dataset from a single medical institution that includes 278 cases of liver cancer, featuring Hepatocellular Carcinoma (HCC), Intrahepatic Cholangiocarcinoma (ICC), and Combined Hepatocellular-Cholangiocarcinoma (cHCC-CCA), as well as CECT images from 83 non-liver cancer subjects. For each patient, we annotated the liver and lesion regions. This dataset, rich in liver cancer types and complete in CT phasing, facilitates the development and validation of diagnostic classification models and lesion segmentation models tailored to liver cancer CT imaging.The median age of participants was 59 years 51, 67, with 185 males (67.3% of the liver cancer group) . Each patient had complete 3D contrast-enhanced CT (CECT) data across the Plain, Arterial, Venous, and Delayed phases, stored as NIFTI files. A total of 50,560 slices containing lesions were collected, with a median lesion volume of 75.37 cm³ [26.70, 239.24] . The Python code for loading and processing the data can be found on GitHub (https://github.com/ljwa2323/PLC_CECT).

  16. H

    Data from: GloboCan

    • dataverse.harvard.edu
    Updated Jul 14, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Dataverse (2011). GloboCan [Dataset]. http://doi.org/10.7910/DVN/POOFND
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 14, 2011
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Users can access data about cancer incidence and mortality for all the countries of the world as of 2008. Background GloboCan is a project of the International Agency for Research on Cancer and the World Health Organization (WHO). GloboCan presents estimates of the burden of cancer in 184 countries or territories around the world. User functionality GloboCan provides access to the most recent estimates (from 2008) of the incidence and mortality of 27 major cancers. Users can create fact sheets or do online analysis to create tables, graphs, maps, and predictions. Users c an choose to create tables by population or by cancer type. Covariates for analysis include age group, sex, and continent. Users are able to choose between mortality and incidence statistics. Users can choose to create age specific cancer curves, bar charts, maps, and pie charts. The prediction option allows the user to estimate the future burden of a selected cancer in selected population for a selected year. Data Notes Data sources and methods are clearly outlined on the “Data Sources and Methods” section of the website. Users are able to download their online analysis in PDF or html format. GloboCan uses the definitions outlined in the United Nations, World Population Prospects, 2008 revision (except Cyprus located in Southern Europe and Taiwan is located in Eastern Asia).

  17. Cancer Registry Software Market Analysis, Size, and Forecast 2025-2029:...

    • technavio.com
    Updated Jun 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2025). Cancer Registry Software Market Analysis, Size, and Forecast 2025-2029: North America (US, Canada, and Mexico), Europe (France, Germany, Italy, Spain, and UK), APAC (China and Japan), and Rest of World (ROW) [Dataset]. https://www.technavio.com/report/cancer-registry-software-market-industry-analysis
    Explore at:
    Dataset updated
    Jun 15, 2025
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2021 - 2025
    Area covered
    Canada, United States, Global
    Description

    Snapshot img

    Cancer Registry Software Market Size 2025-2029

    The cancer registry software market size is forecast to increase by USD 121.9 million, at a CAGR of 14% between 2024 and 2029.

    The market is witnessing significant growth due to the escalating prevalence of cancer cases worldwide. The increasing incidence of various types of cancer necessitates the implementation of advanced registry software solutions to manage and analyze patient data more efficiently. Moreover, the burgeoning clinical research in oncology further drives the demand for these systems, as they facilitate data collection, management, and analysis for research purposes. However, the market faces challenges in the form of stringent data privacy and security concerns. With the growing amount of sensitive patient information being stored and shared digitally, ensuring robust data security becomes crucial. The potential risks of data breaches and unauthorized access can significantly impact both patients and healthcare providers, necessitating the adoption of advanced security measures. Companies in the market must prioritize data security and privacy to gain the trust of healthcare organizations and patients alike.

    What will be the Size of the Cancer Registry Software Market during the forecast period?

    Explore in-depth regional segment analysis with market size data - historical 2019-2023 and forecasts 2025-2029 - in the full report.
    Request Free SampleThe market is a dynamic and evolving landscape, continually adapting to advancements in healthcare technology and the growing demand for comprehensive cancer data management. This market encompasses various applications, including disease registry management, cancer staging system, data warehousing, cancer incidence tracking, registry software architecture, data integration platform, clinical data capture, case reporting system, statistical reporting, cancer screening programs, and more. These tools play a crucial role in cancer surveillance systems, enabling the collection, analysis, and reporting of epidemiological data for public health surveillance. They facilitate data encryption for patient data privacy, ensuring HIPAA compliance. Data interoperability and data quality metrics are essential components, allowing for seamless integration of various health informatics tools. Real-time data updates and database management systems are integral to maintaining accurate and up-to-date information. Predictive modeling tools and data mining techniques contribute to risk factor identification and mortality data analysis. Data visualization tools offer valuable insights into the complexities of cancer data. Cancer registry software architecture supports population-based registry initiatives, ensuring secure data storage and registry reporting features. Oncology data management tools enable clinical data capture, case reporting, and statistical reporting, enhancing overall patient care. The ongoing development and refinement of these tools reflect the continuous unfolding of market activities and evolving patterns in cancer data management.

    How is this Cancer Registry Software Industry segmented?

    The cancer registry software industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments. End-userGovernment and third partyPharma biotech and medical device companiesHospitals and medical practicePrivate payersResearch institutesTypeStand-alone softwareIntegrated softwareDeploymentOn-premisesCloud-basedGeographyNorth AmericaUSCanadaMexicoEuropeFranceGermanyItalySpainUKAPACChinaJapanRest of World (ROW)

    By End-user Insights

    The government and third party segment is estimated to witness significant growth during the forecast period.Cancer registry software solutions play a vital role in assisting government and third-party agencies in managing and analyzing data related to cancer cases. These systems enable the tracking of cancer incidence, prevalence, and mortality rates, providing essential information for public health planning, resource allocation, and policy development. Analyzing trends and patterns in registry data helps identify high-risk populations, geographic disparities, and emerging cancer types. Governments utilize cancer registry software to monitor and improve the quality of cancer care. By evaluating variations in treatment practices and adherence to clinical guidelines, they can benchmark outcomes against national or international standards. Additionally, these software solutions facilitate data interoperability, ensuring data quality metrics and HIPAA compliance. Data encryption, data visualization tools, and predictive modeling capabilities enhance the functionality of cancer registry software. Epidemiological data analysis and risk factor identificatio

  18. f

    Table_5_Racial Disparities and Sex Differences in Early- and Late-Onset...

    • frontiersin.figshare.com
    xlsx
    Updated Jun 9, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jessica L. Petrick; Lauren E. Barber; Shaneda Warren Andersen; Andrea A. Florio; Julie R. Palmer; Lynn Rosenberg (2023). Table_5_Racial Disparities and Sex Differences in Early- and Late-Onset Colorectal Cancer Incidence, 2001–2018.xlsx [Dataset]. http://doi.org/10.3389/fonc.2021.734998.s010
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 9, 2023
    Dataset provided by
    Frontiers
    Authors
    Jessica L. Petrick; Lauren E. Barber; Shaneda Warren Andersen; Andrea A. Florio; Julie R. Palmer; Lynn Rosenberg
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundColorectal cancer (CRC) incidence rates have increased in younger individuals worldwide. We examined the most recent early- and late-onset CRC rates for the US.MethodsAge-standardized incidence rates (ASIR, per 100,000) of CRC were calculated using the US Cancer Statistics Database’s high-quality population-based cancer registry data from the entire US population. Results were cross-classified by age (20-49 [early-onset] and 50-74 years [late-onset]), race/ethnicity (non-Hispanic White, non-Hispanic Black, Hispanic, American Indian/Alaskan Native, Asian/Pacific Islander), sex, anatomic location (proximal, distal, rectal), and histology (adenocarcinoma, neuroendocrine).ResultsDuring 2001 through 2018, early-onset CRC rates significantly increased among American Indians/Alaskan Natives, Hispanics, and Whites. Compared to Whites, early-onset CRC rates are now 21% higher in American Indians/Alaskan Natives and 6% higher in Blacks. Rates of early-onset colorectal neuroendocrine tumors have increased in Whites, Blacks, and Hispanics; early-onset colorectal neuroendocrine tumor rates are 2-times higher in Blacks compared to Whites. Late-onset colorectal adenocarcinoma rates are decreasing, while late-onset colorectal neuroendocrine tumor rates are increasing, in all racial/ethnic groups. Late-onset CRC rates remain 29% higher in Blacks and 15% higher in American Indians/Alaskan Natives compared to Whites. Overall, CRC incidence was higher in men than women, but incidence of early-onset distal colon cancer was higher in women.ConclusionsThe early-onset CRC disparity between Blacks and Whites has decreased, due to increasing rates in Whites—rates in Blacks have remained stable. However, rates of colorectal neuroendocrine tumors are increasing in Blacks. Blacks and American Indians/Alaskan Natives have the highest rates of both early- and late-onset CRC.ImpactOngoing prevention efforts must ensure access to and uptake of CRC screening for Blacks and American Indians/Alaskan Natives.

  19. r

    Effectiveness of cervical screening after age 60 according to screening...

    • researchdata.se
    • data.europa.eu
    Updated Oct 18, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pär Sparén (2017). Effectiveness of cervical screening after age 60 according to screening history: nationwide cohort study [Dataset]. http://doi.org/10.5878/002910
    Explore at:
    (100052)Available download formats
    Dataset updated
    Oct 18, 2017
    Dataset provided by
    Karolinska Institutet
    Authors
    Pär Sparén
    Time period covered
    Jan 1, 1970 - Dec 31, 2011
    Area covered
    Sweden
    Description

    The relatively high incidence of cervical cancer in women at older ages is an issue in countries performing cervical screening for decades. Controversy remains on when and how to cease screening. Existing population-based studies on effectiveness of cervical screening at older ages have not considered women’s screening history. We performed a nationwide cohort study to investigate the incidence of cervical cancer after age 60 and its association with cervical screening at ages 61-65, stratified by screening history at ages 51-60. Using the Total Population Register, we identified women born between January 1919 and December 1945, resident in Sweden since age 51. According to the year that each county started the electronic record of cervical screening and women’s resident county, we further identified 569,132 women that have cervical screening record available since age 51. Women’s screening records, cervical cancer occurrence, and level of education were retrieved from the Swedish National Cervical Screening Registry, the National Cancer Register, and LISA (Longitudinal integration database for health insurance and labour market studies) respectively. We presented the cumulative incidence of cervical cancer from age 61-80 by using competing risk regression models, and compared the hazard ratio of cervical cancer by screening status at ages 61-65 from Cox models, adjusted for birth cohort and level of education, conditioning on screening history in their 50s. We find that Cervical screening at ages 61-65 is associated with a statistically significant reduction of subsequent cervical cancer risk for women unscreened, or screened with abnormalities, in their 50s. In women screened negative in their 50s, the risk for future cancer is not sizeable, and the risk reduction associated with continued screening appears limited. These findings should inform the current debate regarding age and criteria to discontinue cervical screening.

    Purpose:

    In order to provide evidence for age and criteria to discontinue cervical screening, we use this data to investigate the impact of cervical screening at ages 61-65 on cervical cancer incidence and stage at ages 61-80, stratifying by screening history at ages 51-60.

    This data comprises women born between January 1919 and December 1945, resident in Sweden since age 51, and having cervical screening record available since age 51. It contains the following variables: - Seq_nr: sequence number indicating each individual woman, from 1 to 569,132. - Edu_cat: level of education in three categories: 1=low (less than high school); 2=high school; 3=university exam and above; .=missing. Data are retrieved from LISA (Longitudinal integration database for health insurance and labour market studies). - Birth_cat: five categories of birth-year: 1=1919-1925; 2=1926-1930; 3=1931-1935; 4=1936-1940; 5=1941-1945. - Scr_51_60: Screening history at ages 51-60, in five categories: 1=adequately screened, negative; 2=inadequately screened, negative; 3=unscreened; 4=having low-grade abnormality; 5=having high-grade abnormality. Data are retrieved from the Swedish National Cervical Screening Registry. - Age_first_scr_6165: age at having the first screening test at ages 61-65. (Missing value indicates there is no screening test at ages 61-65). Data are retrieved from the Swedish National Cervical Screening Registry. - Orgscr_county: If in the county that had more than 40% women being screened at ages 61-65: 0=no; 1=yes. - Age_entry: age when entering the cohort, which is 61 for all women. - Age_exit: age when the follow-up is finished. - Cx_fail: the event of finishing follow-up: 1=having cervical cancer; 2=competing events (death or having total hysterectomy); 3=censoring (emigration, turning age 81, or 2011-12-31). The information is retrieved from the Swedish National Cancer Registry (cervical cancer), Cause of Death Register (death), Patient Register (hysterectomy), and Migration Register (emigration). The dataset also includes three variables created by Swedish National Dataservice (SND-study, SND-dataset, SND-version).

  20. g

    Standardized course of care for patients with cancer, proportion (%) |...

    • gimi9.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Standardized course of care for patients with cancer, proportion (%) | gimi9.com [Dataset]. https://gimi9.com/dataset/eu_http-api-kolada-se-v2-kpi-n70643/
    Explore at:
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Proportion of patients examined within a standardized course of care out of the total number of people who are expected to receive a cancer diagnosis during the year. The calculation is based on data from the National Board of Health and Welfare. Standardized course of care (SVF) is a national approach that aims to reduce unnecessary waiting and uncertainty for the patient. All SVFs start with a well-founded suspicion of cancer. What is a well-founded suspicion, how it should be investigated and how long this may take, is stated in the national care program for each cancer diagnosis. The time from well-founded suspicion to the start of treatment is measured in the same way throughout the country. An SVF describes the investigations to be carried out in the event of suspicion of a particular cancer disease, as well as the maximum time limits within which the investigations must be completed and the first treatment must be started. The denominator consists of the total number of people who are expected to receive a cancer diagnosis during the year. The calculation is based on the number of cancer cases in the last three years with data from the Cancer Registry at the National Board of Health and Welfare.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Opendatabay Labs (2025). Synthetic Colorectal Cancer Global Dataset [Dataset]. https://www.opendatabay.com/data/synthetic/ae2aba99-491d-45a1-a99e-7be14927f4af

Synthetic Colorectal Cancer Global Dataset

Explore at:
.undefinedAvailable download formats
Dataset updated
Jun 28, 2025
Dataset authored and provided by
Opendatabay Labs
License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Area covered
Patient Health Records & Digital Health
Description

The Synthetic Colorectal Cancer Global Dataset is a fully anonymised, high-dimensional synthetic dataset designed for global cancer research, predictive modelling, and educational use. It encompasses demographic, clinical, lifestyle, genetic, and healthcare access factors relevant to colorectal cancer incidence, outcomes, and survivability.

Dataset Features

  • Patient_ID: Unique identifier for each patient.
  • Country: Patient's country of residence.
  • Age: Age at diagnosis (in years).
  • Gender: Biological sex of the patient (Male/Female/Other).
  • Cancer_Stage: Stage of colorectal cancer at diagnosis (e.g., Stage I–IV).
  • Tumor_Size_mm: Size of the tumor in millimeters.
  • Family_History: Presence of colorectal cancer in family history (True/False).
  • Smoking_History: Smoking behavior or history (e.g., Current, Former, Never).
  • Alcohol_Consumption: Level of alcohol consumption (e.g., High, Moderate, None).
  • Obesity_BMI: BMI classification related to obesity.
  • Diet_Risk: Diet-related cancer risk (e.g., High Fat, Low Fiber).
  • Physical_Activity: Level of physical activity (e.g., Sedentary, Active).
  • Diabetes: Diabetes diagnosis (True/False).
  • Inflammatory_Bowel_Disease: Presence of IBD (True/False).
  • Genetic_Mutation: Genetic mutations relevant to colorectal cancer (e.g., APC, KRAS).
  • Screening_History: History of cancer screenings (True/False).
  • Early_Detection: Whether cancer was detected early (True/False).
  • Treatment_Type: Primary treatment type (e.g., Surgery, Chemotherapy, Radiation).
  • Survival_5_years: 5-year survival status (True/False).
  • Mortality: Mortality outcome (Alive/Deceased).
  • Healthcare_Costs: Estimated treatment costs (in USD).
  • Incidence_Rate_per_100K: Country-level incidence rate per 100,000 people.
  • Mortality_Rate_per_100K: Country-level mortality rate per 100,000 people.
  • Urban_or_Rural: Patient's living area (Urban/Rural).
  • Economic_Classification: Country's economic level (e.g., Low, Middle, High income).
  • Healthcare_Access: Access level to healthcare services (e.g., Good, Limited).
  • Insurance_Status: Insurance coverage status (Insured/Uninsured).
  • Survival_Prediction: Model-derived survival prediction (probability or binary).

Distribution

https://storage.googleapis.com/opendatabay_public/ae2aba99-491d-45a1-a99e-7be14927f4af/299af3fa2502_patient_analysis_plots.png" alt="Synthetic Colorectal Cancer Global Data Distribution.png">

Usage

This dataset can be used for:

  • Global Cancer Research: Analyze how clinical, lifestyle, and socioeconomic factors affect colorectal cancer outcomes worldwide.
  • Predictive Modeling: Develop models to estimate survival probability or treatment outcomes.
  • Healthcare Policy Analysis: Study disparities in healthcare access and outcomes across countries.
  • Educational Use: Support training in epidemiology, oncology, public health, and machine learning.

Coverage

The dataset includes 100% synthetic yet clinically plausible records from diverse countries and demographic groups. It is anonymized and modeled to reflect real-world variability in risk factors, diagnosis stages, treatment, and survival without compromising patient privacy.

License

CC0 (Public Domain)

Who Can Use It

  • Epidemiologists and Medical Researchers: To explore global patterns in colorectal cancer.
  • Public Health Experts and Policymakers: For assessing equity in healthcare access and cancer outcomes.
  • Data Scientists and Educators: As a rich dataset for teaching data analysis, classification, regression, and health informatics.
Search
Clear search
Close search
Google apps
Main menu