100+ datasets found

CDC WONDER: Cancer Statistics
healthdata.gov
data.virginia.gov
+5more
application/rdfxml +5
Updated Feb 13, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). CDC WONDER: Cancer Statistics [Dataset]. https://healthdata.gov/dataset/CDC-WONDER-Cancer-Statistics/mv5s-m59f
Explore at:
xml, tsv, application/rssxml, csv, application/rdfxml, jsonAvailable download formats
Dataset updated
Feb 13, 2021
Description
The United States Cancer Statistics (USCS) online databases in WONDER provide cancer incidence and mortality data for the United States for the years since 1999, by year, state and metropolitan areas (MSA), age group, race, ethnicity, sex, childhood cancer classifications and cancer site. Report case counts, deaths, crude and age-adjusted incidence and death rates, and 95% confidence intervals for rates. The USCS data are the official federal statistics on cancer incidence from registries having high-quality data and cancer mortality statistics for 50 states and the District of Columbia. USCS are produced by the Centers for Disease Control and Prevention (CDC) and the National Cancer Institute (NCI), in collaboration with the North American Association of Central Cancer Registries (NAACCR). Mortality data are provided by the Centers for Disease Control and Prevention (CDC), National Center for Health Statistics (NCHS), National Vital Statistics System (NVSS).
p
BREAST CANCER - Dataset - CKAN
data.poltekkes-smg.ac.id
Updated Oct 7, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). BREAST CANCER - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/breast-cancer
Explore at:
Dataset updated
Oct 7, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset of breast cancer patients was obtained from the 2017 November update of the SEER Program of the NCI, which provides information on population-based cancer statistics. The dataset involved female patients with infiltrating duct and lobular carcinoma breast cancer (SEER primary cites recode NOS histology codes 8522/3) diagnosed in 2006-2010. Patients with unknown tumour size, examined regional LNs, positive regional LNs, and patients whose survival months were less than 1 month were excluded; thus, 4024 patients were ultimately included.
i
SEER Breast Cancer Data
ieee-dataport.org
data.niaid.nih.gov
+2more
Updated Jun 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
jing teng (2025). SEER Breast Cancer Data [Dataset]. https://ieee-dataport.org/open-access/seer-breast-cancer-data
Explore at:
Dataset updated
Jun 16, 2025
Authors
jing teng
Description
examined regional LNs
Cancer Statistics | DATA.GOV.HK
data.gov.hk
Updated Jul 25, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.gov.hk (2024). Cancer Statistics | DATA.GOV.HK [Dataset]. https://data.gov.hk/en-data/dataset/hk-dh-dh_ncddhss-ncdd-dataset-11
Explore at:
Dataset updated
Jul 25, 2024
Dataset provided by
data.gov.hk
Description
Number of Cancer New Cases and Registered Deaths by Ten Leading Cancer Disease Group by Sex 2022
Oral Cancer Prediction Dataset
kaggle.com
Updated Mar 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ankush Panday (2025). Oral Cancer Prediction Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/10942559
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/10942559
Dataset updated
Mar 6, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ankush Panday
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This dataset provides a detailed and structured overview of oral cancer cases worldwide. It includes key risk factors, symptoms, cancer staging, survival rates, treatment approaches, and economic burden to facilitate research and prediction modeling. The dataset is based on real-world oral cancer statistics, aligning with global health reports and studies.

Key Highlights: Covers high-incidence regions (India, Pakistan, Sri Lanka, Taiwan) and emerging trends in Western nations. Includes tobacco, alcohol, HPV infection, betel quid use, and dietary factors as primary risk factors. Captures economic burden (treatment costs, workdays lost) to assess the financial impact of oral cancer. Provides cancer staging, survival rates, and early diagnosis indicators for better treatment predictions. This dataset is valuable for medical professionals, researchers, data scientists, and policymakers aiming to develop early detection models, assess regional disparities, and improve cancer prevention strategies.

Columns Overview ID – Unique identifier Country – Country name Age – Age of the individual Gender – Male/Female Tobacco Use – Yes/No Alcohol Consumption – Yes/No HPV Infection – Yes/No Betel Quid Use – Yes/No Chronic Sun Exposure – Yes/No Poor Oral Hygiene – Yes/No Diet (Fruits & Vegetables Intake) – Low/Moderate/High Family History of Cancer – Yes/No Compromised Immune System – Yes/No Oral Lesions – Yes/No Unexplained Bleeding – Yes/No Difficulty Swallowing – Yes/No White or Red Patches in Mouth – Yes/No Tumor Size (cm) – Numerical value Cancer Stage – 0 (No Cancer), 1, 2, 3, 4 Treatment Type – Surgery/Radiation/Chemotherapy/Targeted Therapy/No Treatment Survival Rate (5-Year, %) Cost of Treatment (USD) Economic Burden (Lost Workdays per Year) Early Diagnosis (Yes/No) Oral Cancer (Diagnosis) – Yes/No (Target Variable)
Cancer Statistics in US States
kaggle.com
Updated Jun 17, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ms. Nancy Al Aswad (2022). Cancer Statistics in US States [Dataset]. https://www.kaggle.com/datasets/nancyalaswad90/cancer-statistics-in-us-states
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 17, 2022
Dataset provided by
Kaggle
Authors
Ms. Nancy Al Aswad
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
United States
Description
What are Cancer Statistics in US States?

The circled group of good survivors has genetic indicators of poor survivors (i.e. low ESR1 levels, which is typically the prognostic indicator of poor outcomes in breast cancer) – understanding this group could be critical for helping improve mortality rates for this disease. Why this group survived was quickly analysed by using the Outcome Column (here Event Death - which is binary - 0,1) as a Data Lens (which we term Supervised vs Unsupervised analyses).

How to use this dataset

A network was built using only gene expression with 272 breast cancer patients (as rows), and 1570 columns.

Metadata includes patient info, treatment, and survival.

Each node is a group of patients similar to each other. Flares (left) represent sub-populations that are distinct from the larger population. (One differentiating factor between the two flares is estrogen expression (low = top flare, high = bottom flare)).

A bottom flare is a group of patients with 100% survival. The top flare shows a range of survival – very poor towards the tip (red), and very good near the base (circled).

Acknowledgments

When we use this dataset in our research, we credit the authors as :

License : CC BY 4.0.

This data set is taken from https://query.data.world/s/yi422lv7mkhnydnt4ixrfujmoaglpk .

The main idea for uploading this dataset is to practice data analysis with my students, as I am working in college and want my student to train our studying ideas in a big dataset, It may be not up to date and I mention the collecting years, but it is a good resource of data to practice
Cancer registration statistics, England
ons.gov.uk
cy.ons.gov.uk
xlsx
Updated Apr 26, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics (2019). Cancer registration statistics, England [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/datasets/cancerregistrationstatisticscancerregistrationstatisticsengland
Explore at:
xlsxAvailable download formats
Dataset updated
Apr 26, 2019
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
Cancer diagnoses and age-standardised incidence rates for all types of cancer by age and sex including breast, prostate, lung and colorectal cancer.
H
SEER Cancer Statistics Database
data.niaid.nih.gov
Updated Jul 11, 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2011). SEER Cancer Statistics Database [Dataset]. http://doi.org/10.7910/DVN/C9KBBC
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/C9KBBC
Dataset updated
Jul 11, 2011
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Users can access data about cancer statistics in the United States including but not limited to searches by type of cancer and race, sex, ethnicity, age at diagnosis, and age at death. Background Surveillance Epidemiology and End Results (SEER) database’s mission is to provide information on cancer statistics to help reduce the burden of disease in the U.S. population. The SEER database is a project to the National Cancer Institute. The SEER database collects information on incidence, prevalence, and survival from specific geographic areas representing 28 percent of the United States population. User functionality Users can access a variety of reso urces. Cancer Stat Fact Sheets allow users to look at summaries of statistics by major cancer type. Cancer Statistic Reviews are available from 1975-2008 in table format. Users are also able to build their own tables and graphs using Fast Stats. The Cancer Query system provides more flexibility and a larger set of cancer statistics than F ast Stats but requires more input from the user. State Cancer Profiles include dynamic maps and graphs enabling the investigation of cancer trends at the county, state, and national levels. SEER research data files and SEER*Stat software are available to download through your Internet connection (SEER*Stat’s client-server mode) or via discs shipped directly to you. A signed data agreement form is required to access the SEER data Data Notes Data is available in different formats depending on which type of data is accessed. Some data is available in table, PDF, and html formats. Detailed information about the data is available under “Data Documentation and Variable Recodes”.
A
‘🎗️ Cancer Rates by U.S. State’ analyzed by Analyst-2
analyst-2.ai
Updated Feb 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘🎗️ Cancer Rates by U.S. State’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-cancer-rates-by-u-s-state-5f6a/af56eb24/?iid=000-919&v=presentation
Explore at:
Dataset updated
Feb 13, 2022
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Description
Analysis of ‘🎗️ Cancer Rates by U.S. State’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/cancer-rates-by-u-s-statee on 13 February 2022.

--- Dataset description provided by original source is as follows ---

About this dataset

In the following maps, the U.S. states are divided into groups based on the rates at which people developed or died from cancer in 2013, the most recent year for which incidence data are available.

The rates are the numbers out of 100,000 people who developed or died from cancer each year.

Incidence Rates by State
The number of people who get cancer is called cancer incidence. In the United States, the rate of getting cancer varies from state to state.

*Rates are per 100,000 and are age-adjusted to the 2000 U.S. standard population.

‡Rates are not shown if the state did not meet USCS publication criteria or if the state did not submit data to CDC.

†Source: U.S. Cancer Statistics Working Group. United States Cancer Statistics: 1999–2013 Incidence and Mortality Web-based Report. Atlanta (GA): Department of Health and Human Services, Centers for Disease Control and Prevention, and National Cancer Institute; 2016. Available at: http://www.cdc.gov/uscs.

Death Rates by State
Rates of dying from cancer also vary from state to state.

*Rates are per 100,000 and are age-adjusted to the 2000 U.S. standard population.

†Source: U.S. Cancer Statistics Working Group. United States Cancer Statistics: 1999–2013 Incidence and Mortality Web-based Report. Atlanta (GA): Department of Health and Human Services, Centers for Disease Control and Prevention, and National Cancer Institute; 2016. Available at: http://www.cdc.gov/uscs.

Source: https://www.cdc.gov/cancer/dcpc/data/state.htm

This dataset was created by Adam Helsinger and contains around 100 samples along with Range, Rate, technical information and other features such as: - Range - Rate - and more.

How to use this dataset

Analyze Range in relation to Rate

Study the influence of Range on Rate

More datasets

Acknowledgements

If you use this dataset in your research, please credit Adam Helsinger

Start A New Notebook!

--- Original source retains full ownership of the source dataset ---
o
Synthetic Oral Cancer Prediction Dataset
opendatabay.com
.undefined
Updated Jun 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Opendatabay Labs (2025). Synthetic Oral Cancer Prediction Dataset [Dataset]. https://www.opendatabay.com/data/synthetic/09f348fc-a2e8-4132-9f1b-195765d80afc
Explore at:
.undefinedAvailable download formats
Dataset updated
Jun 26, 2025
Dataset authored and provided by
Opendatabay Labs
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Patient Health Records & Digital Health
Description
The Synthetic Oral Cancer Prediction Dataset is designed for educational and research purposes to analyse factors associated with oral cancer risk, progression, and treatment outcomes. The dataset includes anonymised, synthetic data on various clinical, lifestyle, and demographic factors for individuals diagnosed with oral cancer.

Dataset Features

ID: Unique identifier for each participant.

Country: Country of residence of the participant.

Age: Age of the participant (in years).

Gender: Gender of the participant (Male/Female).

Tobacco Use: History of tobacco use (Yes/No).

Alcohol Consumption: History of alcohol consumption (Yes/No).

HPV Infection: Presence of human papillomavirus infection (Yes/No).

Betel Quid Use: History of Betel quid use (Yes/No).

Chronic Sun Exposure: History of chronic sun exposure (Yes/No).

Poor Oral Hygiene: Poor oral hygiene habits (Yes/No).

Diet (Fruits & Vegetables Intake): Frequency of consuming fruits and vegetables (Yes/No).

Family History of Cancer: Family history of cancer (Yes/No).

Compromised Immune System: Whether the participant has a compromised immune system (Yes/No).

Oral Lesions: Presence of oral lesions (Yes/No).

Unexplained Bleeding: Presence of unexplained bleeding (Yes/No).

Difficulty Swallowing: Difficulty in swallowing (Yes/No).

White or Red Patches in Mouth: Presence of white or red patches in the mouth (Yes/No).

Tumor Size (cm): Size of the tumor in centimeters.

Cancer Stage: Stage of the oral cancer (1-4).

Treatment Type: Type of treatment received (e.g., Surgery, Radiation, Chemotherapy).

Survival Rate (5-Year, %): 5-year survival rate in percentage.

Cost of Treatment (USD): Total cost of treatment in USD.

Economic Burden (Lost Workdays per Year): Economic burden due to lost workdays each year.

Early Diagnosis: Whether early diagnosis was made (Yes/No).

Oral Cancer (Diagnosis): Diagnosis of oral cancer (Yes/No).

Distribution

https://storage.googleapis.com/opendatabay_public/09f348fc-a2e8-4132-9f1b-195765d80afc/622bf59174d1_plot_output.png" alt="Synthetic oral cancer dataset plot_output.png">

Usage

This dataset can be used for the following applications:

Cancer Research: Investigate the relationship between various lifestyle, clinical, and demographic factors with oral cancer risk and progression.

Predictive Modeling: Build machine learning models to predict cancer diagnosis, survival rate, or treatment outcomes based on participant data.

Healthcare and Public Health: Study the impact of lifestyle factors (e.g., tobacco, alcohol, diet) on the development and progression of oral cancer.

Educational Purposes: Provide a dataset for students and researchers in oncology, medical data science, and public health fields to analyze cancer risk factors and treatment outcomes.

Coverage

This synthetic dataset is fully anonymized and complies with data privacy standards. It includes a wide array of factors that support diverse research and analysis in the oncology and public health domains.

License

CC0 (Public Domain)

Who Can Use It

Cancer Researchers: To explore correlations between lifestyle factors, clinical features, and treatment outcomes in oral cancer.

Oncologists and Healthcare Providers: To analyze the effectiveness of different treatments and factors that affect prognosis and survival.

Public Health Professionals: To study the broader societal and economic impacts of oral cancer and develop preventive measures.

Data Scientists and Machine Learning Practitioners: To develop predictive models for diagnosing oral cancer and improving treatment planning.

Educators and Students: As a resource for studying cancer risk analysis, healthcare data science, and public health analytics.
d
[ARCHIVED] Health Statistics Cancer Rates 2002-2010
datasets.ai
data.novascotia.ca
+2more
0, 21, 40, 41, 55, 8
Updated Sep 11, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Nova Scotia | Gouvernment de la Nouvelle-Écosse (2024). [ARCHIVED] Health Statistics Cancer Rates 2002-2010 [Dataset]. https://datasets.ai/datasets/7ecc1a67-1c9f-8790-b659-50edd59e94a9
Explore at:
55, 8, 21, 41, 40, 0Available download formats
Dataset updated
Sep 11, 2024
Dataset authored and provided by
Government of Nova Scotia | Gouvernment de la Nouvelle-Écosse
Description
[ARCHIVED] Community Counts data is retained for archival purposes only, such as research, reference and record-keeping. This data has not been maintained or updated. Users looking for the latest information should refer to Statistics Canada’s Census Program (https://www12.statcan.gc.ca/census-recensement/index-eng.cfm?MM=1) for the latest data, including detailed results about Nova Scotia. This table reports cancer rates by primary site, age and sex. Geographies available: county, district health authorities
G
Number and rates of new cases of primary cancer, by cancer type, age group...
open.canada.ca
www150.statcan.gc.ca
+2more
csv, html, xml
Updated Feb 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statistics Canada (2025). Number and rates of new cases of primary cancer, by cancer type, age group and sex [Dataset]. https://open.canada.ca/data/en/dataset/e667992c-5f2e-425a-8a44-a880930d82d8
Explore at:
csv, xml, htmlAvailable download formats
Dataset updated
Feb 3, 2025
Dataset provided by
Statistics Canada
License
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Description
Number and rate of new cancer cases diagnosed annually from 1992 to the most recent diagnosis year available. Included are all invasive cancers and in situ bladder cancer with cases defined using the Surveillance, Epidemiology and End Results (SEER) Groups for Primary Site based on the World Health Organization International Classification of Diseases for Oncology, Third Edition (ICD-O-3). Random rounding of case counts to the nearest multiple of 5 is used to prevent inappropriate disclosure of health-related information.
a
Cancer (in persons of all ages): England
hub.arcgis.com
data.catchmentbasedapproach.org
Updated Apr 6, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Rivers Trust (2021). Cancer (in persons of all ages): England [Dataset]. https://hub.arcgis.com/datasets/c5c07229db684a65822fdc9a29388b0b
Explore at:
Dataset updated
Apr 6, 2021
Dataset authored and provided by
The Rivers Trust
Area covered

Description
SUMMARYThis analysis, designed and executed by Ribble Rivers Trust, identifies areas across England with the greatest levels of cancer (in persons of all ages). Please read the below information to gain a full understanding of what the data shows and how it should be interpreted.ANALYSIS METHODOLOGYThe analysis was carried out using Quality and Outcomes Framework (QOF) data, derived from NHS Digital, relating to cancer (in persons of all ages).This information was recorded at the GP practice level. However, GP catchment areas are not mutually exclusive: they overlap, with some areas covered by 30+ GP practices. Therefore, to increase the clarity and usability of the data, the GP-level statistics were converted into statistics based on Middle Layer Super Output Area (MSOA) census boundaries.The percentage of each MSOA’s population (all ages) with cancer was estimated. This was achieved by calculating a weighted average based on:The percentage of the MSOA area that was covered by each GP practice’s catchment areaOf the GPs that covered part of that MSOA: the percentage of registered patients that have that illness The estimated percentage of each MSOA’s population with cancer was then combined with Office for National Statistics Mid-Year Population Estimates (2019) data for MSOAs, to estimate the number of people in each MSOA with cancer, within the relevant age range.Each MSOA was assigned a relative score between 1 and 0 (1 = worst, 0 = best) based on:A) the PERCENTAGE of the population within that MSOA who are estimated to have cancerB) the NUMBER of people within that MSOA who are estimated to have cancerAn average of scores A & B was taken, and converted to a relative score between 1 and 0 (1= worst, 0 = best). The closer to 1 the score, the greater both the number and percentage of the population in the MSOA that are estimated to have cancer, compared to other MSOAs. In other words, those are areas where it’s estimated a large number of people suffer from cancer, and where those people make up a large percentage of the population, indicating there is a real issue with cancer within the population and the investment of resources to address that issue could have the greatest benefits.LIMITATIONS1. GP data for the financial year 1st April 2018 – 31st March 2019 was used in preference to data for the financial year 1st April 2019 – 31st March 2020, as the onset of the COVID19 pandemic during the latter year could have affected the reporting of medical statistics by GPs. However, for 53 GPs (out of 7670) that did not submit data in 2018/19, data from 2019/20 was used instead. Note also that some GPs (997 out of 7670) did not submit data in either year. This dataset should be viewed in conjunction with the ‘Health and wellbeing statistics (GP-level, England): Missing data and potential outliers’ dataset, to determine areas where data from 2019/20 was used, where one or more GPs did not submit data in either year, or where there were large discrepancies between the 2018/19 and 2019/20 data (differences in statistics that were > mean +/- 1 St.Dev.), which suggests erroneous data in one of those years (it was not feasible for this study to investigate this further), and thus where data should be interpreted with caution. Note also that there are some rural areas (with little or no population) that do not officially fall into any GP catchment area (although this will not affect the results of this analysis if there are no people living in those areas).2. Although all of the obesity/inactivity-related illnesses listed can be caused or exacerbated by inactivity and obesity, it was not possible to distinguish from the data the cause of the illnesses in patients: obesity and inactivity are highly unlikely to be the cause of all cases of each illness. By combining the data with data relating to levels of obesity and inactivity in adults and children (see the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset), we can identify where obesity/inactivity could be a contributing factor, and where interventions to reduce obesity and increase activity could be most beneficial for the health of the local population.3. It was not feasible to incorporate ultra-fine-scale geographic distribution of populations that are registered with each GP practice or who live within each MSOA. Populations might be concentrated in certain areas of a GP practice’s catchment area or MSOA and relatively sparse in other areas. Therefore, the dataset should be used to identify general areas where there are high levels of cancer, rather than interpreting the boundaries between areas as ‘hard’ boundaries that mark definite divisions between areas with differing levels of cancer.TO BE VIEWED IN COMBINATION WITH:This dataset should be viewed alongside the following datasets, which highlight areas of missing data and potential outliers in the data:Health and wellbeing statistics (GP-level, England): Missing data and potential outliersLevels of obesity, inactivity and associated illnesses (England): Missing dataDOWNLOADING THIS DATATo access this data on your desktop GIS, download the ‘Levels of obesity, inactivity and associated illnesses: Summary (England)’ dataset.DATA SOURCESThis dataset was produced using:Quality and Outcomes Framework data: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital.GP Catchment Outlines. Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital. Data was cleaned by Ribble Rivers Trust before use.MSOA boundaries: © Office for National Statistics licensed under the Open Government Licence v3.0. Contains OS data © Crown copyright and database right 2021.Population data: Mid-2019 (June 30) Population Estimates for Middle Layer Super Output Areas in England and Wales. © Office for National Statistics licensed under the Open Government Licence v3.0. © Crown Copyright 2020.COPYRIGHT NOTICEThe reproduction of this data must be accompanied by the following statement:© Ribble Rivers Trust 2021. Analysis carried out using data that is: Copyright © 2020, Health and Social Care Information Centre. The Health and Social Care Information Centre is a non-departmental body created by statute, also known as NHS Digital; © Office for National Statistics licensed under the Open Government Licence v3.0. Contains OS data © Crown copyright and database right 2021. © Crown Copyright 2020.CaBA HEALTH & WELLBEING EVIDENCE BASEThis dataset forms part of the wider CaBA Health and Wellbeing Evidence Base.
Breast Cancer India Statewise 2016-2021
kaggle.com
Updated Apr 26, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NITISH SINGHAL (2022). Breast Cancer India Statewise 2016-2021 [Dataset]. https://www.kaggle.com/datasets/nitishsinghal/breast-cancer-india-statewise-20162021
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 26, 2022
Dataset provided by
Kaggle
Authors
NITISH SINGHAL
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
India
Description
Breast cancer is the most frequently diagnosed cancer and the most frequent cause for cancer-related deaths in women worldwide. Globally, breast cancer accounted for 2.08 million out of 18.08 million new cancer cases (incidence rate of 11.6%) and 626,679 out of 9.55 million cancer-related deaths (6.6% of all cancer-related deaths) in 2018. 1,2 In India, breast cancer has surpassed cancers of the cervix and the oral cavity to be the most common cancer and the leading cause of cancer deaths. In 2018, 159,500 new cases of breast cancer were diagnosed, representing 27.7% of all new cancers among Indian women and 11.1% of all cancer deaths.

In india breast cancer cases reporting and diagnotics have increased 10 times in past 3 years . All thanks to the various cancer awareness initiatives by both private and govt. organisations.
Data from: County-level cumulative environmental quality associated with...
catalog.data.gov
s.cnmilf.com
Updated Nov 12, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2020). County-level cumulative environmental quality associated with cancer incidence. [Dataset]. https://catalog.data.gov/dataset/county-level-cumulative-environmental-quality-associated-with-cancer-incidence
Explore at:
Dataset updated
Nov 12, 2020
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description
Population based cancer incidence rates were abstracted from National Cancer Institute, State Cancer Profiles for all available counties in the United States for which data were available. This is a national county-level database of cancer data that are collected by state public health surveillance systems. All-site cancer is defined as any type of cancer that is captured in the state registry data, though non-melanoma skin cancer is not included. All-site age-adjusted cancer incidence rates were abstracted separately for males and females. County-level annual age-adjusted all-site cancer incidence rates for years 2006–2010 were available for 2687 of 3142 (85.5%) counties in the U.S. Counties for which there are fewer than 16 reported cases in a specific area-sex-race category are suppressed to ensure confidentiality and stability of rate estimates; this accounted for 14 counties in our study. Two states, Kansas and Virginia, do not provide data because of state legislation and regulations which prohibit the release of county level data to outside entities. Data from Michigan does not include cases diagnosed in other states because data exchange agreements prohibit the release of data to third parties. Finally, state data is not available for three states, Minnesota, Ohio, and Washington. The age-adjusted average annual incidence rate for all counties was 453.7 per 100,000 persons. We selected 2006–2010 as it is subsequent in time to the EQI exposure data which was constructed to represent the years 2000–2005. We also gathered data for the three leading causes of cancer for males (lung, prostate, and colorectal) and females (lung, breast, and colorectal). The EQI was used as an exposure metric as an indicator of cumulative environmental exposures at the county-level representing the period 2000 to 2005. A complete description of the datasets used in the EQI are provided in Lobdell et al. and methods used for index construction are described by Messer et al. The EQI was developed for the period 2000– 2005 because it was the time period for which the most recent data were available when index construction was initiated. The EQI includes variables representing each of the environmental domains. The air domain includes 87 variables representing criteria and hazardous air pollutants. The water domain includes 80 variables representing overall water quality, general water contamination, recreational water quality, drinking water quality, atmospheric deposition, drought, and chemical contamination. The land domain includes 26 variables representing agriculture, pesticides, contaminants, facilities, and radon. The built domain includes 14 variables representing roads, highway/road safety, public transit behavior, business environment, and subsidized housing environment. The sociodemographic environment includes 12 variables representing socioeconomics and crime. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Human health data are not available publicly. EQI data are available at: https://edg.epa.gov/data/Public/ORD/NHEERL/EQI. Format: Data are stored as csv files. This dataset is associated with the following publication: Jagai, J., L. Messer, K. Rappazzo , C. Gray, S. Grabich , and D. Lobdell. County-level environmental quality and associations with cancer incidence#. Cancer. John Wiley & Sons Incorporated, New York, NY, USA, 123(15): 2901-2908, (2017).
Cancer mortality trends, by sex and cancer type
www150.statcan.gc.ca
ouvert.canada.ca
+1more
Updated Feb 4, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cancer mortality trends, by sex and cancer type [Dataset]. https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=1310083901
Explore at:
Unique identifier
https://doi.org/10.25318/1310083901-eng
Dataset updated
Feb 4, 2022
Dataset provided by
Statistics Canadahttps://statcan.gc.ca/en
Area covered
Canada
Description
Annual percent change and average annual percent change in age-standardized cancer mortality rates since 1984 to the most recent data year. The table includes a selection of commonly diagnosed invasive cancers and causes of death are defined based on the World Health Organization International Classification of Diseases, ninth revision (ICD-9) from 1984 to 1999 and on its tenth revision (ICD-10) from 2000 to the most recent year.
State Cancer Profiles Web site
catalog.data.gov
healthdata.gov
+3more
Updated Jul 26, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Health & Human Services (2023). State Cancer Profiles Web site [Dataset]. https://catalog.data.gov/dataset/state-cancer-profiles-web-site
Explore at:
Dataset updated
Jul 26, 2023
Dataset provided by
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Description
The State Cancer Profiles (SCP) web site provides statistics to help guide and prioritize cancer control activities at the state and local levels. SCP is a collaborative effort using local and national level cancer data from the Centers for Disease Control and Prevention's National Program of Cancer Registries (NPCR) and National Cancer Institute's Surveillance, Epidemiology and End Results Registries (SEER). SCP address select types of cancer and select behavioral risk factors for which there are evidence-based control interventions. The site provides incidence, mortality and prevalence comparison tables as well as interactive graphs and maps and support data. The graphs and maps provide visual support for deciding where to focus cancer control efforts.
p
Urinary biomarkers for pancreatic cancer - Dataset - CKAN
data.poltekkes-smg.ac.id
Updated Oct 8, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Urinary biomarkers for pancreatic cancer - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/urinarybiomarkers-for-pancreatic-cancer
Explore at:
Dataset updated
Oct 8, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Pancreatic cancer is an extremely deadly type of cancer. Once diagnosed, the five-year survival rate is less than 10%. However, if pancreatic cancer is caught early, the odds of surviving are much better. Unfortunately, many cases of pancreatic cancer show no symptoms until the cancer has spread throughout the body. A diagnostic test to identify people with pancreatic cancer could be enormously helpful.
d
[MI] Rapid Cancer Registration Data
digital.nhs.uk
Updated Jul 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). [MI] Rapid Cancer Registration Data [Dataset]. https://digital.nhs.uk/data-and-information/publications/statistical/mi-rapid-cancer-registration-data
Explore at:
Dataset updated
Jul 3, 2025
License
https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
Description
Rapid Cancer Registration Data (RCRD) provides a quick, indicative source of cancer data. It is provided to support the planning and provision of cancer services. The data is based on a rapid processing of cancer registration data sources, in particular on Cancer Outcomes and Services Dataset (COSD) information. In comparison, National Cancer Registration Data (NCRD) relies on additional data sources, enhanced follow-up with trusts and expert processing by cancer registration officers. The Rapid Cancer Registration Data (RCRD) may be useful for service improvement projects including healthcare planning and prioritisation. However, it is poorly suited for epidemiological research due to limitations in the data quality and completeness.
cancer and sexuality Dataset
kaggle.com
Updated Jun 25, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ayoub chaoui (2021). cancer and sexuality Dataset [Dataset]. https://www.kaggle.com/datasets/ayoubchaoui/cancer-and-sexuality-dataset/discussion?sort=undefined
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 25, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
ayoub chaoui
License
https://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets
Description
Dataset

This dataset was created by ayoub chaoui

Released under World Bank Dataset Terms of Use

Contents

Facebook

Twitter

Click to copy link

Link copied

Cite

(2021). CDC WONDER: Cancer Statistics [Dataset]. https://healthdata.gov/dataset/CDC-WONDER-Cancer-Statistics/mv5s-m59f

CDC WONDER: Cancer Statistics

Explore at:

xml, tsv, application/rssxml, csv, application/rdfxml, jsonAvailable download formats

Dataset updated

Feb 13, 2021

Description

The United States Cancer Statistics (USCS) online databases in WONDER provide cancer incidence and mortality data for the United States for the years since 1999, by year, state and metropolitan areas (MSA), age group, race, ethnicity, sex, childhood cancer classifications and cancer site. Report case counts, deaths, crude and age-adjusted incidence and death rates, and 95% confidence intervals for rates. The USCS data are the official federal statistics on cancer incidence from registries having high-quality data and cancer mortality statistics for 50 states and the District of Columbia. USCS are produced by the Centers for Disease Control and Prevention (CDC) and the National Cancer Institute (NCI), in collaboration with the North American Association of Central Cancer Registries (NAACCR). Mortality data are provided by the Centers for Disease Control and Prevention (CDC), National Center for Health Statistics (NCHS), National Vital Statistics System (NVSS).

Clear search

Close search

Google apps

Main menu

CDC WONDER: Cancer Statistics

BREAST CANCER - Dataset - CKAN

SEER Breast Cancer Data

Cancer Statistics | DATA.GOV.HK

Oral Cancer Prediction Dataset

Cancer Statistics in US States

What are Cancer Statistics in US States?

How to use this dataset

Acknowledgments

The main idea for uploading this dataset is to practice data analysis with my students, as I am working in college and want my student to train our studying ideas in a big dataset, It may be not up to date and I mention the collecting years, but it is a good resource of data to practice

Cancer registration statistics, England

SEER Cancer Statistics Database

‘🎗️ Cancer Rates by U.S. State’ analyzed by Analyst-2

About this dataset

How to use this dataset

Acknowledgements

Start A New Notebook!

Synthetic Oral Cancer Prediction Dataset

Dataset Features

Distribution

Usage

Coverage

License

Who Can Use It

[ARCHIVED] Health Statistics Cancer Rates 2002-2010

Number and rates of new cases of primary cancer, by cancer type, age group...

Cancer (in persons of all ages): England

Breast Cancer India Statewise 2016-2021

Data from: County-level cumulative environmental quality associated with...

Cancer mortality trends, by sex and cancer type

State Cancer Profiles Web site

Urinary biomarkers for pancreatic cancer - Dataset - CKAN

[MI] Rapid Cancer Registration Data

cancer and sexuality Dataset

Dataset

Contents

CDC WONDER: Cancer StatisticsSee More Versions

CDC WONDER: Cancer Statistics