72 datasets found
  1. Lung Cancer Dataset

    • kaggle.com
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aman_Kumar094 (2025). Lung Cancer Dataset [Dataset]. https://www.kaggle.com/datasets/amankumar094/lung-cancer-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 6, 2025
    Dataset provided by
    Kaggle
    Authors
    Aman_Kumar094
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    ** Description**

    This dataset contains data about lung cancer Mortality and is a comprehensive collection of patient information, specifically focused on individuals diagnosed with cancer. This dataset contains comprehensive information on 800,000 individuals related to lung cancer diagnosis, treatment, and outcomes. With 16 well-structured columns. This large-scale dataset is designed to aid researchers, data scientists, and healthcare professionals in studying patterns, building predictive models, and enhancing early detection and treatment strategies.

    🌍 The Societal Impact of Lung Cancer

    Lung cancer is not just a disease — it's a global crisis that steals time, health, and hope from millions of people every year. As the #1 cause of cancer deaths worldwide, it takes more lives annually than breast, colon, and prostate cancer combined.

    But behind every statistic is a story:

    A parent who never saw their child graduate.

    A worker who had to leave their job too soon.

    A community that lost a leader, a friend, a neighbor.

    Why does this matter? Lung cancer often goes undetected until it's too late. It’s aggressive, silent, and devastating — especially in underserved areas where early detection is rare and treatment options are limited. It doesn’t just affect patients. It affects families, economies, and healthcare systems on a massive scale.

    This dataset represents more than numbers. It represents 800,000 real-world stories — people who can help us unlock patterns, train models, and advance life-saving research.

    By working with this data, you're not just analyzing a dataset — you're stepping into the fight against one of humanity’s deadliest diseases.

    Let’s turn insight into impact. (😊The above descriptions is generated with the help of AI, Just wanted to share this dataset That all. Thank you)

  2. a

    Lung Cancer Mortality

    • ph-lacounty.hub.arcgis.com
    • data.lacounty.gov
    • +2more
    Updated Dec 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    County of Los Angeles (2023). Lung Cancer Mortality [Dataset]. https://ph-lacounty.hub.arcgis.com/datasets/lung-cancer-mortality
    Explore at:
    Dataset updated
    Dec 20, 2023
    Dataset authored and provided by
    County of Los Angeles
    Area covered
    Description

    Death rate has been age-adjusted by the 2000 U.S. standard population. Single-year data are only available for Los Angeles County overall, Service Planning Areas, Supervisorial Districts, City of Los Angeles overall, and City of Los Angeles Council Districts.Lung cancer is a leading cause of cancer-related death in the US. People who smoke have the greatest risk of lung cancer, though lung cancer can also occur in people who have never smoked. Most cases are due to long-term tobacco smoking or exposure to secondhand tobacco smoke. Cities and communities can take an active role in curbing tobacco use and reducing lung cancer by adopting policies to regulate tobacco retail; reducing exposure to secondhand smoke in outdoor public spaces, such as parks, restaurants, or in multi-unit housing; and improving access to tobacco cessation programs and other preventive services.For more information about the Community Health Profiles Data Initiative, please see the initiative homepage.

  3. LUNG_CANCER

    • kaggle.com
    zip
    Updated Dec 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Subrahmanya Gaonkar (2023). LUNG_CANCER [Dataset]. https://www.kaggle.com/datasets/subrahmanya090/lung-cancer/code
    Explore at:
    zip(6212460 bytes)Available download formats
    Dataset updated
    Dec 8, 2023
    Authors
    Subrahmanya Gaonkar
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    ****Upvote above**** 👍 https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F13496874%2Fd56f59efa72d43a3da3ae7349235b429%2FScreenshot%202024-03-12%20211249.png?generation=1710258188677782&alt=media" alt="">

    Video on Risk factors of Lung Cancer - ![https://youtu.be/0vVRp5eNDlA?feature=shared]

    Dataset: 1. GENDER: Gender of the individual (M: Male, F: Female) 2. AGE: Age of the individual 3. SMOKING: Smoking status (2: Yes, 1: No) 4. YELLOW_FINGERS: Presence of yellow fingers (2: Yes, 1: No) 5. ANXIETY: Anxiety level (2: High, 1: Low) 6. PEER_PRESSURE: Peer pressure level (2: High, 1: Low) 7. CHRONIC DISEASE: Presence of chronic disease (2: Yes, 1: No) 8. FATIGUE: Fatigue level (2: High, 1: Low) 9. ALLERGY: Allergy status (2: Yes, 1: No) 10. WHEEZING: Wheezing condition (2: Yes, 1: No) 11. ALCOHOL CONSUMING: Alcohol consumption status (2: Yes, 1: No) 12. COUGHING: Presence of coughing (2: Yes, 1: No) 13. SHORTNESS OF BREATH: Shortness of breath condition (2: Yes, 1: No) 14. SWALLOWING DIFFICULTY: Difficulty in swallowing (2: Yes, 1: No) 15. CHEST PAIN: Presence of chest pain (2: Yes, 1: No) 16. LUNG_CANCER: Lung cancer diagnosis (2: Yes, 1: No)

    • Data has 309 rows and 16 columns with floating variables, integer, object which ranges from 0 - 308

    • Lung cancer is the uncontrollable growth of abnormal cells in one or both of the lungs. Cigarette smoking causes most lung cancers when smoke gets in the lungs. Lung cancer kills 1.8 million people each year, more than any other cancer. It has an 80-90% death rate, and is the leading cause of cancer death in men, and the second leading cause of cancer death in women.

    • The global cancer burden is estimated to have risen to 18.1 million new cases and 9.6 million deaths in 2018. One in 5 men and one in 6 women worldwide develop cancer during their lifetime, and one in 8 men and one in 11 women die from the disease. Worldwide, the total number of people who are alive within 5 years of a cancer diagnosis, called the 5-year prevalence, is estimated to be 43.8 million.

  4. d

    Compendium – Mortality from lung cancer

    • digital.nhs.uk
    Updated Jul 21, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Compendium – Mortality from lung cancer [Dataset]. https://digital.nhs.uk/data-and-information/publications/statistical/compendium-mortality/current/mortality-from-lung-cancer
    Explore at:
    Dataset updated
    Jul 21, 2021
    License

    https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions

    Description

    To reduce deaths from lung cancer. For information on the definitions of what these indicators include, please see the relevant specification. From 2016 onwards, mortality counts within the Compendium Mortality Indicator set are based on a bespoke extract taken from the Primary Care Mortality Database (PCMD) maintained by NHS Digital. PCMD is updated monthly using a file of death records from ONS and is continually subject to amendment. It is already well established that late registrations have a small impact on counts. This bespoke extract may be taken at a different time to that of the mortality data published by ONS and as such this may cause some small differences between ONS and NHS Digital mortality figures for a given year.

  5. h

    lung-cancer

    • huggingface.co
    Updated Jun 24, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nate Raw (2022). lung-cancer [Dataset]. https://huggingface.co/datasets/nateraw/lung-cancer
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 24, 2022
    Authors
    Nate Raw
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Dataset Card for Lung Cancer

      Dataset Summary
    

    The effectiveness of cancer prediction system helps the people to know their cancer risk with low cost and it also helps the people to take the appropriate decision based on their cancer risk status. The data is collected from the website online lung cancer prediction system .

      Supported Tasks and Leaderboards
    

    [More Information Needed]

      Languages
    

    [More Information Needed]

      Dataset Structure… See the full description on the dataset page: https://huggingface.co/datasets/nateraw/lung-cancer.
    
  6. c

    National Lung Screening Trial

    • cancerimagingarchive.net
    • stage.cancerimagingarchive.net
    dicom, docx, n/a +2
    Updated Sep 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Cancer Imaging Archive (2021). National Lung Screening Trial [Dataset]. http://doi.org/10.7937/TCIA.HMQ8-J677
    Explore at:
    docx, svs, dicom, n/a, sas, zip, and docAvailable download formats
    Dataset updated
    Sep 24, 2021
    Dataset authored and provided by
    The Cancer Imaging Archive
    License

    https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/

    Time period covered
    Sep 24, 2021
    Dataset funded by
    National Cancer Institutehttp://www.cancer.gov/
    Description

    https://www.cancerimagingarchive.net/wp-content/uploads/nctn-logo-300x108.png" alt="" width="300" height="108" />

    Demographic Summary of Available Imaging

    CharacteristicValue (N = 26254)
    Age (years)Mean ± SD: 61.4± 5
    Median (IQR): 60 (57-65)
    Range: 43-75
    SexMale: 15512 (59%)
    Female: 10742 (41%)
    Race

    White: 23969 (91.3%)
    Black: 1135 (4.3%)
    Asian: 547 (2.1%)
    American Indian/Alaska Native: 88 (0.3%)
    Native Hawaiian/Other Pacific Islander: 87 (0.3%)
    Unknown: 428 (1.6%)

    Ethnicity

    Not Available

    Background: The aggressive and heterogeneous nature of lung cancer has thwarted efforts to reduce mortality from this cancer through the use of screening. The advent of low-dose helical computed tomography (CT) altered the landscape of lung-cancer screening, with studies indicating that low-dose CT detects many tumors at early stages. The National Lung Screening Trial (NLST) was conducted to determine whether screening with low-dose CT could reduce mortality from lung cancer.

    Methods: From August 2002 through April 2004, we enrolled 53,454 persons at high risk for lung cancer at 33 U.S. medical centers. Participants were randomly assigned to undergo three annual screenings with either low-dose CT (26,722 participants) or single-view posteroanterior chest radiography (26,732). Data were collected on cases of lung cancer and deaths from lung cancer that occurred through December 31, 2009. This dataset includes the low-dose CT scans from 26,254 of these subjects, as well as digitized histopathology images from 451 subjects.

    Results: The rate of adherence to screening was more than 90%. The rate of positive screening tests was 24.2% with low-dose CT and 6.9% with radiography over all three rounds. A total of 96.4% of the positive screening results in the low-dose CT group and 94.5% in the radiography group were false positive results. The incidence of lung cancer was 645 cases per 100,000 person-years (1060 cancers) in the low-dose CT group, as compared with 572 cases per 100,000 person-years (941 cancers) in the radiography group (rate ratio, 1.13; 95% confidence interval [CI], 1.03 to 1.23). There were 247 deaths from lung cancer per 100,000 person-years in the low-dose CT group and 309 deaths per 100,000 person-years in the radiography group, representing a relative reduction in mortality from lung cancer with low-dose CT screening of 20.0% (95% CI, 6.8 to 26.7; P=0.004). The rate of death from any cause was reduced in the low-dose CT group, as compared with the radiography group, by 6.7% (95% CI, 1.2 to 13.6; P=0.02).

    Conclusions: Screening with the use of low-dose CT reduces mortality from lung cancer. (Funded by the National Cancer Institute; National Lung Screening Trial ClinicalTrials.gov number, NCT00047385).

    Data Availability: A summary of the National Lung Screening Trial and its available datasets are provided on the Cancer Data Access System (CDAS). CDAS is maintained by Information Management System (IMS), contracted by the National Cancer Institute (NCI) as keepers and statistical analyzers of the NLST trial data. The full clinical data set from NLST is available through CDAS. Users of TCIA can download without restriction a publicly distributable subset of that clinical data, along with the CT and Histopathology images collected during the trial. (These previously were restricted.)

  7. Number and rates of new cases of primary cancer, by cancer type, age group...

    • www150.statcan.gc.ca
    • datasets.ai
    • +2more
    Updated May 19, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Canada, Statistics Canada (2021). Number and rates of new cases of primary cancer, by cancer type, age group and sex [Dataset]. http://doi.org/10.25318/1310011101-eng
    Explore at:
    Dataset updated
    May 19, 2021
    Dataset provided by
    Statistics Canadahttps://statcan.gc.ca/en
    Area covered
    Canada
    Description

    Number and rate of new cancer cases diagnosed annually from 1992 to the most recent diagnosis year available. Included are all invasive cancers and in situ bladder cancer with cases defined using the Surveillance, Epidemiology and End Results (SEER) Groups for Primary Site based on the World Health Organization International Classification of Diseases for Oncology, Third Edition (ICD-O-3). Random rounding of case counts to the nearest multiple of 5 is used to prevent inappropriate disclosure of health-related information.

  8. Lung Cancer Risk & Prediction Dataset

    • kaggle.com
    zip
    Updated Feb 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ankush Panday (2025). Lung Cancer Risk & Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/ankushpanday1/lung-cancer-risk-and-prediction-dataset
    Explore at:
    zip(16114231 bytes)Available download formats
    Dataset updated
    Feb 11, 2025
    Authors
    Ankush Panday
    License

    https://cdla.io/permissive-1-0/https://cdla.io/permissive-1-0/

    Description

    This dataset helps understand and predict lung cancer risks based on health, environment, and lifestyle factors. It includes details about smoking habits, pollution exposure, healthcare access, and survival chances.

    Doctors, researchers, and data scientists can use it to find patterns in lung cancer cases and improve early detection.

    Columns Breakdown (25 Features) Country – The country where the patient resides Age – Patient’s age (randomized between 30-90) Gender – Male/Female Smoking_Status – Smoker, Non-Smoker, Former Smoker Second_Hand_Smoke – Yes/No Air_Pollution_Exposure – Low, Medium, High Occupation_Exposure – Yes/No (Factory, Mining, etc.) Rural_or_Urban – Rural/Urban Socioeconomic_Status – Low, Middle, High Healthcare_Access – Good, Limited, Poor Insurance_Coverage – Yes/No Screening_Availability – Yes/No Stage_at_Diagnosis – I, II, III, IV Cancer_Type – NSCLC, SCLC Mutation_Type – EGFR, ALK, KRAS, None Treatment_Access – Full, Partial, None Clinical_Trial_Access – Yes/No Language_Barrier – Yes/No Mortality_Risk – Probability (0.0 - 1.0) 5_Year_Survival_Probability – Probability (0.0 - 1.0) Delay_in_Diagnosis – Yes/No Family_History – Yes/No Indoor_Smoke_Exposure – Yes/No Tobacco_Marketing_Exposure – Yes/No Final_Prediction – Lung Cancer (Yes/No)

  9. s

    One-year survival from breast, lung and colorectal cancer (NHSOF 1.4.iii) -...

    • ckan.publishing.service.gov.uk
    Updated Aug 4, 2015
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2015). One-year survival from breast, lung and colorectal cancer (NHSOF 1.4.iii) - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/one-year-survival-from-breast-lung-and-colorectal-cancer-nhsof-1-4-iii
    Explore at:
    Dataset updated
    Aug 4, 2015
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    A measure of the number of adults diagnosed with breast, lung or colorectal cancer in a year who are still alive one year after diagnosis. ONS still publish survival percentages for individual types of cancers. These can be found at: http://www.ons.gov.uk/ons/rel/cancer-unit/cancer-survival/cancer-survival-in-england--patients-diagnosed-2007-2011-and-followed-up-to-2012/index.html A time series for one-year survival figures for breast, lung and colorectal cancer individually (previous NHS Outcomes Framework indicators 1.4.i, 1.4.iii and 1.4.v) is still published and can be found under the link 'Indicator data - previous methodology (.xls)' below. Purpose This indicator attempts to capture the success of the NHS in preventing people from dying once they have been diagnosed with breast, lung or colorectal cancer. Current version updated: Feb-14 Next version due: To be confirmed

  10. National Lung Cancer Audit State of the Nation Report 2025 - Dataset -...

    • ckan.publishing.service.gov.uk
    Updated Apr 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2025). National Lung Cancer Audit State of the Nation Report 2025 - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/national-lung-cancer-audit-state-of-the-nation-report-2025
    Explore at:
    Dataset updated
    Apr 11, 2025
    Dataset provided by
    CKANhttps://ckan.org/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    The National Lung Cancer Audit (NLCA) evaluates how the care received by people diagnosed with lung cancer in England and Wales compares with recommended practice and provides information that supports healthcare providers, commissioners, and regulators to improve the care for patients. The NLCA reports a set of process and outcome measures that cover important aspects of the care pathway for people diagnosed with lung cancer. In the NLCA State of the Nation report 2025, we give an overview of the patterns of care and outcomes for 37,750 people diagnosed with lung cancer in England in 2023. A separate section provides describes results for 2,334 people diagnosed in Wales in 2023. The report describes summarises the performance of lung cancer services in 2023 and compares this to the situation in 2020, 2021 and 2022.

  11. g

    Five-year survival from breast, lung and colorectal cancer (NHSOF 1.4.iv) |...

    • gimi9.com
    Updated Jul 8, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2014). Five-year survival from breast, lung and colorectal cancer (NHSOF 1.4.iv) | gimi9.com [Dataset]. https://gimi9.com/dataset/uk_five-year-survival-from-breast-lung-and-colorectal-cancer-nhsof-1-4-iv/
    Explore at:
    Dataset updated
    Jul 8, 2014
    Description

    A measure of the number of adults diagnosed with breast, lung or colorectal cancer in a year who are still alive five years after diagnosis. ONS still publish survival percentages for individual types of cancers. These can be found at: http://www.ons.gov.uk/ons/rel/cancer-unit/cancer-survival/cancer-survival-in-england--patients-diagnosed-2007-2011-and-followed-up-to-2012/index.html A time series for five-year survival figures for breast, lung and colorectal cancer individually (previous NHS Outcomes Framework indicators 1.4.ii, 1.4.iv and 1.4.vi) is still published and can be found under the link 'Indicator data - previous methodology (.xls)' below. Purpose This indicator attempts to capture the success of the NHS in preventing people from dying once they have been diagnosed with breast, lung or colorectal cancer. Current version updated: May-14 Next version due: To be confirmed

  12. Lung Cancer Prediction

    • kaggle.com
    Updated Nov 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2022). Lung Cancer Prediction [Dataset]. https://www.kaggle.com/datasets/thedevastator/cancer-patients-and-air-pollution-a-new-link/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 14, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    The Devastator
    Description

    Lung Cancer Prediction

    Air Pollution, Alcohol, Smoking & Risk of Lung Cancer

    About this dataset

    This dataset contains information on patients with lung cancer, including their age, gender, air pollution exposure, alcohol use, dust allergy, occupational hazards, genetic risk, chronic lung disease, balanced diet, obesity, smoking, passive smoker, chest pain, coughing of blood, fatigue, weight loss ,shortness of breath ,wheezing ,swallowing difficulty ,clubbing of finger nails and snoring

    How to use the dataset

    Lung cancer is the leading cause of cancer death worldwide, accounting for 1.59 million deaths in 2018. The majority of lung cancer cases are attributed to smoking, but exposure to air pollution is also a risk factor. A new study has found that air pollution may be linked to an increased risk of lung cancer, even in nonsmokers.

    The study, which was published in the journal Nature Medicine, looked at data from over 462,000 people in China who were followed for an average of six years. The participants were divided into two groups: those who lived in areas with high levels of air pollution and those who lived in areas with low levels of air pollution.

    The researchers found that the people in the high-pollution group were more likely to develop lung cancer than those in the low-pollution group. They also found that the risk was higher in nonsmokers than smokers, and that the risk increased with age.

    While this study does not prove that air pollution causes lung cancer, it does suggest that there may be a link between the two. More research is needed to confirm these findings and to determine what effect different types and levels of air pollution may have on lung cancer risk

    Research Ideas

    • predicting the likelihood of a patient developing lung cancer
    • identifying risk factors for lung cancer
    • determining the most effective treatment for a patient with lung cancer

    Acknowledgements

    License

    See the dataset description for more information.

    Columns

    File: cancer patient data sets.csv | Column name | Description | |:-----------------------------|:--------------------------------------------------------------------| | Age | The age of the patient. (Numeric) | | Gender | The gender of the patient. (Categorical) | | Air Pollution | The level of air pollution exposure of the patient. (Categorical) | | Alcohol use | The level of alcohol use of the patient. (Categorical) | | Dust Allergy | The level of dust allergy of the patient. (Categorical) | | OccuPational Hazards | The level of occupational hazards of the patient. (Categorical) | | Genetic Risk | The level of genetic risk of the patient. (Categorical) | | chronic Lung Disease | The level of chronic lung disease of the patient. (Categorical) | | Balanced Diet | The level of balanced diet of the patient. (Categorical) | | Obesity | The level of obesity of the patient. (Categorical) | | Smoking | The level of smoking of the patient. (Categorical) | | Passive Smoker | The level of passive smoker of the patient. (Categorical) | | Chest Pain | The level of chest pain of the patient. (Categorical) | | Coughing of Blood | The level of coughing of blood of the patient. (Categorical) | | Fatigue | The level of fatigue of the patient. (Categorical) | | Weight Loss | The level of weight loss of the patient. (Categorical) | | Shortness of Breath | The level of shortness of breath of the patient. (Categorical) | | Wheezing | The level of wheezing of the patient. (Categorical) | | Swallowing Difficulty | The level of swallowing difficulty of the patient. (Categorical) | | Clubbing of Finger Nails | The level of clubbing of finger nails of the patient. (Categorical) |

  13. Mortality and potential years of life lost, by selected causes of death and...

    • www150.statcan.gc.ca
    • data.urbandatacentre.ca
    • +2more
    Updated Mar 16, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Canada, Statistics Canada (2016). Mortality and potential years of life lost, by selected causes of death and sex, three-year average, census metropolitan areas occasional (number) [Dataset]. http://doi.org/10.25318/1310074101-eng
    Explore at:
    Dataset updated
    Mar 16, 2016
    Dataset provided by
    Statistics Canadahttps://statcan.gc.ca/en
    Area covered
    Canada
    Description

    This table contains 33048 series, with data for years 2000/2002 - 2010/2012 (not all combinations necessarily have data for all years), and was last released on 2016-03-16. This table contains data described by the following dimensions (Not all combinations are available): Geography (36 items: Total, census metropolitan areas; St. John's, Newfoundland and Labrador; Halifax, Nova Scotia;Moncton, New Brunswick; ...), Sex (3 items: Both sexes; Males; Females), Indicators (2 items: Mortality; Potential years of life lost), Selected causes of death (ICD-10) (17 items: Total, all causes of death; All malignant neoplasms (cancers); Colorectal cancer; Lung cancer; ...), Characteristics (9 items: Number; Low 95% confidence interval, number; High 95% confidence interval, number; Rate; ...).

  14. Adult Smoking Prevalence - Dataset - data.gov.uk

    • ckan.publishing.service.gov.uk
    Updated Jul 11, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2017). Adult Smoking Prevalence - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/adult-smoking-prevalence
    Explore at:
    Dataset updated
    Jul 11, 2017
    Dataset provided by
    CKANhttps://ckan.org/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This data shows the percentage of adults (age 18 and over) who are current smokers. Smoking is the single biggest cause of preventable death and illnesses, and big inequalities exist between and within communities. Smoking is a major risk factor for many diseases, such as lung cancer, chronic obstructive pulmonary disease (COPD, bronchitis and emphysema) and heart disease. It is also associated with cancers in other organs. Smoking is a modifiable lifestyle risk factor. Preventing people from starting smoking is important in reducing the health harms and inequalities. This data is based on the Office for National Statistics (ONS) Annual Population Survey (APS). The percentage of adults is not age-standardised. In this dataset particularly at district level there may be inherent statistical uncertainty in some data values. Thus as with many other datasets, this data should be used together with other data and resources to obtain a fuller picture. Data source: Office for Health Improvement and Disparities (OHID) Public Health Outcomes Framework (PHOF) indicator 92443 (Number 15). This data is updated annually.

  15. BARD1 serum autoantibodies for the detection of lung cancer

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maxim Pilyugin; Pascaline Descloux; Pierre-Alain André; Viktoria Laszlo; Balazs Dome; Balazs Hegedus; Sylvain Sardy; Samuel Janes; Andrea Bianco; Geoffrey J. Laurent; Irmgard Irminger-Finger (2023). BARD1 serum autoantibodies for the detection of lung cancer [Dataset]. http://doi.org/10.1371/journal.pone.0182356
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Maxim Pilyugin; Pascaline Descloux; Pierre-Alain André; Viktoria Laszlo; Balazs Dome; Balazs Hegedus; Sylvain Sardy; Samuel Janes; Andrea Bianco; Geoffrey J. Laurent; Irmgard Irminger-Finger
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PurposeCurrently the screening for lung cancer for risk groups is based on Computed Tomography (CT) or low dose CT (LDCT); however, the lung cancer death rate has not decreased significantly with people undergoing LDCT. We aimed to develop a simple reliable blood test for early detection of all types of lung cancer based on the immunogenicity of aberrant forms of BARD1 that are specifically upregulated in lung cancer.MethodsELISA assays were performed with a panel of BARD1 epitopes to detect serum levels of antibodies against BARD1 epitopes. We tested 194 blood samples from healthy donors and lung cancer patients with a panel of 40 BARD1 antigens. Using fitted Lasso logistic regression we determined the optimal combination of BARD1 antigens to be used in ELISA for discriminating lung cancer from healthy controls. Random selection of samples for training sets or validations sets was applied to validate the accuracy of our test.ResultsFitted Lasso logistic regression models predict high accuracy of the BARD1 autoimmune antibody test with an AUC = 0.96. Validation in independent samples provided and AUC = 0.86 and identical AUCs were obtained for combined stages 1–3 and late stage 4 lung cancers. The BARD1 antibody test is highly specific for lung cancer and not breast or ovarian cancer.ConclusionThe BARD1 lung cancer test shows higher sensitivity and specificity than previously published blood tests for lung cancer detection and/or diagnosis or CT scans, and it could detect all types and all stages of lung cancer. This BARD1 lung cancer test could therefore be further developed as i) screening test for early detection of lung cancers in high-risk groups, and ii) diagnostic aid in complementing CT scan.

  16. r

    AIHW - Cancer Incidence and Mortality Across Regions (CIMAR) - Persons...

    • researchdata.edu.au
    null
    Updated Jun 28, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of the Commonwealth of Australia - Australian Institute of Health and Welfare (2023). AIHW - Cancer Incidence and Mortality Across Regions (CIMAR) - Persons Mortality (GCCSA) 2009-2013 [Dataset]. https://researchdata.edu.au/aihw-cancer-incidence-2009-2013/2738757
    Explore at:
    nullAvailable download formats
    Dataset updated
    Jun 28, 2023
    Dataset provided by
    Australian Urban Research Infrastructure Network (AURIN)
    Authors
    Government of the Commonwealth of Australia - Australian Institute of Health and Welfare
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Area covered
    Description

    This dataset presents the footprint of cancer mortality statistics in Australia for all cancers combined and the 6 top cancer groupings (colorectal, leukaemia, lung, lymphoma, melanoma of the skin and pancreas) and their respective ICD-10 codes. The data spans the years 2009-2013 and is aggregated to Greater Capital City Statistical Areas (GCCSA) from the 2011 Australian Statistical Geography Standard (ASGS).

    Mortality data refer to the number of deaths due to cancer in a given time period. Cancer deaths data are sourced from the Australian Institute of Health and Welfare (AIHW) 2013 National Mortality Database (NMD).

    For further information about this dataset, please visit:

    Please note:

    • AURIN has spatially enabled the original data.

    • Due to changes in geographic classifications over time, long-term trends are not available.

    • Values assigned to "n.p." in the original data have been removed from the data.

    • The Australian and jurisdictional totals include people who could not be assigned a GCCSA. The number of people who could not be assigned a GCCSA is less than 1% of the total.

    • The Australian total also includes residents of Other Territories (Cocos (Keeling) Islands, Christmas Island and Jervis Bay Territory).

    • Cause of Death Unit Record File data are provided to the AIHW by the Registries of Births, Deaths and Marriages and the National Coronial Information System (managed by the Victorian Department of Justice) and include cause of death coded by the Australian Bureau of Statistics (ABS). The data are maintained by the AIHW in the NMD.

    • Year refers to year of occurrence of death for years up to and including 2012, and year of registration of death for 2013. Deaths registered in 2011 and earlier are based on the final version of cause of death data; deaths registered in 2012 and 2013 are based on revised and preliminary versions, respectively and are subject to further revision by the ABS.

    • Cause of death information are based on underlying cause of death and are classified according to the International Classification of Diseases and Related Health Problems (ICD). Deaths registered in 1997 onwards are classified according to the 10th revision (ICD-10).

    • Colorectal deaths presented are underestimates. For further information, refer to "Complexities in the measurement of bowel cancer in Australia" in Causes of Death, Australia (ABS cat. no. 3303.0).

  17. H

    National Occupational Respiratory Mortality System (NORMS)

    • data.niaid.nih.gov
    • dataverse.harvard.edu
    Updated Mar 31, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2011). National Occupational Respiratory Mortality System (NORMS) [Dataset]. http://doi.org/10.7910/DVN/ZATO3A
    Explore at:
    Dataset updated
    Mar 31, 2011
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Users can search this database pertaining to respiratory conditions such as asthma, pneumonia, bronchitis, and tuberculosis. BackgroundThe National Occupational Respiratory Mortality System (NORMS) is developed and maintained by National Institute of Occupational Safety and Health (NIOSH) of the Centers for Disease Control and Prevention (CDC). This surveillance system includes respiratory conditions such as: asthma, pneumonia, bronchitis, tuberculosis, lung cancer, and silicosis, among others. User FunctionalityUsers can generate national- or occupation-specific queries. Users can gener ate tables, charts and maps containing the summary statistics such as number of deaths, crude death rates, age-adjusted death rates, and years of potential life lost (YPLL ). Users can also download the dataset and/or data queries into Microsoft Excel. Data NotesThis website provides data history regarding revisions to the dataset. Data from additional sources (i.e., population estimates, comparative standard population, and life-table values) are also available. National mortality data is derived from the National Center for Health Statistics (NCHS) multiple cause of death records. These data are updated annually since 1968, unless otherwise indicated. Data are available on national, state, and county levels. The most recent d ata available is from 2007.

  18. f

    Table_1_The presence of autoantibodies is associated with improved overall...

    • datasetcatalog.nlm.nih.gov
    • frontiersin.figshare.com
    Updated Dec 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhao, Huijuan; Ouyang, Libo; Zheng, Peiming; Chen, Lianlian; Li, Gang; Wang, Rong; Cai, Jun; Jing, Keying (2023). Table_1_The presence of autoantibodies is associated with improved overall survival in lung cancer patients.docx [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000942824
    Explore at:
    Dataset updated
    Dec 19, 2023
    Authors
    Zhao, Huijuan; Ouyang, Libo; Zheng, Peiming; Chen, Lianlian; Li, Gang; Wang, Rong; Cai, Jun; Jing, Keying
    Description

    ObjectiveAutoantibodies have been reported to be associated with cancers. As a biomarker, autoantibodies have been widely used in the early screening of lung cancer. However, the correlation between autoantibodies and the prognosis of lung cancer patients is poorly understood, especially in the Asian population. This retrospective study investigated the association between the presence of autoantibodies and outcomes in patients with lung cancer.MethodsA total of 264 patients diagnosed with lung cancer were tested for autoantibodies in Henan Provincial People’s Hospital from January 2017 to June 2022. The general clinical data of these patients were collected, and after screening out those who met the exclusion criteria, 151 patients were finally included in the study. The Cox proportional hazards model was used to analyze the effect of autoantibodies on the outcomes of patients with lung cancer. The Kaplan-Meier curve was used to analyze the relationship between autoantibodies and the overall survival of patients with lung cancer.ResultsCompared to lung cancer patients without autoantibodies, those with autoantibodies had an associated reduced risk of death (HRs: 0.45, 95% CIs 0.27~0.77), independent of gender, age, smoking history, pathological type, and pathological stage of lung cancer. Additionally, the association was found to be more significant by subgroup analysis in male patients, younger patients, and patients with small cell lung cancer. Furthermore, lung cancer patients with autoantibodies had significantly longer survival time than those without autoantibodies.ConclusionThe presence of autoantibodies is an independent indicator of good prognosis in patients with lung cancer, providing a new biomarker for prognostic evaluation in patients with lung cancer.

  19. e

    Adult Smoking Prevalence

    • data.europa.eu
    • data.wu.ac.at
    csv, html
    Updated Jul 11, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lincolnshire County Council (2017). Adult Smoking Prevalence [Dataset]. https://data.europa.eu/data/datasets/adult-smoking-prevalence
    Explore at:
    html, csvAvailable download formats
    Dataset updated
    Jul 11, 2017
    Dataset authored and provided by
    Lincolnshire County Council
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This data shows the percentage of adults (age 18 and over) who are current smokers.

    Smoking is the single biggest cause of preventable death and illnesses, and big inequalities exist between and within communities. Smoking is a major risk factor for many diseases, such as lung cancer, chronic obstructive pulmonary disease (COPD, bronchitis and emphysema) and heart disease. It is also associated with cancers in other organs.

    Smoking is a modifiable lifestyle risk factor. Preventing people from starting smoking is important in reducing the health harms and inequalities.

    This data is based on the Office for National Statistics (ONS) Annual Population Survey (APS). The percentage of adults is not age-standardised. In this dataset particularly at district level there may be inherent statistical uncertainty in some data values. Thus as with many other datasets, this data should be used together with other data and resources to obtain a fuller picture.

    Data source: Public Health England, Public Health Outcomes Framework (PHOF) indicator 92443 (Number 15). This data is updated annually.

  20. d

    Adult Smoking Prevalence - Dataset - Datopian CKAN instance

    • demo.dev.datopian.com
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Adult Smoking Prevalence - Dataset - Datopian CKAN instance [Dataset]. https://demo.dev.datopian.com/dataset/lcc--adult-smoking-prevalence
    Explore at:
    Dataset updated
    Oct 7, 2025
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This data shows the percentage of adults (age 18 and over) who are current smokers. Smoking is the single biggest cause of preventable death and illnesses, and big inequalities exist between and within communities. Smoking is a major risk factor for many diseases, such as lung cancer, chronic obstructive pulmonary disease (COPD, bronchitis and emphysema) and heart disease. It is also associated with cancers in other organs. Smoking is a modifiable lifestyle risk factor. Preventing people from starting smoking is important in reducing the health harms and inequalities. This data is based on the Office for National Statistics (ONS) Annual Population Survey (APS). The percentage of adults is not age-standardised. In this dataset particularly at district level there may be inherent statistical uncertainty in some data values. Thus as with many other datasets, this data should be used together with other data and resources to obtain a fuller picture. Data source: Office for Health Improvement and Disparities (OHID) Public Health Outcomes Framework (PHOF) indicator 92443 (Number 15). This data is updated annually.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Aman_Kumar094 (2025). Lung Cancer Dataset [Dataset]. https://www.kaggle.com/datasets/amankumar094/lung-cancer-dataset
Organization logo

Lung Cancer Dataset

Smoking Doesn't Make u Look Cool.It's Harmful for You and Your Family. Stay Safe

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 6, 2025
Dataset provided by
Kaggle
Authors
Aman_Kumar094
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

** Description**

This dataset contains data about lung cancer Mortality and is a comprehensive collection of patient information, specifically focused on individuals diagnosed with cancer. This dataset contains comprehensive information on 800,000 individuals related to lung cancer diagnosis, treatment, and outcomes. With 16 well-structured columns. This large-scale dataset is designed to aid researchers, data scientists, and healthcare professionals in studying patterns, building predictive models, and enhancing early detection and treatment strategies.

🌍 The Societal Impact of Lung Cancer

Lung cancer is not just a disease — it's a global crisis that steals time, health, and hope from millions of people every year. As the #1 cause of cancer deaths worldwide, it takes more lives annually than breast, colon, and prostate cancer combined.

But behind every statistic is a story:

A parent who never saw their child graduate.

A worker who had to leave their job too soon.

A community that lost a leader, a friend, a neighbor.

Why does this matter? Lung cancer often goes undetected until it's too late. It’s aggressive, silent, and devastating — especially in underserved areas where early detection is rare and treatment options are limited. It doesn’t just affect patients. It affects families, economies, and healthcare systems on a massive scale.

This dataset represents more than numbers. It represents 800,000 real-world stories — people who can help us unlock patterns, train models, and advance life-saving research.

By working with this data, you're not just analyzing a dataset — you're stepping into the fight against one of humanity’s deadliest diseases.

Let’s turn insight into impact. (😊The above descriptions is generated with the help of AI, Just wanted to share this dataset That all. Thank you)

Search
Clear search
Close search
Google apps
Main menu