43 datasets found
  1. p

    Breast Cancer Dataset - Dataset - CKAN

    • data.poltekkes-smg.ac.id
    Updated Oct 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Breast Cancer Dataset - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/breast-cancer-dataset
    Explore at:
    Dataset updated
    Oct 7, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Description: Breast cancer is the most common cancer amongst women in the world. It accounts for 25% of all cancer cases, and affected over 2.1 Million people in 2015 alone. It starts when cells in the breast begin to grow out of control. These cells usually form tumors that can be seen via X-ray or felt as lumps in the breast area. The key challenges against it’s detection is how to classify tumors into malignant (cancerous) or benign(non cancerous). We ask you to complete the analysis of classifying these tumors using machine learning (with SVMs) and the Breast Cancer Wisconsin (Diagnostic) Dataset. Acknowledgements: This dataset has been referred from Kaggle. Objective: Understand the Dataset & cleanup (if required). Build classification models to predict whether the cancer type is Malignant or Benign. Also fine-tune the hyperparameters & compare the evaluation metrics of various classification algorithms.

  2. d

    Percent Receiving Breast Cancer Screenings

    • data.ore.dc.gov
    Updated Aug 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Washington, DC (2024). Percent Receiving Breast Cancer Screenings [Dataset]. https://data.ore.dc.gov/datasets/percent-receiving-breast-cancer-screenings
    Explore at:
    Dataset updated
    Aug 28, 2024
    Dataset authored and provided by
    City of Washington, DC
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Some racial and ethnic categories are suppressed for privacy and to avoid misleading estimates when the relative standard error exceeds 30% or the unweighted sample size is less than 50 respondents.

    Data Source: Centers for Disease Control and Prevention (CDC). Behavioral Risk Factor Surveillance System Survey (BRFSS) Data

    Why This Matters

    Breast cancer is the most commonly diagnosed cancer in women and people assigned female at birth (AFAB) and the second leading cause of cancer death in the U.S. Breast cancer screenings can save lives by helping to detect breast cancer in its early stages when treatment is more effective.

    While non-Hispanic white women and AFAB individuals are more likely to be diagnosed with breast cancer than their counterparts of other races and ethnicities, non-Hispanic Black women and AFAB individuals die from breast cancer at a significantly higher rate than their counterparts races and ethnicities.

    Later-stage diagnoses and prolonged treatment duration partly explain these disparities in mortality rate. Structural barriers to quality health care, insurance, education, affordable housing, and sustainable income that disproportionately affect communities of color also drive racial inequities in breast cancer screenings and mortality.

    The District Response

    Project Women Into Staying Healthy (WISH) provides free breast and cervical cancer screenings to uninsured or underinsured women and AFAB adults aged 21 to 64. Patient navigation, transportation assistance, and cancer education are also provided.

    DC Health’s Cancer and Chronic Disease Prevention Bureau works with healthcare providers to improve the use of preventative health services and provide breast cancer screening services.

    DC Health maintains the District of Columbia Cancer Registry (DCCR) to track cancer incidences, examine environmental substances that cause cancer, and identify differences in cancer incidences by age, gender, race, and geographical location.

  3. h

    breastcancer-auto-segmentation

    • huggingface.co
    Updated Apr 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Clelia Astra Bertelli (2024). breastcancer-auto-segmentation [Dataset]. https://huggingface.co/datasets/as-cle-bert/breastcancer-auto-segmentation
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 8, 2024
    Authors
    Clelia Astra Bertelli
    License

    https://choosealicense.com/licenses/cc/https://choosealicense.com/licenses/cc/

    Description

    breastcanc-ultrasound-class

      Background
    

    Cancer is the second leading cause of death worldwide, according to IHME - Global Burden of Disease, with 10.7 mln casualties in 2019.

    Amongst the various types of cancer, a huge role is played by breast cancer, which stands in 4th position among the deadliest tumors, with more than 700.000 deaths during 2019 (IHME - Global Burden of Disease).

    Moreover, breast cancer has the highest share of number of cases/100 people worldwide… See the full description on the dataset page: https://huggingface.co/datasets/as-cle-bert/breastcancer-auto-segmentation.

  4. RSNA Mammography Breast Cancer TFRecord Dataset

    • kaggle.com
    Updated Dec 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    muhammed (2023). RSNA Mammography Breast Cancer TFRecord Dataset [Dataset]. https://www.kaggle.com/datasets/clkmuhammed/rsna-mammography-breast-cancer-tfrecord-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 17, 2023
    Dataset provided by
    Kaggle
    Authors
    muhammed
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Source RSNA Screening Mammography Breast Cancer Detection

    Processing of the huge 314GB+ Dataset (Include 54713 Images) of this competition into TFRecords for fast dataloading during training.

    All images are resized to 768x1280 and saved in 100 TFRecords, making each TFRecord contain roughly 548 images as 8.6GB+ Dataset.

    TFRecords have the benefit of loading large chunks of data containing many samples instead of loading every image and label seperately.

    Dataset Description

    Note: The dataset for this challenge contains radiographic breast images of female subjects. The goal of this competition is to identify cases of breast cancer in mammograms from screening exams. It is important to identify cases of cancer for obvious reasons, but false positives also have downsides for patients. As millions of women get mammograms each year, a useful machine learning tool could help a great many people. This competition uses a hidden test. When your submitted notebook is scored the actual test data (including a full length sample submission) will be made available to your notebook.

    Files

    [train/test]_images/[patient_id]/[image_id].dcm The mammograms, in dicom format. You can expect roughly 8,000 patients in the hidden test set. There are usually but not always 4 images per patient. Note that many of the images use the jpeg 2000 format which may you may need special libraries to load.

    sample_submission.csv A valid sample submission. Only the first few rows are available for download.

    [train/test].csv Metadata for each patient and image. Only the first few rows of the test set are available for download.

    site_id - ID code for the source hospital. patient_id - ID code for the patient. image_id - ID code for the image. laterality - Whether the image is of the left or right breast. view - The orientation of the image. The default for a screening exam is to capture two views per breast. age - The patient's age in years. implant - Whether or not the patient had breast implants. Site 1 only provides breast implant information at the patient level, not at the breast level. density - A rating for how dense the breast tissue is, with A being the least dense and D being the most dense. Extremely dense tissue can make diagnosis more difficult. Only provided for train. machine_id - An ID code for the imaging device. cancer - Whether or not the breast was positive for malignant cancer. The target value. Only provided for train. biopsy - Whether or not a follow-up biopsy was performed on the breast. Only provided for train. invasive - If the breast is positive for cancer, whether or not the cancer proved to be invasive. Only provided for train. BIRADS - 0 if the breast required follow-up, 1 if the breast was rated as negative for cancer, and 2 if the breast was rated as normal. Only provided for train. prediction_id - The ID for the matching submission row. Multiple images will share the same prediction ID. Test only. difficult_negative_case - True if the case was unusually difficult. Only provided for train.

  5. Five-year survival from breast, lung and colorectal cancer (NHSOF 1.4.iv)

    • data.europa.eu
    • data.wu.ac.at
    csv, excel xls
    Updated Oct 11, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NHS Digital (2021). Five-year survival from breast, lung and colorectal cancer (NHSOF 1.4.iv) [Dataset]. https://data.europa.eu/data/datasets/five-year-survival-from-breast-lung-and-colorectal-cancer-nhsof-1-4-iv
    Explore at:
    csv, excel xlsAvailable download formats
    Dataset updated
    Oct 11, 2021
    Dataset provided by
    National Health Servicehttps://www.nhs.uk/
    NHS Digitalhttps://digital.nhs.uk/
    Authors
    NHS Digital
    License

    http://reference.data.gov.uk/id/open-government-licencehttp://reference.data.gov.uk/id/open-government-licence

    Description

    A measure of the number of adults diagnosed with breast, lung or colorectal cancer in a year who are still alive five years after diagnosis.

    ONS still publish survival percentages for individual types of cancers. These can be found at: http://www.ons.gov.uk/ons/rel/cancer-unit/cancer-survival/cancer-survival-in-england--patients-diagnosed-2007-2011-and-followed-up-to-2012/index.html

    A time series for five-year survival figures for breast, lung and colorectal cancer individually (previous NHS Outcomes Framework indicators 1.4.ii, 1.4.iv and 1.4.vi) is still published and can be found under the link 'Indicator data - previous methodology (.xls)' below.

    Purpose

    This indicator attempts to capture the success of the NHS in preventing people from dying once they have been diagnosed with breast, lung or colorectal cancer.

    Current version updated: May-14

    Next version due: To be confirmed

  6. h

    breastcancer-auto-objdetect

    • huggingface.co
    Updated Apr 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Clelia Astra Bertelli (2024). breastcancer-auto-objdetect [Dataset]. https://huggingface.co/datasets/as-cle-bert/breastcancer-auto-objdetect
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 13, 2024
    Authors
    Clelia Astra Bertelli
    License

    https://choosealicense.com/licenses/cc/https://choosealicense.com/licenses/cc/

    Description

    breastcanc-ultrasound-class

      Background
    

    Cancer is the second leading cause of death worldwide, according to IHME - Global Burden of Disease, with 10.7 mln casualties in 2019.

    Amongst the various types of cancer, a huge role is played by breast cancer, which stands in 4th position among the deadliest tumors, with more than 700.000 deaths during 2019 (IHME - Global Burden of Disease).

    Moreover, breast cancer has the highest share of number of cases/100 people worldwide… See the full description on the dataset page: https://huggingface.co/datasets/as-cle-bert/breastcancer-auto-objdetect.

  7. G

    Health Status: Breast Cancer Rates, 1986 to 1995

    • open.canada.ca
    • data.amerigeoss.org
    • +1more
    jp2, zip
    Updated Mar 14, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Natural Resources Canada (2022). Health Status: Breast Cancer Rates, 1986 to 1995 [Dataset]. https://open.canada.ca/data/dataset/f146e480-8893-11e0-b60f-6cf049291510
    Explore at:
    zip, jp2Available download formats
    Dataset updated
    Mar 14, 2022
    Dataset provided by
    Natural Resources Canada
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Description

    One woman in nine can expect to develop breast cancer during her lifetime and one in 25 will die from the disease. Statistically low incidences of breast cancer are found in Newfoundland and Labrador, the territories, and northern areas of most provinces. Otherwise, each province has one or more pockets of significantly high breast cancer incidence. These are often located in more southerly areas, but they do not seem to be restricted to either urban or rural areas alone. Breast cancer rates are a health status indicator. They can be used to help assess health conditions. Health status refers to the state of health of a person or group, and measures causes of sickness and death. It can also include people’s assessment of their own health.

  8. Breast cancer ICPR contest

    • kaggle.com
    Updated May 15, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    honestInnocent (2020). Breast cancer ICPR contest [Dataset]. https://www.kaggle.com/sainathseelam/breast-cancer-icpr-contest/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 15, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    honestInnocent
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    There's a story behind every dataset and here's your opportunity to share yours.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  9. A

    ‘Breast Cancer (METABRIC)’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Nov 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Breast Cancer (METABRIC)’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-breast-cancer-metabric-3009/50fd3acf/?iid=120-705&v=presentation
    Explore at:
    Dataset updated
    Nov 16, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Breast Cancer (METABRIC)’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/gunesevitan/breast-cancer-metabric on 12 November 2021.

    --- Dataset description provided by original source is as follows ---

    Context

    There's a story behind every dataset and here's your opportunity to share yours.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

    --- Original source retains full ownership of the source dataset ---

  10. One-year survival from breast, lung and colorectal cancer (NHSOF 1.4.iii)

    • data.europa.eu
    csv, excel xls
    Updated Oct 30, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NHS Digital (2021). One-year survival from breast, lung and colorectal cancer (NHSOF 1.4.iii) [Dataset]. https://data.europa.eu/data/datasets/one-year-survival-from-breast-lung-and-colorectal-cancer-nhsof-1-4-iii?locale=hu
    Explore at:
    csv, excel xlsAvailable download formats
    Dataset updated
    Oct 30, 2021
    Dataset authored and provided by
    NHS Digitalhttps://digital.nhs.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    A measure of the number of adults diagnosed with breast, lung or colorectal cancer in a year who are still alive one year after diagnosis.

    ONS still publish survival percentages for individual types of cancers. These can be found at: http://www.ons.gov.uk/ons/rel/cancer-unit/cancer-survival/cancer-survival-in-england--patients-diagnosed-2007-2011-and-followed-up-to-2012/index.html

    A time series for one-year survival figures for breast, lung and colorectal cancer individually (previous NHS Outcomes Framework indicators 1.4.i, 1.4.iii and 1.4.v) is still published and can be found under the link 'Indicator data - previous methodology (.xls)' below.

    Purpose

    This indicator attempts to capture the success of the NHS in preventing people from dying once they have been diagnosed with breast, lung or colorectal cancer.

    Current version updated: Feb-14

    Next version due: To be confirmed

  11. Data from: haberman

    • kaggle.com
    Updated Jul 22, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gowtam Singulur (2019). haberman [Dataset]. https://www.kaggle.com/gowtamsingulur/habermancsv/tasks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 22, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Gowtam Singulur
    Description

    Context

    The dataset contains cases from a study that was conducted between 1958 and 1970 at the University of Chicago's Billings Hospital on the survival of patients who had undergone surgery for breast cancer

  12. GENE EXPRESSION DATASETS OF BREAST CANCER

    • kaggle.com
    Updated Dec 4, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    umair16082 (2021). GENE EXPRESSION DATASETS OF BREAST CANCER [Dataset]. https://www.kaggle.com/datasets/umair16082/gene-expression-datasets-of-breast-cancer/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 4, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    umair16082
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    THE TRAINING DATA SET CONTAIN 78 SAMPLES FROM 34 ARE THOSE PATIENTS WHO HAD DEVELOPED DISTANCE METASTASES WITHIN 5 YEARS (LABELLED AS RELAPSE) THE REST SAMPLES ARE FROM THOSE WHO ARE HEALTHY (LABELLED AS NON-RELAPSE), WHEREAS TEST DATA SET CONTAINING 19 SAMPLES 12(RELAPSE) 7(NON-RELAPSE), THERE ARE 2448 GENES (FEATURES) IN DATASET.

    There's a story behind every dataset and here's your opportunity to share yours.

    Content

    THE TRAINING DATA SET CONTAIN 78 SAMPLES FROM 34 ARE THOSE PATIENTS WHO HAD DEVELOPED DISTANCE METASTASES WITHIN 5 YEARS (LABELLED AS RELAPSE) THE REST SAMPLES ARE FROM THOSE WHO ARE HEALTHY (LABELLED AS NON-RELAPSE), WHEREAS TEST DATA SET CONTAINING 19 SAMPLES 12(RELAPSE) 7(NON-RELAPSE), THERE ARE 2448 GENES (FEATURES) IN DATASET.

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    THE TRAINING DATA SET CONTAIN 78 SAMPLES FROM 34 ARE THOSE PATIENTS WHO HAD DEVELOPED DISTANCE METASTASES WITHIN 5 YEARS (LABELLED AS RELAPSE) THE REST SAMPLES ARE FROM THOSE WHO ARE HEALTHY (LABELLED AS NON-RELAPSE), WHEREAS TEST DATA SET CONTAINING 19 SAMPLES 12(RELAPSE) 7(NON-RELAPSE), THERE ARE 2448 GENES (FEATURES) IN DATASET. We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  13. Number and rates of new cases of primary cancer, by cancer type, age group...

    • www150.statcan.gc.ca
    • datasets.ai
    • +3more
    Updated May 19, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Canada, Statistics Canada (2021). Number and rates of new cases of primary cancer, by cancer type, age group and sex [Dataset]. http://doi.org/10.25318/1310011101-eng
    Explore at:
    Dataset updated
    May 19, 2021
    Dataset provided by
    Statistics Canadahttps://statcan.gc.ca/en
    Area covered
    Canada
    Description

    Number and rate of new cancer cases diagnosed annually from 1992 to the most recent diagnosis year available. Included are all invasive cancers and in situ bladder cancer with cases defined using the Surveillance, Epidemiology and End Results (SEER) Groups for Primary Site based on the World Health Organization International Classification of Diseases for Oncology, Third Edition (ICD-O-3). Random rounding of case counts to the nearest multiple of 5 is used to prevent inappropriate disclosure of health-related information.

  14. c

    A Curated Benchmark Dataset for Ultrasound Based Breast Lesion Analysis

    • cancerimagingarchive.net
    n/a, png and zip +1
    Updated Jan 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Cancer Imaging Archive (2024). A Curated Benchmark Dataset for Ultrasound Based Breast Lesion Analysis [Dataset]. http://doi.org/10.7937/9WKK-Q141
    Explore at:
    xlsx, png and zip, n/aAvailable download formats
    Dataset updated
    Jan 8, 2024
    Dataset authored and provided by
    The Cancer Imaging Archive
    License

    https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/

    Time period covered
    Jan 8, 2024
    Dataset funded by
    National Cancer Institutehttp://www.cancer.gov/
    Description

    This dataset consists of 256 breast ultrasound scans collected from 256 patients and 266 benign and malignant segmented lesions. It includes patient-level labels, image-level annotations, and tumor-level labels with all cases confirmed by follow-up care or biopsy result. Each scan was manually annotated and labeled by a radiologist experienced in breast ultrasound examination. In particular, each tumor was identified in the image via a freehand annotation and labeled according to BIRADS features. The tumor histopathological classification is stated for patients who underwent a biopsy. Patient-level labels include clinical data such as age, breast tissue composition, signs and symptoms. Image-level freehand annotations identify the tumor and other abnormal areas in the image. The tumor and image are labeled with BIRADS category, 7 BIRADS descriptors, and interpretation of critical findings as presence of breast diseases. Additional labels include the method of verification, tumor classification and histopathological diagnosis.

    Since the role of machine learning and theoretical computing towards the development of augmented inference in the field of cancer detection is indisputable, the quality of the data used to develop any explainable augmented inference methods is extremely important. This dataset can be used as an external testing set for assessing a model’s performance and for developing explainable AI or supervised machine learning models for the detection, segmentation and classification of breast abnormalities in ultrasound images.

    A detailed description of this dataset can be found here and should be cited along with the citation of the data:
    Pawłowska, A., Ćwierz-Pieńkowska, A., Domalik, A., Jaguś, D., Kasprzak, P., Matkowski, R., Fura, Ł., Nowicki, A., & Zolek, N. A Curated benchmark dataset for ultrasound based breast lesion analysis. Sci Data 11, 148 (2024). https://doi.org/10.1038/s41597-024-02984-z.

  15. h

    breastcanc-ultrasound-class

    • huggingface.co
    Updated Apr 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Clelia Astra Bertelli (2024). breastcanc-ultrasound-class [Dataset]. https://huggingface.co/datasets/as-cle-bert/breastcanc-ultrasound-class
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 29, 2024
    Authors
    Clelia Astra Bertelli
    License

    https://choosealicense.com/licenses/cc/https://choosealicense.com/licenses/cc/

    Description

    breastcanc-ultrasound-class

      Background
    

    Cancer is the second leading cause of death worldwide, according to IHME - Global Burden of Disease, with 10.7 mln casualties in 2019.

    Amongst the various types of cancer, a huge role is played by breast cancer, which stands in 4th position among the deadliest tumors, with more than 700.000 deaths during 2019 (IHME - Global Burden of Disease).

    Moreover, breast cancer has the highest share of number of cases/100 people worldwide… See the full description on the dataset page: https://huggingface.co/datasets/as-cle-bert/breastcanc-ultrasound-class.

  16. Mammogram Dataset (KAUMDS)

    • kaggle.com
    zip
    Updated Apr 10, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    King Abdul Aziz University Dataset (2021). Mammogram Dataset (KAUMDS) [Dataset]. https://www.kaggle.com/asmaasaad/mammogram-dataset-kaumds
    Explore at:
    zip(5233333411 bytes)Available download formats
    Dataset updated
    Apr 10, 2021
    Authors
    King Abdul Aziz University Dataset
    Description

    ##

    The current study aims to build the first digitalized mammogram dataset for breast cancer in Saudi Arabia, depend on the BI-RADS categories, to solve the availability problem of local public datasets by collecting, categorizing, and annotating mammogram images, supporting the medical field by providing physicians with different diagnosed cases especially in Saudi Arabia The dataset was collected from Sheikh Mohammed Hussein Al-Amoudi Center of Excellence in Breast Cancer at King Abdulaziz University in Jeddah, Saudi Arabia, from April 2019 to March 2020 and the annotated was between April and June 2020. The dataset contains 1521 cases; all cases include images with two types of views (CC and MLO) for both breasts (right and left), making a total of 6109 mammogram images. The dataset was classified into 0 to 5 categories in accordance with BI-RADS

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  17. f

    High Prevalence of the BIM Deletion Polymorphism in Young Female Breast...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ching-Hung Lin; Chen-Yang Shen; Jih-Hsiang Lee; Chiun-Sheng Huang; Chih-Hsin Yang; Wen-Hung Kuo; Dwan-Ying Chang; Chia-Ni Hsiung; Kuan-Ting Kuo; Wei-Wu Chen; I-Chun Chen; Pei-Fang Wu; Sung-Hsin Kuo; Chien-Jen Chen; Yen-Shen Lu; Ann-Lii Cheng (2023). High Prevalence of the BIM Deletion Polymorphism in Young Female Breast Cancer in an East Asian Country [Dataset]. http://doi.org/10.1371/journal.pone.0124908
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Ching-Hung Lin; Chen-Yang Shen; Jih-Hsiang Lee; Chiun-Sheng Huang; Chih-Hsin Yang; Wen-Hung Kuo; Dwan-Ying Chang; Chia-Ni Hsiung; Kuan-Ting Kuo; Wei-Wu Chen; I-Chun Chen; Pei-Fang Wu; Sung-Hsin Kuo; Chien-Jen Chen; Yen-Shen Lu; Ann-Lii Cheng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundA rapid surge of female breast cancer has been observed in young women in several East Asian countries. The BIM deletion polymorphism, which confers cell resistance to apoptosis, was recently found exclusively in East Asian people with prevalence rate of 12%. We aimed to evaluate the possible role of this genetic alteration in carcinogenesis of breast cancer in East Asians.MethodFemale healthy volunteers (n = 307), patients in one consecutive stage I-III breast cancer cohort (n = 692) and one metastatic breast cancer cohort (n = 189) were evaluated. BIM wild-type and deletion alleles were separately genotyped in genomic DNAs.ResultsBoth cancer cohorts consistently showed inverse associations between the BIM deletion polymorphism and patient age (≤35 y vs. 36-50 y vs. >50 y: 29% vs. 22% vs. 15%, P = 0.006 in the consecutive cohort, and 40% vs. 23% vs. 13%, P = 0.023 in the metastatic cohort). In healthy volunteers, the frequencies of the BIM deletion polymorphism were similar (13%-14%) in all age groups. Further analyses indicated that the BIM deletion polymorphism was not associated with specific clinicopathologic features, but it was associated with poor overall survival (adjusted hazard ratio 1.71) in the consecutive cohort.ConclusionsBIM deletion polymorphism may be involved in the tumorigenesis of the early-onset breast cancer among East Asians.

  18. G

    Health Status: Breast Cancer Ratios, 1986 to 1995

    • open.canada.ca
    jp2, zip
    Updated Mar 14, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Natural Resources Canada (2022). Health Status: Breast Cancer Ratios, 1986 to 1995 [Dataset]. https://open.canada.ca/data/en/dataset/f1505a5e-8893-11e0-8db2-6cf049291510
    Explore at:
    zip, jp2Available download formats
    Dataset updated
    Mar 14, 2022
    Dataset provided by
    Natural Resources Canada
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Description

    This map uses age-standardized ratios to further aid in regional comparisons. A value of 1.0 would indicate that the region rate is identical to the overall Canadian rate; a value greater than 1.0 would indicate that the rate for that region is higher than the Canadian rate; and, in turn, a ratio value less than 1.0 would indicate that the rate for the specific region is lower than the Canadian rate. Statistically low incidences of breast cancer are found in Newfoundland and Labrador, the territories, and northern areas of most provinces. Otherwise, each province has one or more pockets of significantly high breast cancer incidence. Health status refers to the state of health of a person or group, and measures causes of sickness and death. It can also include people’s assessment of their own health.

  19. Cancer Screening Coverage

    • johnsnowlabs.com
    csv
    Updated Jan 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John Snow Labs (2021). Cancer Screening Coverage [Dataset]. https://www.johnsnowlabs.com/marketplace/cancer-screening-coverage/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jan 20, 2021
    Dataset authored and provided by
    John Snow Labs
    Time period covered
    2010 - 2016
    Area covered
    England
    Description

    This dataset is sourced from Public Health England and consists of the percentage of people in the resident population eligible for cervical screening who were screened adequately within the previous years (2010 to 2016) for bowel, cervical and breast cancer.

  20. c

    ACRIN 6667

    • cancerimagingarchive.net
    csv, zip, and xlsx +2
    Updated Mar 5, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Cancer Imaging Archive (2021). ACRIN 6667 [Dataset]. http://doi.org/10.7937/Q1EE-J082
    Explore at:
    dicom, n/a, csv, zip, and xlsxAvailable download formats
    Dataset updated
    Mar 5, 2021
    Dataset authored and provided by
    The Cancer Imaging Archive
    License

    https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/

    Time period covered
    Mar 5, 2021
    Dataset funded by
    National Cancer Institutehttp://www.cancer.gov/
    Description

    https://www.cancerimagingarchive.net/wp-content/uploads/nctn-logo-300x108.png" alt="" width="300" height="108" />

    Demographic Summary of Available Imaging

    CharacteristicValue (N = 984)
    Age (years)Mean ± SD: 53.2 ± 11
    Median (IQR): 53 (45-60)
    Range: 25-86
    SexFemale: 984 (100%)
    Race

    White: 898 (91.3%)
    Black: 46 (4.7%)
    Asian: 22 (2.2%)
    Native Hawaiian or Other Pacific Islander: 1 (0.1%)
    More than One: 5 (0.5%)
    Unknown: 12 (1.2%)

    Ethnicity

    Hispanic: 38 (3.9%)
    Non-Hispanic: 940 (95.5%)
    Unknown: 6 (0.6%)

    This dataset relates to NCI Clinical trial, "Magnetic Resonance Imaging in Women Recently Diagnosed With Unilateral Breast Cancer (ACRIN-6667)". The dataset consists of 984 patients but only 969 were included in the primary data analysis due to study criteria.

    Even after careful clinical and mammographic evaluation, cancer is found in the contralateral breast in up to 10% of women who have received treatment for unilateral breast cancer. ACRIN 6667 was conducted to determine whether magnetic resonance imaging (MRI) could improve on clinical breast examination and mammography in detecting contralateral breast cancer soon after the initial diagnosis of unilateral breast cancer. Additional information about the trial is available in the Study Protocol and Case Report Forms.

    METHODS

    A total of 969 women with a recent diagnosis of unilateral breast cancer and no abnormalities on mammographic and clinical examination of the contralateral breast underwent breast MRI. The diagnosis of MRI-detected cancer was confirmed by means of biopsy within 12 months after study entry. The absence of breast cancer was determined by means of biopsy, the absence of positive findings on repeat imaging and clinical examination, or both at 1 year of follow-up.

    RESULTS

    MRI detected clinically and mammographically occult breast cancer in the contralateral breast in 30 of 969 women who were enrolled in the study (3.1%). The sensitivity of MRI in the contralateral breast was 91%, and the specificity was 88%. The negative predictive value of MRI was 99%. A biopsy was performed on the basis of a positive MRI finding in 121 of the 969 women (12.5%), 30 of whom had specimens that were positive for cancer (24.8%); 18 of the 30 specimens were positive for invasive cancer. The mean diameter of the invasive tumors detected was 10.9 mm. The additional number of cancers detected was not influenced by breast density, menopausal status, or the histologic features of the primary tumor.

    CONCLUSIONS

    MRI can detect cancer in the contralateral breast that is missed by mammography and clinical examination at the time of the initial breast-cancer diagnosis.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2024). Breast Cancer Dataset - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/breast-cancer-dataset

Breast Cancer Dataset - Dataset - CKAN

Explore at:
Dataset updated
Oct 7, 2024
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Description: Breast cancer is the most common cancer amongst women in the world. It accounts for 25% of all cancer cases, and affected over 2.1 Million people in 2015 alone. It starts when cells in the breast begin to grow out of control. These cells usually form tumors that can be seen via X-ray or felt as lumps in the breast area. The key challenges against it’s detection is how to classify tumors into malignant (cancerous) or benign(non cancerous). We ask you to complete the analysis of classifying these tumors using machine learning (with SVMs) and the Breast Cancer Wisconsin (Diagnostic) Dataset. Acknowledgements: This dataset has been referred from Kaggle. Objective: Understand the Dataset & cleanup (if required). Build classification models to predict whether the cancer type is Malignant or Benign. Also fine-tune the hyperparameters & compare the evaluation metrics of various classification algorithms.

Search
Clear search
Close search
Google apps
Main menu