100+ datasets found
  1. h

    Data from: breast-cancer-wisconsin

    • huggingface.co
    Updated May 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    scikit-learn (2025). breast-cancer-wisconsin [Dataset]. https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 26, 2025
    Dataset authored and provided by
    scikit-learn
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Breast Cancer Wisconsin Diagnostic Dataset

    Following description was retrieved from breast cancer dataset on UCI machine learning repository. Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at here. Separating plane described above was obtained using Multisurface Method-Tree (MSM-T), a classification method which uses linear… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin.

  2. Breast cancer dataset

    • zenodo.org
    zip
    Updated Jan 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saiful Izzuan Hussain; Saiful Izzuan Hussain (2025). Breast cancer dataset [Dataset]. http://doi.org/10.5281/zenodo.14769221
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 30, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Saiful Izzuan Hussain; Saiful Izzuan Hussain
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset used in this study consists of 7,632 mammogram images categorized into two classes: 2,520 benign and 5,112 malignant images from Huang and Lin (2020). The mammography images in the INbreast database were originally collected from the Centro Hospitalar de S. Joao (CHSJ) Breast Center in Porto. The database contains data collected from August 2008 to July 2010 and includes 115 cases with a total of 410 images (Moreira et al., 2012). Of these, 90 cases concern women with abnormalities in both breasts. Four different types of breast disease are recorded in the database: Mass, calcification, asymmetries and distortions. The mammograms are recorded from two standard perspectives: Craniocaudal (CC) and Mediolateral Oblique (MLO). In addition, breast density is classified into four categories based on the BI-RADS standards: Fully Fat (Density 1), Scattered Fibrous-Landular Density (Density 2), Heterogeneously Dense (Density 3) and Extremely Dense (Density 4). The images are stored in two resolutions: 3328 x 4084 pixels or 2560 x 3328 pixels, in DICOM format. 106 mammograms depicting breast masses were selected from the INbreast database. To enhance the dataset for model training, data augmentation techniques were applied, increasing the total number of breast mammography images to 7,632.

  3. c

    Breast Cancer Dataset

    • cubig.ai
    Updated Aug 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2024). Breast Cancer Dataset [Dataset]. https://cubig.ai/store/products/178/breast-cancer-dataset
    Explore at:
    Dataset updated
    Aug 30, 2024
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction • The Breast Cancer Wisconsin (Diagnostic) data focuses on distinguishing between malignant (cancerous) and benign (non-cancerous) breast tumors. This dataset is crucial for developing machine learning models to aid in the early detection and classification of breast cancer, thereby potentially saving lives through timely intervention.

    2) Data Utilization (1) Breast cancer data has characteristics that: • The dataset contains various features extracted from digitized images of fine needle aspirate (FNA) of breast masses, allowing for detailed analysis and classification of tumors. (2) Breast cancer data can be used to: • Healthcare and Medical Research: Useful for developing diagnostic tools and models to accurately classify breast tumors, aiding healthcare providers in making informed decisions. • Machine Learning and AI Development: Assists in creating and fine-tuning machine learning algorithms to improve predictive accuracy in medical diagnostics.

  4. i

    SEER Breast Cancer Data

    • ieee-dataport.org
    • data.niaid.nih.gov
    • +1more
    Updated Jul 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    jing teng (2025). SEER Breast Cancer Data [Dataset]. https://ieee-dataport.org/open-access/seer-breast-cancer-data
    Explore at:
    Dataset updated
    Jul 29, 2025
    Authors
    jing teng
    Description

    examined regional LNs

  5. RSNA Breast Cancer Detection - 512x512 pngs

    • kaggle.com
    Updated Nov 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Theo Viel (2022). RSNA Breast Cancer Detection - 512x512 pngs [Dataset]. https://www.kaggle.com/datasets/theoviel/rsna-breast-cancer-512-pngs
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 29, 2022
    Dataset provided by
    Kaggle
    Authors
    Theo Viel
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description
  6. h

    breast-cancer

    • huggingface.co
    • opendatalab.com
    Updated Sep 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Niels Rogge (2023). breast-cancer [Dataset]. https://huggingface.co/datasets/nielsr/breast-cancer
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 16, 2023
    Authors
    Niels Rogge
    Description

    Dataset Card for "breast-cancer"

    Dataset was taken from the MedSAM project and used in this notebook which fine-tunes Meta's SAM model on the dataset. More Information needed

  7. Breast Cancer Coimbra

    • kaggle.com
    Updated Jan 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vivek Agrawal (2024). Breast Cancer Coimbra [Dataset]. https://www.kaggle.com/datasets/atom1991/breast-cancer-coimbra
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 7, 2024
    Dataset provided by
    Kaggle
    Authors
    Vivek Agrawal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset originates from a deep learning model trained on the "Coimbra Breast Cancer" dataset, with feature distributions closely resembling the original. The original data includes clinical observations from 64 patients with breast cancer and 52 healthy controls, encompassing 10 quantitative predictors and a binary dependent variable indicating the presence or absence of breast cancer.

    Quantitative Attributes:

    Age (years): Represents the age of individuals in the dataset.

    BMI (kg/m²): Body Mass Index, a measure of body fat based on weight and height.

    Glucose (mg/dL): Reflects blood glucose levels, a vital metabolic indicator.

    Insulin (µU/mL): Indicates insulin levels, a hormone associated with glucose regulation.

    HOMA: Homeostatic Model Assessment, a method assessing insulin resistance and beta-cell function.

    Leptin (ng/mL): Represents leptin levels, a hormone involved in appetite and energy balance regulation.

    Adiponectin (µg/mL): Reflects adiponectin levels, a protein associated with metabolic regulation.

    Resistin (ng/mL): Indicates resistin levels, a protein implicated in insulin resistance.

    MCP-1 (pg/dL): Reflects Monocyte Chemoattractant Protein-1 levels, a cytokine involved in inflammation.

    Labels:

    1: Healthy controls

    2: Patients with breast cancer

    These quantitative attributes, including anthropometric data and parameters gathered from routine blood analysis, serve as the foundation for potential biomarkers of breast cancer. The dataset presents an opportunity for developing accurate prediction models, aiding in the identification and understanding of factors associated with breast cancer.

  8. h

    wisconsin-breast-cancer

    • huggingface.co
    Updated Feb 1, 2001
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Witold Wydmański (2001). wisconsin-breast-cancer [Dataset]. https://huggingface.co/datasets/wwydmanski/wisconsin-breast-cancer
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 1, 2001
    Authors
    Witold Wydmański
    Area covered
    Wisconsin
    Description

    Source:

    Copied from the original dataset

      Creators:
    

    Dr. William H. Wolberg, General Surgery Dept. University of Wisconsin, Clinical Sciences Center Madison, WI 53792 wolberg '@' eagle.surgery.wisc.edu

    W. Nick Street, Computer Sciences Dept. University of Wisconsin, 1210 West Dayton St., Madison, WI 53706 street '@' cs.wisc.edu 608-262-6619

    Olvi L. Mangasarian, Computer Sciences Dept. University of Wisconsin, 1210 West Dayton St., Madison, WI 53706 olvi '@' cs.wisc.edu… See the full description on the dataset page: https://huggingface.co/datasets/wwydmanski/wisconsin-breast-cancer.

  9. d

    Breast cancer

    • datahub.io
    Updated Sep 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Breast cancer [Dataset]. https://datahub.io/core/breast-cancer
    Explore at:
    Dataset updated
    Sep 17, 2024
    Description

    This is a dataset about breast cancer occurrences.

    This dataset is taken from OpenML - breast-cancer

    This breast cancer domain was obtained from the University Medical Centre, Institute of Oncolog...

  10. p

    Breast Cancer Prediction Dataset - Dataset - CKAN

    • data.poltekkes-smg.ac.id
    Updated Oct 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Breast Cancer Prediction Dataset - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/breast-cancer-prediction-dataset
    Explore at:
    Dataset updated
    Oct 7, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Worldwide, breast cancer is the most common type of cancer in women and the second highest in terms of mortality rates.Diagnosis of breast cancer is performed when an abnormal lump is found (from self-examination or x-ray) or a tiny speck of calcium is seen (on an x-ray). After a suspicious lump is found, the doctor will conduct a diagnosis to determine whether it is cancerous and, if so, whether it has spread to other parts of the body. This breast cancer dataset was obtained from the University of Wisconsin Hospitals, Madison from Dr. William H. Wolberg.

  11. Breast Cancer MSI Multimodal Image Dataset

    • kaggle.com
    Updated May 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Developer (2025). Breast Cancer MSI Multimodal Image Dataset [Dataset]. https://www.kaggle.com/datasets/zoya77/breast-cancer-msi-multimodal-image-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 31, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Developer
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The multispectral breast cancer image datasets span three complementary imaging modalities: Ultrasound, Histopathological, and Chest X-ray. Each dataset includes balanced classes of benign and malignant cases, and the images are enhanced through spectral conversion (RGB, HSV, Jet) to support robust multispectral analysis for classification and fusion tasks.

    MSI Ultrasound Breast Images for Breast Cancer This dataset contains ultrasound images of breast tissue labeled as either benign or malignant.

    Total Images: 806

    Benign: 406 images

    Malignant: 400 images

    Augmentation: Data augmentation techniques such as rotation and sharpening were applied to enhance the diversity and volume of the dataset, enabling robust training of machine learning models.

    MSI BreastHis – Breast Cancer Histopathological Images This dataset comprises high-resolution microscopic images of breast tumor tissue collected for histopathological analysis. These images provide cellular-level detail and are essential for determining cancer grade and type.

    Total Images Used: 1,246 (Subset of the full BreakHis dataset)

    Benign: 623 images

    Malignant: 623 images

    MSI Chest X-Ray for Breast Cancer This dataset consists of colorized chest X-ray images used for identifying breast cancer-related anomalies. While traditionally not the primary modality for breast cancer detection, chest X-rays can provide useful structural insights when used in conjunction with other imaging types.

    Total Images: 1,000

    Benign: 500 images

    Malignant: 500 images

  12. c

    Breast Cancer Screening - Digital Breast Tomosynthesis

    • cancerimagingarchive.net
    csv, dicom, n/a +1
    Updated Jun 8, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Cancer Imaging Archive (2021). Breast Cancer Screening - Digital Breast Tomosynthesis [Dataset]. http://doi.org/10.7937/E4WT-CD02
    Explore at:
    n/a, dicom, csv, zip and csvAvailable download formats
    Dataset updated
    Jun 8, 2021
    Dataset authored and provided by
    The Cancer Imaging Archive
    License

    https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/

    Time period covered
    Jan 18, 2024
    Dataset funded by
    National Cancer Institutehttp://www.cancer.gov/
    Description

    Breast cancer is among the most common cancers and a common cause of death among women. Over 39 million breast cancer screening exams are performed every year and are among the most common radiological tests. This creates a high need for accurate image interpretation. Machine learning has shown promise in interpretation of medical images. However, limited data for training and validation remains an issue.

    Here, we share a curated dataset of digital breast tomosynthesis images that includes normal, actionable, biopsy-proven benign, and biopsy-proven cancer cases. The dataset contains four components: (1) DICOM images, (2) a spreadsheet indicating which group each case belongs to (3) annotation boxes, and (4) Image paths for patients/studies/views. A detailed description of this dataset can be found in the following paper; please reference this paper if you use this dataset:

    M. Buda, A. Saha, R. Walsh, S. Ghate, N. Li, A. Święcicki, J. Y. Lo, M. A. Mazurowski, Detection of masses and architectural distortions in digital breast tomosynthesis: a publicly available dataset of 5,060 patients and a deep learning model. (https://doi.org/10.1001/jamanetworkopen.2021.19100).

    Additional information and resources related to this dataset can be found here: https://sites.duke.edu/mazurowski/resources/digital-breast-tomosynthesis-database/

    A Version 1 of the dataset contains only a subset of all data described in the paper above. More data will be share in subsequent versions.

    Please visit this discussion forum for any questions related to the data: https://www.reddit.com/r/DukeDBTData/

    Required Preprocessing of DBT Images

    For some of the images, the laterality stored in the DICOM header and/or image orientation are incorrect. The reference standard "truth" boxes are defined with respect to the corrected image orientation in these instances. Therefore, it is crucial to provide your results for images in the correct image orientation. Python functions for loading image data from a DICOM file into 3D array of pixel values in the correct orientation and for displaying "truth" boxes (if any) are on GitHub. Please see the readme file there for instructions.

    DBTex Lesion Detection Challenge Predictions

    The DBTex lesion detection challenge tasked participating teams with detecting lesions in the BCS-DBT test set. The challenge had two phases: DBTex1 and DBTex2. Here we provide the BCS-DBT lesion predictions made by all participating teams for both phases, for both the BCS-DBT test and validation sets, as “team_predictions_bothphases.zip”. Please see here under “Output format for the DBTex2 Challenge test set results” for a description of how these results are formatted. Finally, when comparing lesion bounding box predictions to the image data, be sure to load the images correctly according to the above “Required Preprocessing of DBT Images”.

    If you use these predictions, please reference the DBTex challenge paper:

    Konz N, Buda M, Gu H, et al. A Competition, Benchmark, Code, and Data for Using Artificial Intelligence to Detect Lesions in Digital Breast Tomosynthesis. JAMA Netw Open. 2023;6(2):e230524. doi:10.1001/jamanetworkopen.2023.0524

  13. m

    AISSLab Breast Cancer Dataset: Toward General AI Harmonization with Real...

    • data.mendeley.com
    Updated Jul 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aymen Al-Hejri (2025). AISSLab Breast Cancer Dataset: Toward General AI Harmonization with Real Mammogram Imaging [Dataset]. http://doi.org/10.17632/zp8yfhvndv.2
    Explore at:
    Dataset updated
    Jul 15, 2025
    Authors
    Aymen Al-Hejri
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The AISSLab Breast Cancer Dataset is a collection of mammogram images by experts from the Ma'amon's Diagnostic Centre Mammogram Images for Breast Cancer (MDCMI-BC) in Yemen. It is designed to support advancements in breast cancer research and computer-aided diagnosis (CAD) systems. To facilitate research in breast cancer detection, focusing on harmonizing AI with diverse imaging data. This dataset emphasizes improving diagnostic accuracy and is available for academic and clinical research applications.

    If you are using this dataset for research purpose kindly cite the following papers:

    [1] A. M. Al-Hejri, R. M. Al-Tam, M. Fazea, A. H. Sable, S. Lee, and M. A. Al-antari, “ETECADx: Ensemble Self-Attention Transformer Encoder for Breast Cancer Diagnosis Using Full-Field Digital X-ray Breast Images,” Diagnostics, vol. 13, no. 1, p. 89, Dec. 2022, doi: 10.3390/diagnostics13010089.

    [2] R. M. Al-Tam, A. M. Al-Hejri, S. S. Alshamrani, M. A. Al-antari, and S. M. Narangale, “Multimodal breast cancer hybrid explainable computer-aided diagnosis using medical mammograms and ultrasound Images,” Biocybern. Biomed. Eng., vol. 44, no. 3, pp. 731–758, Jul. 2024, doi: 10.1016/j.bbe.2024.08.007.

  14. R

    Breastcancer Yolov8 Dataset

    • universe.roboflow.com
    zip
    Updated May 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    b (2024). Breastcancer Yolov8 Dataset [Dataset]. https://universe.roboflow.com/b-davmu/breastcancer-yolov8
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 27, 2024
    Dataset authored and provided by
    b
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Breastcancer Yolov8 Cancer Bounding Boxes
    Description

    BreastCancer YOLOv8

    ## Overview
    
    BreastCancer YOLOv8 is a dataset for object detection tasks - it contains Breastcancer Yolov8 Cancer annotations for 1,642 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  15. f

    Data from: BreCaHAD: A Dataset for Breast Cancer Histopathological...

    • figshare.com
    png
    Updated Jan 28, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alper Aksac; Douglas J. Demetrick; Tansel Özyer; Reda Alhajj (2019). BreCaHAD: A Dataset for Breast Cancer Histopathological Annotation and Diagnosis [Dataset]. http://doi.org/10.6084/m9.figshare.7379186.v3
    Explore at:
    pngAvailable download formats
    Dataset updated
    Jan 28, 2019
    Dataset provided by
    figshare
    Authors
    Alper Aksac; Douglas J. Demetrick; Tansel Özyer; Reda Alhajj
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset consists of 1 .xlsx file, 2 .png files, 1 .json file and 1 .zip file:annotation_details.xlsx: The distribution of annotations in the previously mentioned six classes (mitosis, apoptosis, tumor nuclei, non-tumor nuclei, tubule, and non-tubule) is presented in a Excel spreadsheet.original.png: The input image.annotated.png: An example from the dataset. In the annotated image, blue circles indicate the tumor nuclei, pink circles show non-tumor nuclei such as blood cells, stroma nuclei, and lymphocytes; orange and green circles are mitosis and apoptosis, respectively; light blue circles are true lumen for tubules, and yellow circles represent white regions (non-lumen) such as fat, blood vessel, and broken tissues.data.json: The annotations for the BreCaHAD dataset are provided in JSON (JavaScript Object Notation) format. In the given example, the JSON file (ground truth) contains two mitosis and only one tumor nuclei annotations. Here, x and y are the coordinates of the centroid of the annotated object, and the values are between 0, 1.BreCaHAD.zip: An archive file containing dataset. Three folders are included: images (original images), groundTruth (json files), and groundTruth_display (groundTruth applied on original images)

  16. c

    Multimodal imaging of ductal carcinoma in situ with microinvasion

    • cancerimagingarchive.net
    n/a +1
    Updated Dec 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Cancer Imaging Archive (2023). Multimodal imaging of ductal carcinoma in situ with microinvasion [Dataset]. http://doi.org/10.7937/3fyc-ac78
    Explore at:
    svs, tiff, and xml, n/aAvailable download formats
    Dataset updated
    Dec 8, 2023
    Dataset authored and provided by
    The Cancer Imaging Archive
    License

    https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/

    Time period covered
    Dec 8, 2023
    Dataset funded by
    National Cancer Institutehttp://www.cancer.gov/
    Description

    Ductal carcinoma in situ with microinvasion (DCISM) is a challenging subtype of breast cancer with controversial invasiveness and prognosis. Accurate diagnosis of DCISM from ductal carcinoma in situ (DCIS) is crucial for optimal treatment and improved clinical outcomes. This dataset provides histopathology images and paired CK5/6 immunohistochemical staining images from patients with DCISM, as well as multiphoton microscopy images of suspicious regions. It offers multi-modal imaging data from various perspectives for analysis and diagnosis of microinvasive breast cancer by other researchers in the field.

    The dataset contains data from 12 breast cancer patients, including 10 cases of ductal carcinoma in situ with microinvasion (DCISM), 1 case of ductal carcinoma in situ (DCIS), and 1 case of invasive breast cancer.

    The magnification of the glass slide images is 40x. The pathology slide scanner used was created by the Sunny Optical Technology (group) Co., Ltd., and the pixel aspect ratio of the images is 1. The dataset also includes multiphoton microscopy imaging of suspicious microinvasion areas. The multiphoton imaging system was manufactured by Zeiss, and it also has a pixel aspect ratio of 1.

    Our database was specifically collected for the use of imaging methods in diagnosing DICSM. The suffixes in each case number indicate the patient's condition - "DCISM" for ductal carcinoma in situ with microinvasion, "DCIS" for ductal carcinoma in situ, and "IDC" for invasive ductal carcinoma. Apart from these labels, we have not collected any additional clinical information for these cases.

  17. RSNA Screening Mammography Breast Cancer Detection (RSNA-SMBC) Dataset

    • registry.opendata.aws
    Updated Aug 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Radiological Society of North America (https://www.rsna.org/) (2024). RSNA Screening Mammography Breast Cancer Detection (RSNA-SMBC) Dataset [Dataset]. https://registry.opendata.aws/rsna-screening-mammography-breast-cancer-detection/
    Explore at:
    Dataset updated
    Aug 1, 2024
    Dataset provided by
    Radiological Society of North America
    Description

    According to the WHO, breast cancer is the most commonly occurring cancer worldwide. In 2020 alone, there were 2.3 million new breast cancer diagnoses and 685,000 deaths. Yet breast cancer mortality in high-income countries has dropped by 40% since the 1980s when health authorities implemented regular mammography screening in age groups considered at risk. Early detection and treatment are critical to reducing cancer fatalities, and your machine learning skills could help streamline the process radiologists use to evaluate screening mammograms. Currently, early detection of breast cancer requires the expertise of highly-trained human observers, making screening mammography programs expensive to conduct. RSNA collected screening mammograms and supporting information from two sites, totaling just under 20,000 imaging studies.

  18. i

    King Abdulaziz University Breast Cancer Mammogram Dataset

    • ieee-dataport.org
    Updated Apr 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Snehal Sapkale (2024). King Abdulaziz University Breast Cancer Mammogram Dataset [Dataset]. https://ieee-dataport.org/documents/king-abdulaziz-university-breast-cancer-mammogram-dataset
    Explore at:
    Dataset updated
    Apr 10, 2024
    Authors
    Snehal Sapkale
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    categorizing

  19. s

    Breast data from the Visual Sweden project DROID

    • datahub.aida.scilifelab.se
    • researchdata.se
    • +2more
    Updated Nov 27, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anna Bodén; Jerónimo F. Rose; Martin Lindvall; Caroline Bivik Stadler (2020). Breast data from the Visual Sweden project DROID [Dataset]. http://doi.org/10.23698/aida/drbr
    Explore at:
    Dataset updated
    Nov 27, 2020
    Dataset provided by
    Linköping University
    AIDA
    AIDA Data Hub
    Authors
    Anna Bodén; Jerónimo F. Rose; Martin Lindvall; Caroline Bivik Stadler
    Description

    This dataset consists of 361 whole slide images (WSI) - 296 malignant from women with invasive breast cancer (HER2 neg) and 65 benign. The tumours have been classified with four SNOMED-CT categories based on morphology: invasive duct carcinoma, invasive lobular carcinoma, in situ carcinoma, and others. 4144 separate annotations have been made to segment different tissue structures connected to ontologies.

  20. h

    breast-cancer-dataset

    • huggingface.co
    Updated Oct 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Beijing Institute of Technology (2024). breast-cancer-dataset [Dataset]. https://huggingface.co/datasets/BIT/breast-cancer-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 6, 2024
    Dataset authored and provided by
    Beijing Institute of Technology
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    BIT/breast-cancer-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
scikit-learn (2025). breast-cancer-wisconsin [Dataset]. https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin

Data from: breast-cancer-wisconsin

scikit-learn/breast-cancer-wisconsin

Related Article
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 26, 2025
Dataset authored and provided by
scikit-learn
License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Breast Cancer Wisconsin Diagnostic Dataset

Following description was retrieved from breast cancer dataset on UCI machine learning repository. Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at here. Separating plane described above was obtained using Multisurface Method-Tree (MSM-T), a classification method which uses linear… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin.

Search
Clear search
Close search
Google apps
Main menu