100+ datasets found
  1. h

    Data from: breast-cancer-wisconsin

    • huggingface.co
    Updated May 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    scikit-learn (2025). breast-cancer-wisconsin [Dataset]. https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 26, 2025
    Dataset authored and provided by
    scikit-learn
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Breast Cancer Wisconsin Diagnostic Dataset

    Following description was retrieved from breast cancer dataset on UCI machine learning repository. Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at here. Separating plane described above was obtained using Multisurface Method-Tree (MSM-T), a classification method which uses linear… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin.

  2. Breast Cancer

    • kaggle.com
    Updated Aug 8, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reihaneh Namdari (2022). Breast Cancer [Dataset]. https://www.kaggle.com/datasets/reihanenamdari/breast-cancer
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 8, 2022
    Dataset provided by
    Kaggle
    Authors
    Reihaneh Namdari
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset of breast cancer patients was obtained from the 2017 November update of the SEER Program of the NCI, which provides information on population-based cancer statistics. The dataset involved female patients with infiltrating duct and lobular carcinoma breast cancer (SEER primary cites recode NOS histology codes 8522/3) diagnosed in 2006-2010. Patients with unknown tumour size, examined regional LNs, positive regional LNs, and patients whose survival months were less than 1 month were excluded; thus, 4024 patients were ultimately included.

  3. c

    Breast Cancer Dataset

    • cubig.ai
    zip
    Updated May 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Breast Cancer Dataset [Dataset]. https://cubig.ai/store/products/178/breast-cancer-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 2, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction • The Breast Cancer Wisconsin (Diagnostic) data focuses on distinguishing between malignant (cancerous) and benign (non-cancerous) breast tumors. This dataset is crucial for developing machine learning models to aid in the early detection and classification of breast cancer, thereby potentially saving lives through timely intervention.

    2) Data Utilization (1) Breast cancer data has characteristics that: • The dataset contains various features extracted from digitized images of fine needle aspirate (FNA) of breast masses, allowing for detailed analysis and classification of tumors. (2) Breast cancer data can be used to: • Healthcare and Medical Research: Useful for developing diagnostic tools and models to accurately classify breast tumors, aiding healthcare providers in making informed decisions. • Machine Learning and AI Development: Assists in creating and fine-tuning machine learning algorithms to improve predictive accuracy in medical diagnostics.

  4. Breast Cancer Dataset

    • kaggle.com
    zip
    Updated Feb 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Figo (2024). Breast Cancer Dataset [Dataset]. https://www.kaggle.com/datasets/figolm10/breast-cancer-dataset
    Explore at:
    zip(49831 bytes)Available download formats
    Dataset updated
    Feb 27, 2024
    Authors
    Figo
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Figo

    Released under MIT

    Contents

  5. Breast Cancer Diagnostic Dataset (BCD)

    • kaggle.com
    zip
    Updated Oct 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dev Raikwar (2021). Breast Cancer Diagnostic Dataset (BCD) [Dataset]. https://www.kaggle.com/datasets/devraikwar/breast-cancer-diagnostic
    Explore at:
    zip(2081 bytes)Available download formats
    Dataset updated
    Oct 26, 2021
    Authors
    Dev Raikwar
    Description

    Context

    The resources for this dataset can be found at https://www.openml.org/d/13 and https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

    Content

    This data set includes 201 instances of one class and 85 instances of another class. The instances are described by 9 attributes, some of which are linear and some are nominal.

    Number of Instances: 286

    Number of Attributes: 9 + the class attribute

    Attribute Information:

    Class: no-recurrence-events, recurrence-events age: 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99. menopause: lt40, ge40, premeno. tumor-size: 0-4, 5-9, 10-14, 15-19, 20-24, 25-29, 30-34, 35-39, 40-44, 45-49, 50-54, 55-59. inv-nodes: 0-2, 3-5, 6-8, 9-11, 12-14, 15-17, 18-20, 21-23, 24-26, 27-29, 30-32, 33-35, 36-39. node-caps: yes, no. deg-malig: 1, 2, 3. breast: left, right. breast-quad: left-up, left-low, right-up, right-low, central. irradiat: yes, no.

    Missing Attribute Values: (denoted by “?”) Attribute #: Number of instances with missing values: 6. 8 9. 1.

    Class Distribution:

    no-recurrence-events: 201 instances recurrence-events: 85 instances

    Acknowledgements

    Original data https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

    Inspiration

    With the attributes described above, can you predict if a patient has recurrence event ?

  6. Breast Cancer Dataset [Wisconsin Diagnostic UCI]

    • kaggle.com
    zip
    Updated Jan 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abhinav Mangalore (2024). Breast Cancer Dataset [Wisconsin Diagnostic UCI] [Dataset]. https://www.kaggle.com/datasets/abhinavmangalore/breast-cancer-dataset-wisconsin-diagnostic-uci
    Explore at:
    zip(49831 bytes)Available download formats
    Dataset updated
    Jan 22, 2024
    Authors
    Abhinav Mangalore
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Wisconsin
    Description

    This dataset is taken from the UCI Machine Learning Repository (Link: https://data.world/health/breast-cancer-wisconsin) by the Donor: Nick Street

    The main idea and inspiration behind the upload was to provide datasets for Machine Learning as practice and reference for my peers at college. The main purpose is to analyze data and experiment with different machine learning ideas and techniques for this binary classification task. As such, this dataset is a very useful resource to practice on.

    Breast cancer is when breast cells mutate and become cancerous cells that multiply and form tumors. It accounts for 25% of all cancer cases and affected over 2.1 Million people in 2015 alone. Breast cancer typically affects women and people assigned female at birth (AFAB) age 50 and older, but it can also affect men and people assigned male at birth (AMAB), as well as younger women. Healthcare providers may treat breast cancer with surgery to remove tumors or treatment to kill cancerous cells.

    Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at http://www.cs.wisc.edu/~street/images/

    The task: To classify whether the tumor is benign (B) or malignant (M).

    Relevant information

    Features are computed from a digitized image of a fine needle
    aspirate (FNA) of a breast mass. They describe
    characteristics of the cell nuclei present in the image.
    A few of the images can be found at
    http://www.cs.wisc.edu/~street/images/
    
    Separating plane described above was obtained using
    Multisurface Method-Tree (MSM-T) [K. P. Bennett, "Decision Tree
    Construction Via Linear Programming." Proceedings of the 4th
    Midwest Artificial Intelligence and Cognitive Science Society,
    pp. 97-101, 1992], a classification method which uses linear
    programming to construct a decision tree. Relevant features
    were selected using an exhaustive search in the space of 1-4
    features and 1-3 separating planes.
    
    The actual linear program used to obtain the separating plane
    in the 3-dimensional space is that described in:
    [K. P. Bennett and O. L. Mangasarian: "Robust Linear
    Programming Discrimination of Two Linearly Inseparable Sets",
    Optimization Methods and Software 1, 1992, 23-34].
    
    
    This database is also available through the UW CS ftp server:
    
    ftp ftp.cs.wisc.edu
    cd math-prog/cpo-dataset/machine-learn/WDBC/
    

    Number of instances: 569

    Number of attributes: 32 (ID, diagnosis, 30 real-valued input features)

    Original Creators:

    Dr. William H. Wolberg, General Surgery Dept., University of
    Wisconsin, Clinical Sciences Center, Madison, WI 53792
    wolberg@eagle.surgery.wisc.edu
    
    W. Nick Street, Computer Sciences Dept., University of
    Wisconsin, 1210 West Dayton St., Madison, WI 53706
    street@cs.wisc.edu 608-262-6619
    
    Olvi L. Mangasarian, Computer Sciences Dept., University of
    Wisconsin, 1210 West Dayton St., Madison, WI 53706
    olvi@cs.wisc.edu 
    

    Donor: Nick Street

    Date: November 1995

    Past Usage:

    first usage:

    W.N. Street, W.H. Wolberg and O.L. Mangasarian 
    Nuclear feature extraction for breast tumor diagnosis.
    IS&T/SPIE 1993 International Symposium on Electronic Imaging: Science
    and Technology, volume 1905, pages 861-870, San Jose, CA, 1993.
    

    OR literature:

    O.L. Mangasarian, W.N. Street and W.H. Wolberg. 
    Breast cancer diagnosis and prognosis via linear programming. 
    Operations Research, 43(4), pages 570-577, July-August 1995.
    

    Medical literature:

    W.H. Wolberg, W.N. Street, and O.L. Mangasarian. 
    Machine learning techniques to diagnose breast cancer from
    fine-needle aspirates. 
    Cancer Letters 77 (1994) 163-171.
    
    W.H. Wolberg, W.N. Street, and O.L. Mangasarian. 
    Image analysis and machine learning applied to breast cancer
    diagnosis and prognosis. 
    Analytical and Quantitative Cytology and Histology, Vol. 17
    No. 2, pages 77-87, April 1995. 
    
    W.H. Wolberg, W.N. Street, D.M. Heisey, and O.L. Mangasarian. 
    Computerized breast cancer diagnosis and prognosis from fine
    needle aspirates. 
    Archives of Surgery 1995;130:511-516.
    
    W.H. Wolberg, W.N. Street, D.M. Heisey, and O.L. Mangasarian. 
    Computer-derived nuclear features distinguish malignant from
    benign breast cytology. 
    Human Pathology, 26:792--796, 1995.
    

    See also: http://www.cs.wisc.edu/~olvi/uwmp/mpml.html http://www.cs.wisc.edu/~olvi/uwmp/cancer.html

  7. h

    breast-cancer-dataset

    • huggingface.co
    Updated Oct 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Beijing Institute of Technology (2024). breast-cancer-dataset [Dataset]. https://huggingface.co/datasets/BIT/breast-cancer-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 6, 2024
    Dataset authored and provided by
    Beijing Institute of Technology
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    BIT/breast-cancer-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. Breast Cancer Prediction Dataset

    • kaggle.com
    zip
    Updated Sep 26, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Merishna Singh Suwal (2018). Breast Cancer Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/merishnasuwal/breast-cancer-prediction-dataset
    Explore at:
    zip(8105 bytes)Available download formats
    Dataset updated
    Sep 26, 2018
    Authors
    Merishna Singh Suwal
    Description

    Worldwide, breast cancer is the most common type of cancer in women and the second highest in terms of mortality rates.Diagnosis of breast cancer is performed when an abnormal lump is found (from self-examination or x-ray) or a tiny speck of calcium is seen (on an x-ray). After a suspicious lump is found, the doctor will conduct a diagnosis to determine whether it is cancerous and, if so, whether it has spread to other parts of the body.

    This breast cancer dataset was obtained from the University of Wisconsin Hospitals, Madison from Dr. William H. Wolberg.

  9. h

    breast-cancer

    • huggingface.co
    • opendatalab.com
    Updated Sep 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Niels Rogge (2023). breast-cancer [Dataset]. https://huggingface.co/datasets/nielsr/breast-cancer
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 16, 2023
    Authors
    Niels Rogge
    Description

    Dataset Card for "breast-cancer"

    Dataset was taken from the MedSAM project and used in this notebook which fine-tunes Meta's SAM model on the dataset. More Information needed

  10. p

    Breast Cancer Prediction Dataset - Dataset - CKAN

    • data.poltekkes-smg.ac.id
    Updated Oct 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Breast Cancer Prediction Dataset - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/breast-cancer-prediction-dataset
    Explore at:
    Dataset updated
    Oct 7, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Worldwide, breast cancer is the most common type of cancer in women and the second highest in terms of mortality rates.Diagnosis of breast cancer is performed when an abnormal lump is found (from self-examination or x-ray) or a tiny speck of calcium is seen (on an x-ray). After a suspicious lump is found, the doctor will conduct a diagnosis to determine whether it is cancerous and, if so, whether it has spread to other parts of the body. This breast cancer dataset was obtained from the University of Wisconsin Hospitals, Madison from Dr. William H. Wolberg.

  11. i

    SEER Breast Cancer Data

    • ieee-dataport.org
    • data.niaid.nih.gov
    Updated Jul 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    jing teng (2025). SEER Breast Cancer Data [Dataset]. https://ieee-dataport.org/open-access/seer-breast-cancer-data
    Explore at:
    Dataset updated
    Jul 29, 2025
    Authors
    jing teng
    Description

    examined regional LNs

  12. m

    AISSLab Breast Cancer Dataset: Toward General AI Harmonization with Real...

    • data.mendeley.com
    Updated Jul 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aymen Al-Hejri (2025). AISSLab Breast Cancer Dataset: Toward General AI Harmonization with Real Mammogram Imaging [Dataset]. http://doi.org/10.17632/zp8yfhvndv.2
    Explore at:
    Dataset updated
    Jul 15, 2025
    Authors
    Aymen Al-Hejri
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The AISSLab Breast Cancer Dataset is a collection of mammogram images by experts from the Ma'amon's Diagnostic Centre Mammogram Images for Breast Cancer (MDCMI-BC) in Yemen. It is designed to support advancements in breast cancer research and computer-aided diagnosis (CAD) systems. To facilitate research in breast cancer detection, focusing on harmonizing AI with diverse imaging data. This dataset emphasizes improving diagnostic accuracy and is available for academic and clinical research applications.

    If you are using this dataset for research purpose kindly cite the following papers:

    [1] A. M. Al-Hejri, R. M. Al-Tam, M. Fazea, A. H. Sable, S. Lee, and M. A. Al-antari, “ETECADx: Ensemble Self-Attention Transformer Encoder for Breast Cancer Diagnosis Using Full-Field Digital X-ray Breast Images,” Diagnostics, vol. 13, no. 1, p. 89, Dec. 2022, doi: 10.3390/diagnostics13010089.

    [2] R. M. Al-Tam, A. M. Al-Hejri, S. S. Alshamrani, M. A. Al-antari, and S. M. Narangale, “Multimodal breast cancer hybrid explainable computer-aided diagnosis using medical mammograms and ultrasound Images,” Biocybern. Biomed. Eng., vol. 44, no. 3, pp. 731–758, Jul. 2024, doi: 10.1016/j.bbe.2024.08.007.

  13. i

    Breast Cancer Dataset

    • ieee-dataport.org
    Updated Sep 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    fan tonghuan (2025). Breast Cancer Dataset [Dataset]. https://ieee-dataport.org/documents/breast-cancer-dataset
    Explore at:
    Dataset updated
    Sep 14, 2025
    Authors
    fan tonghuan
    Description

    Pathological Section of Breast Cancer Cells

  14. d

    Breast cancer dataset

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Sep 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tellaroli, Paola (2024). Breast cancer dataset [Dataset]. http://doi.org/10.7910/DVN/JBVFGV
    Explore at:
    Dataset updated
    Sep 24, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Tellaroli, Paola
    Description

    Breast cancer dataset cited in 'Cross-Clustering: A Partial Clustering Algorithm with Automatic Estimation of the Number of Clusters'

  15. h

    PICO-breast-cancer

    • huggingface.co
    Updated Feb 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carlos Cuevas Villarmin (2024). PICO-breast-cancer [Dataset]. https://huggingface.co/datasets/cuevascarlos/PICO-breast-cancer
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 12, 2024
    Authors
    Carlos Cuevas Villarmin
    Description

    PICO breast cancer dataset

    This dataset has been extracted from PICO-Corpus. The corpus consists of 1,011 abstracts of breast cancer randomized controlled trials extracted from PubMed. The PICO breast cancer dataset contains a total of 26 entities, compared to the usual 4 found in PICO corpora. Specifically, the following image extracted by the dataset's authors shows the hierarchy of the entities. The preprocessed dataset, ready to serve as inputs for MLMs such as BERT-like… See the full description on the dataset page: https://huggingface.co/datasets/cuevascarlos/PICO-breast-cancer.

  16. BreastHis - Breast Cancer Histopathological

    • kaggle.com
    Updated May 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tathagat Banerjee (2024). BreastHis - Breast Cancer Histopathological [Dataset]. https://www.kaggle.com/datasets/tathagatbanerjee/breakhis-breast-cancer-histopathological
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 29, 2024
    Dataset provided by
    Kaggle
    Authors
    Tathagat Banerjee
    Description

    This dataset was created to be used in machine learning tasks for both binary and multiclass classification problems. It consists of 7,909 microscopic images of breast tumor tissue, captured using different magnification factors (40X, 100X, 200X, and 400X). The images in this dataset were resized to 224x224 pixels and organized according to binary and multiclass classification tasks.

    THIS DATASET IS ARCHIVED AT DANS/EASY, BUT NOT ACCESSIBLE HERE. TO VIEW A LIST OF FILES AND ACCESS THE FILES IN THIS DATASET CLICK ON THE DOI-LINK ABOVE

    Interdisciplinary s... Identifier DOI https://doi.org/10.17632/jxwvdwhpc2.1 PID https://nbn-resolving.org/urn:nbn:nl:ui:13-i2-oi2g Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:301675 Provenance Creator Pereira, M Publisher Data Archiving and Networked Services (DANS) Contributor Mayke Pereira Publication Year 2023 Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0; http://creativecommons.org/licenses/by/4.0 OpenAccess true Representation Resource Type Dataset Discipline Other

    Pereira, Mayke (2023), “BreakHis - Breast Cancer Histopathological Database”, Mendeley Data, V1, doi: 10.17632/jxwvdwhpc2.1

  17. c

    Multimodal imaging of ductal carcinoma in situ with microinvasion

    • cancerimagingarchive.net
    • stage.cancerimagingarchive.net
    n/a +1
    Updated Dec 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Cancer Imaging Archive (2023). Multimodal imaging of ductal carcinoma in situ with microinvasion [Dataset]. http://doi.org/10.7937/3fyc-ac78
    Explore at:
    svs, tiff, and xml, n/aAvailable download formats
    Dataset updated
    Dec 8, 2023
    Dataset authored and provided by
    The Cancer Imaging Archive
    License

    https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/

    Time period covered
    Dec 8, 2023
    Dataset funded by
    National Cancer Institutehttp://www.cancer.gov/
    Description

    Ductal carcinoma in situ with microinvasion (DCISM) is a challenging subtype of breast cancer with controversial invasiveness and prognosis. Accurate diagnosis of DCISM from ductal carcinoma in situ (DCIS) is crucial for optimal treatment and improved clinical outcomes. This dataset provides histopathology images and paired CK5/6 immunohistochemical staining images from patients with DCISM, as well as multiphoton microscopy images of suspicious regions. It offers multi-modal imaging data from various perspectives for analysis and diagnosis of microinvasive breast cancer by other researchers in the field.

    The dataset contains data from 12 breast cancer patients, including 10 cases of ductal carcinoma in situ with microinvasion (DCISM), 1 case of ductal carcinoma in situ (DCIS), and 1 case of invasive breast cancer.

    The magnification of the glass slide images is 40x. The pathology slide scanner used was created by the Sunny Optical Technology (group) Co., Ltd., and the pixel aspect ratio of the images is 1. The dataset also includes multiphoton microscopy imaging of suspicious microinvasion areas. The multiphoton imaging system was manufactured by Zeiss, and it also has a pixel aspect ratio of 1.

    Our database was specifically collected for the use of imaging methods in diagnosing DICSM. The suffixes in each case number indicate the patient's condition - "DCISM" for ductal carcinoma in situ with microinvasion, "DCIS" for ductal carcinoma in situ, and "IDC" for invasive ductal carcinoma. Apart from these labels, we have not collected any additional clinical information for these cases.

  18. Breast Cancer Dataset

    • figshare.com
    • zenodo.org
    txt
    Updated Jan 20, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rafael Pinto (2016). Breast Cancer Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.1552017.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jan 20, 2016
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Rafael Pinto
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Breast Cancer Dataset

  19. i

    King Abdulaziz University Breast Cancer Mammogram Dataset

    • ieee-dataport.org
    Updated Nov 16, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Snehal Sapkale (2025). King Abdulaziz University Breast Cancer Mammogram Dataset [Dataset]. https://ieee-dataport.org/documents/king-abdulaziz-university-breast-cancer-mammogram-dataset
    Explore at:
    Dataset updated
    Nov 16, 2025
    Authors
    Snehal Sapkale
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    categorizing

  20. Data from: Breast cancer pathway

    • wikipathways.org
    • sandbox.wikipathways.org
    Updated Feb 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WikiPathways (2023). Breast cancer pathway [Dataset]. https://www.wikipathways.org/pathways/WP4262.html
    Explore at:
    Dataset updated
    Feb 16, 2023
    Dataset authored and provided by
    WikiPathwayshttp://wikipathways.org/
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Breast cancer is the leading cause of cancer death among women worldwide. The vast majority of breast cancers are carcinomas that originate from cells lining the milk-forming ducts of the mammary gland. The molecular subtypes of breast cancer, which are based on the presence or absence of hormone receptors (estrogen and progesterone subtypes) and human epidermal growth factor receptor-2 (HER2), include: * Luminal A subtype: Hormone receptor positive (progesterone and estrogen) and HER2 (ERBB2) negative * Luminal B subtype: Hormone receptor positive (progesterone and estrogen) and HER2 (ERBB2) positive * HER2 positive: Hormone receptor negative (progesterone and estrogen) and HER2 (ERBB2) positive * Basal-like or triple-negative (TNBCs): Hormone receptor negative (progesterone and estrogen) and HER2 (ERBB2) negative Hormone receptor positive breast cancers are largely driven by the estrogen/ER pathway. In HER2 positive breast tumors, HER2 activates the PI3K/AKT and the RAS/RAF/MAPK pathways, and stimulate cell growth, survival and differentiation. In patients suffering from TNBC, the deregulation of various signaling pathways (Notch and Wnt/beta-catenin), EGFR protein have been confirmed. In the case of breast cancer only 8% of all cancers are hereditary, a phenomenon linked to genetic changes in BRCA1 or BRCA2. Somatic mutations in only three genes (TP53, PIK3CA and GATA3) occurred at >10% incidence across all breast cancers. Phosphorylation sites were added based on information from PhosphoSitePlus (R), www.phosphosite.org.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
scikit-learn (2025). breast-cancer-wisconsin [Dataset]. https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin

Data from: breast-cancer-wisconsin

scikit-learn/breast-cancer-wisconsin

Related Article
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 26, 2025
Dataset authored and provided by
scikit-learn
License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Breast Cancer Wisconsin Diagnostic Dataset

Following description was retrieved from breast cancer dataset on UCI machine learning repository. Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at here. Separating plane described above was obtained using Multisurface Method-Tree (MSM-T), a classification method which uses linear… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin.

Search
Clear search
Close search
Google apps
Main menu