100+ datasets found

h
Data from: breast-cancer-wisconsin
huggingface.co
Updated May 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
scikit-learn (2025). breast-cancer-wisconsin [Dataset]. https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 26, 2025
Dataset authored and provided by
scikit-learn
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Breast Cancer Wisconsin Diagnostic Dataset

Following description was retrieved from breast cancer dataset on UCI machine learning repository. Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at here. Separating plane described above was obtained using Multisurface Method-Tree (MSM-T), a classification method which uses linear… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin.
Breast Cancer Dataset
kaggle.com
zip
Updated Feb 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Figo (2024). Breast Cancer Dataset [Dataset]. https://www.kaggle.com/datasets/figolm10/breast-cancer-dataset
Explore at:
zip(49831 bytes)Available download formats
Dataset updated
Feb 27, 2024
Authors
Figo
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by Figo

Released under MIT

Contents
c
Breast Cancer Dataset
cubig.ai
zip
Updated May 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2025). Breast Cancer Dataset [Dataset]. https://cubig.ai/store/products/178/breast-cancer-dataset
Explore at:
zipAvailable download formats
Dataset updated
May 2, 2025
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
Description
1) Data Introduction • The Breast Cancer Wisconsin (Diagnostic) data focuses on distinguishing between malignant (cancerous) and benign (non-cancerous) breast tumors. This dataset is crucial for developing machine learning models to aid in the early detection and classification of breast cancer, thereby potentially saving lives through timely intervention.

2) Data Utilization (1) Breast cancer data has characteristics that: • The dataset contains various features extracted from digitized images of fine needle aspirate (FNA) of breast masses, allowing for detailed analysis and classification of tumors. (2) Breast cancer data can be used to: • Healthcare and Medical Research: Useful for developing diagnostic tools and models to accurately classify breast tumors, aiding healthcare providers in making informed decisions. • Machine Learning and AI Development: Assists in creating and fine-tuning machine learning algorithms to improve predictive accuracy in medical diagnostics.
Breast Cancer
kaggle.com
Updated Aug 8, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Reihaneh Namdari (2022). Breast Cancer [Dataset]. https://www.kaggle.com/datasets/reihanenamdari/breast-cancer
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 8, 2022
Dataset provided by
Kaggle
Authors
Reihaneh Namdari
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset of breast cancer patients was obtained from the 2017 November update of the SEER Program of the NCI, which provides information on population-based cancer statistics. The dataset involved female patients with infiltrating duct and lobular carcinoma breast cancer (SEER primary cites recode NOS histology codes 8522/3) diagnosed in 2006-2010. Patients with unknown tumour size, examined regional LNs, positive regional LNs, and patients whose survival months were less than 1 month were excluded; thus, 4024 patients were ultimately included.

Breast Cancer Dataset [Wisconsin Diagnostic UCI]

kaggle.com

zip

Updated Jan 22, 2024

Facebook

Twitter

Click to copy link

Link copied

Cite

Abhinav Mangalore (2024). Breast Cancer Dataset [Wisconsin Diagnostic UCI] [Dataset]. https://www.kaggle.com/datasets/abhinavmangalore/breast-cancer-dataset-wisconsin-diagnostic-uci

Explore at:

zip(49831 bytes)Available download formats

Dataset updated

Jan 22, 2024

Authors

Abhinav Mangalore

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered

Wisconsin

Description

This dataset is taken from the UCI Machine Learning Repository (Link: https://data.world/health/breast-cancer-wisconsin) by the Donor: Nick Street

The main idea and inspiration behind the upload was to provide datasets for Machine Learning as practice and reference for my peers at college. The main purpose is to analyze data and experiment with different machine learning ideas and techniques for this binary classification task. As such, this dataset is a very useful resource to practice on.

Breast cancer is when breast cells mutate and become cancerous cells that multiply and form tumors. It accounts for 25% of all cancer cases and affected over 2.1 Million people in 2015 alone. Breast cancer typically affects women and people assigned female at birth (AFAB) age 50 and older, but it can also affect men and people assigned male at birth (AMAB), as well as younger women. Healthcare providers may treat breast cancer with surgery to remove tumors or treatment to kill cancerous cells.

Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at http://www.cs.wisc.edu/~street/images/

The task: To classify whether the tumor is benign (B) or malignant (M).

Relevant information

Features are computed from a digitized image of a fine needle
aspirate (FNA) of a breast mass. They describe
characteristics of the cell nuclei present in the image.
A few of the images can be found at
http://www.cs.wisc.edu/~street/images/

Separating plane described above was obtained using
Multisurface Method-Tree (MSM-T) [K. P. Bennett, "Decision Tree
Construction Via Linear Programming." Proceedings of the 4th
Midwest Artificial Intelligence and Cognitive Science Society,
pp. 97-101, 1992], a classification method which uses linear
programming to construct a decision tree. Relevant features
were selected using an exhaustive search in the space of 1-4
features and 1-3 separating planes.

The actual linear program used to obtain the separating plane
in the 3-dimensional space is that described in:
[K. P. Bennett and O. L. Mangasarian: "Robust Linear
Programming Discrimination of Two Linearly Inseparable Sets",
Optimization Methods and Software 1, 1992, 23-34].


This database is also available through the UW CS ftp server:

ftp ftp.cs.wisc.edu
cd math-prog/cpo-dataset/machine-learn/WDBC/

Number of instances: 569

Number of attributes: 32 (ID, diagnosis, 30 real-valued input features)

Original Creators:

Dr. William H. Wolberg, General Surgery Dept., University of
Wisconsin, Clinical Sciences Center, Madison, WI 53792
wolberg@eagle.surgery.wisc.edu

W. Nick Street, Computer Sciences Dept., University of
Wisconsin, 1210 West Dayton St., Madison, WI 53706
street@cs.wisc.edu 608-262-6619

Olvi L. Mangasarian, Computer Sciences Dept., University of
Wisconsin, 1210 West Dayton St., Madison, WI 53706
olvi@cs.wisc.edu

Donor: Nick Street

Date: November 1995

Past Usage:

first usage:

W.N. Street, W.H. Wolberg and O.L. Mangasarian 
Nuclear feature extraction for breast tumor diagnosis.
IS&T/SPIE 1993 International Symposium on Electronic Imaging: Science
and Technology, volume 1905, pages 861-870, San Jose, CA, 1993.

OR literature:

O.L. Mangasarian, W.N. Street and W.H. Wolberg. 
Breast cancer diagnosis and prognosis via linear programming. 
Operations Research, 43(4), pages 570-577, July-August 1995.

Medical literature:

W.H. Wolberg, W.N. Street, and O.L. Mangasarian. 
Machine learning techniques to diagnose breast cancer from
fine-needle aspirates. 
Cancer Letters 77 (1994) 163-171.

W.H. Wolberg, W.N. Street, and O.L. Mangasarian. 
Image analysis and machine learning applied to breast cancer
diagnosis and prognosis. 
Analytical and Quantitative Cytology and Histology, Vol. 17
No. 2, pages 77-87, April 1995. 

W.H. Wolberg, W.N. Street, D.M. Heisey, and O.L. Mangasarian. 
Computerized breast cancer diagnosis and prognosis from fine
needle aspirates. 
Archives of Surgery 1995;130:511-516.

W.H. Wolberg, W.N. Street, D.M. Heisey, and O.L. Mangasarian. 
Computer-derived nuclear features distinguish malignant from
benign breast cytology. 
Human Pathology, 26:792--796, 1995.

p
Breast Cancer Prediction Dataset - Dataset - CKAN
data.poltekkes-smg.ac.id
Updated Oct 7, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Breast Cancer Prediction Dataset - Dataset - CKAN [Dataset]. https://data.poltekkes-smg.ac.id/dataset/breast-cancer-prediction-dataset
Explore at:
Dataset updated
Oct 7, 2024
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Worldwide, breast cancer is the most common type of cancer in women and the second highest in terms of mortality rates.Diagnosis of breast cancer is performed when an abnormal lump is found (from self-examination or x-ray) or a tiny speck of calcium is seen (on an x-ray). After a suspicious lump is found, the doctor will conduct a diagnosis to determine whether it is cancerous and, if so, whether it has spread to other parts of the body. This breast cancer dataset was obtained from the University of Wisconsin Hospitals, Madison from Dr. William H. Wolberg.
Breast Cancer Diagnostic Dataset (BCD)
kaggle.com
zip
Updated Oct 26, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dev Raikwar (2021). Breast Cancer Diagnostic Dataset (BCD) [Dataset]. https://www.kaggle.com/datasets/devraikwar/breast-cancer-diagnostic
Explore at:
zip(2081 bytes)Available download formats
Dataset updated
Oct 26, 2021
Authors
Dev Raikwar
Description
Context

The resources for this dataset can be found at https://www.openml.org/d/13 and https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

Content

This data set includes 201 instances of one class and 85 instances of another class. The instances are described by 9 attributes, some of which are linear and some are nominal.

Number of Instances: 286

Number of Attributes: 9 + the class attribute

Attribute Information:

Class: no-recurrence-events, recurrence-events age: 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99. menopause: lt40, ge40, premeno. tumor-size: 0-4, 5-9, 10-14, 15-19, 20-24, 25-29, 30-34, 35-39, 40-44, 45-49, 50-54, 55-59. inv-nodes: 0-2, 3-5, 6-8, 9-11, 12-14, 15-17, 18-20, 21-23, 24-26, 27-29, 30-32, 33-35, 36-39. node-caps: yes, no. deg-malig: 1, 2, 3. breast: left, right. breast-quad: left-up, left-low, right-up, right-low, central. irradiat: yes, no.

Missing Attribute Values: (denoted by “?”) Attribute #: Number of instances with missing values: 6. 8 9. 1.

Class Distribution:

no-recurrence-events: 201 instances recurrence-events: 85 instances

Acknowledgements

Original data https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

Inspiration

With the attributes described above, can you predict if a patient has recurrence event ?
h
breast-cancer-dataset
huggingface.co
Updated Oct 6, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Beijing Institute of Technology (2024). breast-cancer-dataset [Dataset]. https://huggingface.co/datasets/BIT/breast-cancer-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 6, 2024
Dataset authored and provided by
Beijing Institute of Technology
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
BIT/breast-cancer-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
i
SEER Breast Cancer Data
ieee-dataport.org
data.niaid.nih.gov
Updated Jul 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
jing teng (2025). SEER Breast Cancer Data [Dataset]. https://ieee-dataport.org/open-access/seer-breast-cancer-data
Explore at:
Dataset updated
Jul 29, 2025
Authors
jing teng
Description
examined regional LNs
Digital mammography Dataset for Breast Cancer Diagnosis Research (DMID)
figshare.com
zip
Updated Nov 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Parita Oza; Rajiv Oza; Urvi Oza; Paawan Sharma; Samir Patel; Pankaj Kumar; Bakul Gohel (2023). Digital mammography Dataset for Breast Cancer Diagnosis Research (DMID) [Dataset]. http://doi.org/10.6084/m9.figshare.24522883.v2
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.24522883.v2
Dataset updated
Nov 8, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
Parita Oza; Rajiv Oza; Urvi Oza; Paawan Sharma; Samir Patel; Pankaj Kumar; Bakul Gohel
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains images of mammograms and can be used for research and education purposes only. The dataset contains DCM images, TIFF images, a Radiology report, a Segmented mask, and pixel level annotation on abnormal regions and csv file that contains other metadata.
h
breast-cancer
huggingface.co
opendatalab.com
Updated Sep 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Niels Rogge (2023). breast-cancer [Dataset]. https://huggingface.co/datasets/nielsr/breast-cancer
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 16, 2023
Authors
Niels Rogge
Description
Dataset Card for "breast-cancer"

Dataset was taken from the MedSAM project and used in this notebook which fine-tunes Meta's SAM model on the dataset. More Information needed
m
AISSLab Breast Cancer Dataset: Toward General AI Harmonization with Real...
data.mendeley.com
springermedizin.de
Updated Jul 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aymen Al-Hejri (2025). AISSLab Breast Cancer Dataset: Toward General AI Harmonization with Real Mammogram Imaging [Dataset]. http://doi.org/10.17632/zp8yfhvndv.2
Explore at:
Unique identifier
https://doi.org/10.17632/zp8yfhvndv.2
Dataset updated
Jul 15, 2025
Authors
Aymen Al-Hejri
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The AISSLab Breast Cancer Dataset is a collection of mammogram images by experts from the Ma'amon's Diagnostic Centre Mammogram Images for Breast Cancer (MDCMI-BC) in Yemen. It is designed to support advancements in breast cancer research and computer-aided diagnosis (CAD) systems. To facilitate research in breast cancer detection, focusing on harmonizing AI with diverse imaging data. This dataset emphasizes improving diagnostic accuracy and is available for academic and clinical research applications.

If you are using this dataset for research purpose kindly cite the following papers:

[1] A. M. Al-Hejri, R. M. Al-Tam, M. Fazea, A. H. Sable, S. Lee, and M. A. Al-antari, “ETECADx: Ensemble Self-Attention Transformer Encoder for Breast Cancer Diagnosis Using Full-Field Digital X-ray Breast Images,” Diagnostics, vol. 13, no. 1, p. 89, Dec. 2022, doi: 10.3390/diagnostics13010089.

[2] R. M. Al-Tam, A. M. Al-Hejri, S. S. Alshamrani, M. A. Al-antari, and S. M. Narangale, “Multimodal breast cancer hybrid explainable computer-aided diagnosis using medical mammograms and ultrasound Images,” Biocybern. Biomed. Eng., vol. 44, no. 3, pp. 731–758, Jul. 2024, doi: 10.1016/j.bbe.2024.08.007.
Breast Cancer Dataset
figshare.com
zenodo.org
txt
Updated Jan 20, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rafael Pinto (2016). Breast Cancer Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.1552017.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.1552017.v1
Dataset updated
Jan 20, 2016
Dataset provided by
Figsharehttp://figshare.com/
Authors
Rafael Pinto
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Breast Cancer Dataset
h
PICO-breast-cancer
huggingface.co
Updated Feb 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Carlos Cuevas Villarmin (2024). PICO-breast-cancer [Dataset]. https://huggingface.co/datasets/cuevascarlos/PICO-breast-cancer
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 12, 2024
Authors
Carlos Cuevas Villarmin
Description
PICO breast cancer dataset

This dataset has been extracted from PICO-Corpus. The corpus consists of 1,011 abstracts of breast cancer randomized controlled trials extracted from PubMed. The PICO breast cancer dataset contains a total of 26 entities, compared to the usual 4 found in PICO corpora. Specifically, the following image extracted by the dataset's authors shows the hierarchy of the entities. The preprocessed dataset, ready to serve as inputs for MLMs such as BERT-like… See the full description on the dataset page: https://huggingface.co/datasets/cuevascarlos/PICO-breast-cancer.
m
BreakHis - Breast Cancer Histopathological Database
data.mendeley.com
Updated Jun 21, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mayke Pereira (2023). BreakHis - Breast Cancer Histopathological Database [Dataset]. http://doi.org/10.17632/jxwvdwhpc2.1
Explore at:
Unique identifier
https://doi.org/10.17632/jxwvdwhpc2.1
Dataset updated
Jun 21, 2023
Authors
Mayke Pereira
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset was created to be used in machine learning tasks for both binary and multiclass classification problems. It consists of 7,909 microscopic images of breast tumor tissue, captured using different magnification factors (40X, 100X, 200X, and 400X). The images in this dataset were resized to 224x224 pixels and organized according to binary and multiclass classification tasks.
H
Breast cancer dataset
dataverse.harvard.edu
dataone.org
Updated May 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paola Tellaroli (2024). Breast cancer dataset [Dataset]. http://doi.org/10.7910/DVN/JBVFGV
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/JBVFGV
Dataset updated
May 18, 2024
Dataset provided by
Harvard Dataverse
Authors
Paola Tellaroli
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Breast cancer dataset cited in 'Cross-Clustering: A Partial Clustering Algorithm with Automatic Estimation of the Number of Clusters'
h
wisconsin-breast-cancer
huggingface.co
Updated Feb 1, 2001
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Witold Wydmański (2001). wisconsin-breast-cancer [Dataset]. https://huggingface.co/datasets/wwydmanski/wisconsin-breast-cancer
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 1, 2001
Authors
Witold Wydmański
Area covered
Wisconsin
Description
Source:

Copied from the original dataset

Creators:

Dr. William H. Wolberg, General Surgery Dept. University of Wisconsin, Clinical Sciences Center Madison, WI 53792 wolberg '@' eagle.surgery.wisc.edu

W. Nick Street, Computer Sciences Dept. University of Wisconsin, 1210 West Dayton St., Madison, WI 53706 street '@' cs.wisc.edu 608-262-6619

Olvi L. Mangasarian, Computer Sciences Dept. University of Wisconsin, 1210 West Dayton St., Madison, WI 53706 olvi '@' cs.wisc.edu… See the full description on the dataset page: https://huggingface.co/datasets/wwydmanski/wisconsin-breast-cancer.
Data from: BreCaHAD: A Dataset for Breast Cancer Histopathological...
figshare.com
png
Updated Jan 28, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alper Aksac; Douglas J. Demetrick; Tansel Özyer; Reda Alhajj (2019). BreCaHAD: A Dataset for Breast Cancer Histopathological Annotation and Diagnosis [Dataset]. http://doi.org/10.6084/m9.figshare.7379186.v3
Explore at:
pngAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7379186.v3
Dataset updated
Jan 28, 2019
Dataset provided by
Figsharehttp://figshare.com/
Authors
Alper Aksac; Douglas J. Demetrick; Tansel Özyer; Reda Alhajj
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset consists of 1 .xlsx file, 2 .png files, 1 .json file and 1 .zip file:annotation_details.xlsx: The distribution of annotations in the previously mentioned six classes (mitosis, apoptosis, tumor nuclei, non-tumor nuclei, tubule, and non-tubule) is presented in a Excel spreadsheet.original.png: The input image.annotated.png: An example from the dataset. In the annotated image, blue circles indicate the tumor nuclei, pink circles show non-tumor nuclei such as blood cells, stroma nuclei, and lymphocytes; orange and green circles are mitosis and apoptosis, respectively; light blue circles are true lumen for tubules, and yellow circles represent white regions (non-lumen) such as fat, blood vessel, and broken tissues.data.json: The annotations for the BreCaHAD dataset are provided in JSON (JavaScript Object Notation) format. In the given example, the JSON file (ground truth) contains two mitosis and only one tumor nuclei annotations. Here, x and y are the coordinates of the centroid of the annotated object, and the values are between 0, 1.BreCaHAD.zip: An archive file containing dataset. Three folders are included: images (original images), groundTruth (json files), and groundTruth_display (groundTruth applied on original images)

Breast Cancer Diagnosis Dataset - Wisconsin State

kaggle.com

zip

Updated Mar 31, 2024

Facebook

Twitter

Click to copy link

Link copied

Cite

Saurabh Badole (2024). Breast Cancer Diagnosis Dataset - Wisconsin State [Dataset]. https://www.kaggle.com/datasets/saurabhbadole/breast-cancer-wisconsin-state

Explore at:

zip(5844 bytes)Available download formats

Dataset updated

Mar 31, 2024

Authors

Saurabh Badole

Area covered

Wisconsin

Description

Description:

Explore the field of breast cancer diagnosis with the insightful Wisconsin Breast Cancer dataset (Original). This dataset provides detailed attributes representing tumor characteristics observed in breast tissue samples. By analyzing these attributes, researchers and medical professionals can gain insights into tumor behavior and develop predictive models for cancer detection and prognosis.

Features
1. Sample code number: Unique identifier for each tissue sample.
2. Clump Thickness: Assessment of the thickness of tumor cell clusters (1 - 10).
3. Uniformity of Cell Size: Uniformity in the size of tumor cells (1 - 10).
4. Uniformity of Cell Shape: Uniformity in the shape of tumor cells (1 - 10).
5. Marginal Adhesion: Degree of adhesion of tumor cells to surrounding tissue (1 - 10).
6. Single Epithelial Cell Size: Size of individual tumor cells (1 - 10).
7. Bare Nuclei: Presence of nuclei without surrounding cytoplasm (1 - 10).
8. Bland Chromatin: Assessment of chromatin structure in tumor cells (1 - 10).
9. Normal Nucleoli: Presence of normal-looking nucleoli in tumor cells (1 - 10).
10. Mitoses: Frequency of mitotic cell divisions (1 - 10).
11. Class: Classification of tumor type (2 for benign, 4 for malignant).

Usage:

Cancer diagnosis: Develop machine learning models to classify tumors as benign or malignant based on their characteristics, aiding in early detection and treatment planning.
Feature importance analysis: Identify key attributes contributing to tumor malignancy and understand their biological significance.
Clinical decision support: Assist healthcare professionals in interpreting biopsy results and making informed decisions about patient care.

Acknowledgements:

The Breast Cancer Wisconsin dataset is sourced from tissue samples collected for diagnostic purposes, with attributes derived from microscopic examination. The dataset is anonymized and made available for research purposes, contributing to advancements in cancer diagnosis and treatment.

i
King Abdulaziz University Breast Cancer Mammogram Dataset
ieee-dataport.org
Updated Nov 16, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Snehal Sapkale (2025). King Abdulaziz University Breast Cancer Mammogram Dataset [Dataset]. https://ieee-dataport.org/documents/king-abdulaziz-university-breast-cancer-mammogram-dataset
Explore at:
Dataset updated
Nov 16, 2025
Authors
Snehal Sapkale
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
categorizing

Facebook

Twitter

Click to copy link

Link copied

Cite

scikit-learn (2025). breast-cancer-wisconsin [Dataset]. https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin

Data from: breast-cancer-wisconsin

scikit-learn/breast-cancer-wisconsin

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

May 26, 2025

Dataset authored and provided by

scikit-learn

License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Breast Cancer Wisconsin Diagnostic Dataset

Following description was retrieved from breast cancer dataset on UCI machine learning repository. Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at here. Separating plane described above was obtained using Multisurface Method-Tree (MSM-T), a classification method which uses linear… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/breast-cancer-wisconsin.

Clear search

Close search

Google apps

Main menu

Data from: breast-cancer-wisconsin

Breast Cancer Dataset

Dataset

Contents

Breast Cancer Dataset

Breast Cancer

Breast Cancer Dataset [Wisconsin Diagnostic UCI]

Breast Cancer Prediction Dataset - Dataset - CKAN

Breast Cancer Diagnostic Dataset (BCD)

Context

Content

Acknowledgements

Inspiration

breast-cancer-dataset

SEER Breast Cancer Data

Digital mammography Dataset for Breast Cancer Diagnosis Research (DMID)

breast-cancer

AISSLab Breast Cancer Dataset: Toward General AI Harmonization with Real...

Breast Cancer Dataset

PICO-breast-cancer

BreakHis - Breast Cancer Histopathological Database

Breast cancer dataset

wisconsin-breast-cancer

Data from: BreCaHAD: A Dataset for Breast Cancer Histopathological...

Breast Cancer Diagnosis Dataset - Wisconsin State

Description:

Usage:

Acknowledgements:

King Abdulaziz University Breast Cancer Mammogram Dataset

Data from: breast-cancer-wisconsin

scikit-learn/breast-cancer-wisconsin