48 datasets found
  1. D

    Skin Cancer: HAM10000 Dataset

    • datasetninja.com
    Updated Jan 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tschandl, Philipp; Cliff Rosendahl; Harald Kittler (2024). Skin Cancer: HAM10000 Dataset [Dataset]. https://datasetninja.com/skin-cancer-ham10000
    Explore at:
    Dataset updated
    Jan 21, 2024
    Dataset provided by
    Dataset Ninja
    Authors
    Tschandl, Philipp; Cliff Rosendahl; Harald Kittler
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    To address the challenges of training neural networks for automated diagnosis of pigmented skin lesions, the authors introduced the HAM10000 ("Human Against Machine with 10000 training images") dataset. This dataset aimed to overcome the limitations of small-sized and homogeneous dermatoscopic image datasets by providing a diverse and extensive collection. To achieve this, they collected dermatoscopic images from various populations using different modalities, which necessitated employing distinct acquisition and cleaning methods. The authors also designed semi-automatic workflows that incorporated specialized neural networks to enhance the dataset's quality. The resulting HAM10000 dataset comprised 10,015 dermatoscopic images, which were made available for academic machine learning applications through the ISIC archive. This dataset served as a benchmark for machine learning experiments and comparisons with human experts.

  2. a

    HAM10000

    • datasets.activeloop.ai
    • opendatalab.com
    deeplake
    Updated Mar 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tschandl, Philipp (2022). HAM10000 [Dataset]. http://doi.org/10.7910/DVN/DBW86T
    Explore at:
    deeplakeAvailable download formats
    Dataset updated
    Mar 24, 2022
    Authors
    Tschandl, Philipp
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The HAM10000 Dataset is a dataset of skin lesion images. It is a popular dataset for skin cancer classification research. The dataset consists of 10,000 images of skin lesions, each of which is labeled with one of seven different types of skin cancer.

  3. a

    Data from: The HAM10000 dataset, a large collection of multi-source...

    • academictorrents.com
    bittorrent
    Updated Jul 29, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Philipp Tschandl (2022). The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions [Dataset]. https://academictorrents.com/details/dc3188ee1ce7e2d2254113111b406c484101ba65
    Explore at:
    bittorrent(3203576126)Available download formats
    Dataset updated
    Jul 29, 2022
    Dataset authored and provided by
    Philipp Tschandl
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Training of neural networks for automated diagnosis of pigmented skin lesions is hampered by the small size and lack of diversity of available dataset of dermatoscopic images. We tackle this problem by releasing the HAM10000 ("Human Against Machine with 10000 training images") dataset. We collected dermatoscopic images from different populations, acquired and stored by different modalities. The final dataset consists of 10015 dermatoscopic images which can serve as a training set for academic machine learning purposes. Cases include a representative collection of all important diagnostic categories in the realm of pigmented lesions: Actinic keratoses and intraepithelial carcinoma / Bowen s disease (akiec), basal cell carcinoma (bcc), benign keratosis-like lesions (solar lentigines / seborrheic keratoses and lichen-planus like keratoses, bkl), dermatofibroma (df), melanoma (mel), melanocytic nevi (nv) and vascular lesions (angiomas, angiokeratomas, pyogenic granulomas and hemo

  4. h

    skin_cancer

    • huggingface.co
    Updated Jan 26, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Low (2023). skin_cancer [Dataset]. https://huggingface.co/datasets/marmal88/skin_cancer
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 26, 2023
    Authors
    Daniel Low
    Description

    The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions

    Original Paper and Dataset here Kaggle dataset here

      Introduction to datasets
    

    Training of neural networks for automated diagnosis of pigmented skin lesions is hampered by the small size and lack of diversity of available dataset of dermatoscopic images. We tackle this problem by releasing the HAM10000 ("Human Against Machine with 10000 training images") dataset.… See the full description on the dataset page: https://huggingface.co/datasets/marmal88/skin_cancer.

  5. R

    Ham10000 Skin Lesions Annotation Dataset

    • universe.roboflow.com
    zip
    Updated Jul 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    deeplearningproject (2025). Ham10000 Skin Lesions Annotation Dataset [Dataset]. https://universe.roboflow.com/deeplearningproject-7393p/ham10000-skin-lesions-annotation
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 27, 2025
    Dataset authored and provided by
    deeplearningproject
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Lesions Polygons
    Description

    Ham10000 Skin Lesions Annotation

    ## Overview
    
    Ham10000 Skin Lesions Annotation is a dataset for instance segmentation tasks - it contains Lesions annotations for 298 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  6. Skin Disease Detection Dataset (HAM10000 + ISIC)

    • kaggle.com
    Updated May 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nour12347653 (2025). Skin Disease Detection Dataset (HAM10000 + ISIC) [Dataset]. https://www.kaggle.com/datasets/nour12347653/skin-disease-detection-dataset-ham10000-isic
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 20, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    nour12347653
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This dataset is a cleaned and preprocessed combination of the HAM10000 and ISIC Archive dermoscopic image datasets, intended for training and evaluating deep learning models for skin lesion classification.

    It is structured to support multi-class image classification, and has been carefully processed to maintain high quality, class balance.

    Classes Included :

    "melanocytic nevi": "Melanocytic Nevus", "nv": "Melanocytic Nevus", "melanoma": "Melanoma", "mel": "Melanoma", "benign keratosis": "Benign Keratosis", "bkl": "Benign Keratosis", "basal cell carcinoma": "Basal Cell Carcinoma", "bcc": "Basal Cell Carcinoma", "actinic keratosis": "Actinic Keratosis", "akiec": "Actinic Keratosis", "dermatofibroma": "Dermatofibroma", "df": "Dermatofibroma", "vascular lesions": "Vascular Lesion", "vasc": "Vascular Lesion", "warts/molluscum": "Warts/Molluscum"

    Preprocessing Notes

    • All images resized to 224x224 for CNN compatibility
    • Labels unified and cleaned across both datasets
    • Invalid or corrupted entries removed
  7. h

    HAM10000

    • huggingface.co
    Updated Nov 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Takashi Nagaoka (2024). HAM10000 [Dataset]. https://huggingface.co/datasets/Nagabu/HAM10000
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 6, 2024
    Authors
    Takashi Nagaoka
    Description

    Nagabu/HAM10000 dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. Ham10000 Dataset

    • universe.roboflow.com
    zip
    Updated Apr 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Culinda (2025). Ham10000 Dataset [Dataset]. https://universe.roboflow.com/culinda/ham10000/model/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 3, 2025
    Dataset provided by
    Culinda Inc.
    Authors
    Culinda
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Skin Disease Bounding Boxes
    Description

    HAM10000

    ## Overview
    
    HAM10000 is a dataset for object detection tasks - it contains Skin Disease annotations for 1,036 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  9. R

    My Ham10000 University Dataset

    • universe.roboflow.com
    zip
    Updated Jul 4, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MyHAM10000Universitydiannuswantoro (2025). My Ham10000 University Dataset [Dataset]. https://universe.roboflow.com/myham10000universitydiannuswantoro/my-ham10000-university
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 4, 2025
    Dataset authored and provided by
    MyHAM10000Universitydiannuswantoro
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Objects Bounding Boxes
    Description

    My HAM10000 University

    ## Overview
    
    My HAM10000 University is a dataset for object detection tasks - it contains Objects annotations for 6,999 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  10. i

    Synthetic HAM10000 Extension Dataset with Skin Tone Balance

    • ieee-dataport.org
    Updated Aug 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Safrina Kabir (2025). Synthetic HAM10000 Extension Dataset with Skin Tone Balance [Dataset]. https://ieee-dataport.org/documents/synthetic-ham10000-extension-dataset-skin-tone-balance
    Explore at:
    Dataset updated
    Aug 14, 2025
    Authors
    Safrina Kabir
    Description

    augmentation

  11. h

    HAM10000

    • huggingface.co
    Updated Feb 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PRANAY D KUMAR (2025). HAM10000 [Dataset]. https://huggingface.co/datasets/pranay-43/HAM10000
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 13, 2025
    Authors
    PRANAY D KUMAR
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    pranay-43/HAM10000 dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. f

    HAM10000 expert-annotated explanations and pseudonymized reader study data.

    • figshare.com
    txt
    Updated Jul 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tirtha Chanda (2025). HAM10000 expert-annotated explanations and pseudonymized reader study data. [Dataset]. http://doi.org/10.6084/m9.figshare.23766477.v9
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 14, 2025
    Dataset provided by
    figshare
    Authors
    Tirtha Chanda
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Reader study was conducted in 3 phases: no ai (phase 1), ai support (phase 2), xai support (phase 3). The data generated in each phase is available in metadata_phase1.csv, metadata_phase2.csv, and metadata_phase3.csv.There are 113 unique participants in phase 1. Additional participants were added in phase 2, resulting in 116 unique clinicians in phases 2 and 3.Important: The 3rd and 13th image in each group are identical. Be careful when performing table joins as the duplicate image_ids can affect them. In metadata_phase1.csv, the AI predictions for the 13th image in each group are null. Please take that into account when performing analysis. In metadata_phase2 and metadata_phase3, the AI predictions for the repeating images are not omitted.participant: Each clinician was assigned a participant Id represented by the participant column.group: Each clinician was randomly assigned to a group. Each group was assigned mutually exclusive sets of images.mask: An internal identifier used for the images. Can be ignored.benign_malignant: ground truth diagnosis.prediction: Diagnosis chosen by the clinician. 1 represents melanoma, 0 represents nevus, 0.5 represents a nevus diagnosis but the clinician chose to excise.confidence: Confidence value entered by the clinician.trust: Trust value entered by the clinician.AI_prediction: Diagnosis predicted by the AI. 1 represents melanoma and 0 represents nevus. language: Language chosen by the clinician.

  13. HAM10000 Lesion Segmentations

    • kaggle.com
    Updated Jul 2, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    chdlr (2020). HAM10000 Lesion Segmentations [Dataset]. https://www.kaggle.com/datasets/tschandl/ham10000-lesion-segmentations/
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 2, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    chdlr
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Context

    Dermatoscopic images usually depict a single skin lesion, but large scale datasets with available segmentations of affected areas are not available until now. Challenge segmentation data often suffered from being either too coarse or too noisy. This dataset provides 10015 binary segmentation masks based on FCN-created segmentations and hand-drawn lines, which together with the HAM10000 diagnosis metadata can be used for object detection or semantic segmentation.

    Content

    This dataset contains binary segmentation masks as PNG-files of all HAM10000 dataset images. The area segments lesion area as evaluated by a single dermatologist (me). They were initiated with a FCN lesion segmentation model, where afterwards I went through all of them and either approved them, or corrected / redrew them with the free-hand selection tool in FIJI.

    You can find the HAM10000 dataset images at the following places: - Harvard Dataverse: https://doi.org/10.7910/DVN/DBW86T - ISIC Archive Gallery: https://www.isic-archive.com - Kaggle Dataset Kernel (downsampled): https://www.kaggle.com/kmader/skin-cancer-mnist-ham10000

    Acknowledgements

    If you use this data, please cite/refer to the publication I made these segmentation masks for...

    ...and the original source of the images:

  14. t

    HAM10000 dataset

    • service.tib.eu
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). HAM10000 dataset [Dataset]. https://service.tib.eu/ldmservice/dataset/ham10000-dataset
    Explore at:
    Dataset updated
    Dec 2, 2024
    Description

    Medical image segmentation is vital to the area of imaging because it enables professionals to more accurately examine and understand the information offered by different imaging modalities.

  15. skin-cancer-mnist-ham10000

    • kaggle.com
    Updated May 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Henry Herrera007 (2022). skin-cancer-mnist-ham10000 [Dataset]. https://www.kaggle.com/datasets/henryherrera007/skin-cancer-mnist-ham10000
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 13, 2022
    Dataset provided by
    Kaggle
    Authors
    Henry Herrera007
    Description

    Dataset

    This dataset was created by Henry Herrera007

    Contents

  16. d

    Introducing the Segmented HAM10000 (\"Segmented Human Against Machine with...

    • search.dataone.org
    Updated Sep 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nishant Darji; Kiran Patel; Deep Patel; Dr. Yagnesh Vyas (2024). Introducing the Segmented HAM10000 (\"Segmented Human Against Machine with 10000 training images\") Dataset (Pre-processed) [Dataset]. http://doi.org/10.7910/DVN/LZJTKO
    Explore at:
    Dataset updated
    Sep 24, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Nishant Darji; Kiran Patel; Deep Patel; Dr. Yagnesh Vyas
    Description

    Contributors towards pre-processing the existing dataset from the same (dataverse.harvard.edu) platform: Students of College: U.V.Patel College of Engineering, Ganpat University, Gujarat Nishant Darji Kiran Patel Deep Patel Guide: Dr. Yagnesh Vyas During Internship at Bhaskaracharya National Institute for Space Applications and Geo-informatics (BISAG-N) Addressing the challenge of training neural networks for automated diagnosis of pigmented skin lesions, we proudly present the Segmented HAM10000 dataset. Stemming from the original HAM10000, this segmented iteration aims to further enhance the efficacy and diversity of available data for academic machine-learning endeavors. Derived from a diverse array of populations and captured through various modalities, the Segmented HAM10000 dataset boasts a comprehensive collection of 10,015 dermatoscopic images. Each image is meticulously segmented, optimizing it for precise analysis and interpretation by machine learning algorithms. Representing a spectrum of crucial diagnostic categories within pigmented lesions, our dataset encompasses Actinic keratoses and intraepithelial carcinoma / Bowen's disease (akiec), basal cell carcinoma (bcc), benign keratosis-like lesions (solar lentigines / seborrheic keratoses and lichen-planus like keratoses, bkl), dermatofibroma (df), melanoma (mel), melanocytic nevi (nv), and vascular lesions (angiomas, angiokeratomas, pyogenic granulomas, and hemorrhage, vasc). More than 50% of lesions in our dataset are confirmed through histopathology (histo), while the ground truth for the remaining cases is established through follow-up examination (follow_up), expert consensus (consensus), or confirmation by in-vivo confocal microscopy (confocal). Notably, the Segmented HAM10000 dataset includes lesions with multiple images, facilitating tracking via the lesion_id-column within the accompanying metadata file.

  17. R

    Skincanserdetection: Ham10000.v2 Dataset

    • universe.roboflow.com
    zip
    Updated Jul 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vinh HP (2025). Skincanserdetection: Ham10000.v2 Dataset [Dataset]. https://universe.roboflow.com/vinh-hp/skincanserdetection-ham10000.v2/model/9
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 27, 2025
    Dataset authored and provided by
    Vinh HP
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Canser Bounding Boxes
    Description

    SkinCanserDetection: HAM10000.v2

    ## Overview
    
    SkinCanserDetection: HAM10000.v2 is a dataset for object detection tasks - it contains Canser annotations for 8,482 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  18. h

    ham10000

    • huggingface.co
    Updated Apr 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nimra Noor (2025). ham10000 [Dataset]. https://huggingface.co/datasets/BoooomNing/ham10000
    Explore at:
    Dataset updated
    Apr 16, 2025
    Authors
    Nimra Noor
    Description

    BoooomNing/ham10000 dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. ham10000 skin cancer isic 2018 dataset

    • kaggle.com
    Updated Mar 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joy Dhar (2024). ham10000 skin cancer isic 2018 dataset [Dataset]. https://www.kaggle.com/datasets/dharjoy/ham10000-skin-cancer-isic-2018-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 27, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Joy Dhar
    Description

    Dataset

    This dataset was created by Joy Dhar

    Contents

  20. f

    Ablation experiments.

    • plos.figshare.com
    bin
    Updated Oct 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaopu Ma; Jiangdan Shan; Fei Ning; Wentao Li; He Li (2023). Ablation experiments. [Dataset]. http://doi.org/10.1371/journal.pone.0293266.t003
    Explore at:
    binAvailable download formats
    Dataset updated
    Oct 23, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xiaopu Ma; Jiangdan Shan; Fei Ning; Wentao Li; He Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Computer-aided diagnosis techniques based on deep learning in skin cancer classification have disadvantages such as unbalanced datasets, redundant information in the extracted features and ignored interactions of partial features among different convolutional layers. In order to overcome these disadvantages, we propose a skin cancer classification model named EFFNet, which is based on feature fusion and random forests. Firstly, the model preprocesses the HAM10000 dataset to make each category of training set images balanced by image enhancement technology. Then, the pre-training weights of the EfficientNetV2 model on the ImageNet dataset are fine-tuned on the HAM10000 skin cancer dataset. After that, an improved hierarchical bilinear pooling is introduced to capture the interactions of some features between the layers and enhance the expressive ability of features. Finally, the fused features are passed into the random forests for classification prediction. The experimental results show that the accuracy, recall, precision and F1-score of the model reach 94.96%, 93.74%, 93.16% and 93.24% respectively. Compared with other models, the accuracy rate is improved to some extent and the highest accuracy rate can be increased by about 10%.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Tschandl, Philipp; Cliff Rosendahl; Harald Kittler (2024). Skin Cancer: HAM10000 Dataset [Dataset]. https://datasetninja.com/skin-cancer-ham10000

Skin Cancer: HAM10000 Dataset

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jan 21, 2024
Dataset provided by
Dataset Ninja
Authors
Tschandl, Philipp; Cliff Rosendahl; Harald Kittler
License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

To address the challenges of training neural networks for automated diagnosis of pigmented skin lesions, the authors introduced the HAM10000 ("Human Against Machine with 10000 training images") dataset. This dataset aimed to overcome the limitations of small-sized and homogeneous dermatoscopic image datasets by providing a diverse and extensive collection. To achieve this, they collected dermatoscopic images from various populations using different modalities, which necessitated employing distinct acquisition and cleaning methods. The authors also designed semi-automatic workflows that incorporated specialized neural networks to enhance the dataset's quality. The resulting HAM10000 dataset comprised 10,015 dermatoscopic images, which were made available for academic machine learning applications through the ISIC archive. This dataset served as a benchmark for machine learning experiments and comparisons with human experts.

Search
Clear search
Close search
Google apps
Main menu