24 datasets found
  1. a

    CAMELYON16 dataset

    • academictorrents.com
    bittorrent
    Updated Jan 20, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter Bandi (2020). CAMELYON16 dataset [Dataset]. https://academictorrents.com/details/afc995065b3e7a48db6dbebf184b8dc9205103af
    Explore at:
    bittorrent(762000041626)Available download formats
    Dataset updated
    Jan 20, 2020
    Dataset authored and provided by
    Peter Bandi
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    CAMELYON16 challenge dataset. The goal of CAMELYON16 challenge is to evaluate new and existing algorithms for automated detection of metastases in hematoxylin and eosin (H&E) stained whole-slide images (WSIs) of lymph node sections. The dataset contains 270 WSIs (159 normal slides, and 111 slides with tumor) for training, and 129 WSIs for testing. The dataset is a slightly updated version of the one available on GigaScience at . The changes are: 1. The test_114.tiff WSI was exhaustively annotated. 2. Generated mask files were added for each WSI with value 1 for normal tissue, and 2 for tumor areas in the corresponding WSI.

  2. camelyon16-features

    • huggingface.co
    Updated Nov 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Owkin (2023). camelyon16-features [Dataset]. https://huggingface.co/datasets/owkin/camelyon16-features
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 15, 2023
    Dataset authored and provided by
    Owkinhttps://owkin.com/
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Card for Camelyon16-features

      Dataset Summary
    

    The Camelyon16 dataset is a very popular benchmark dataset used in the field of cancer classification.

    The dataset we've uploaded here is the result of features extracted from the Camelyon16 dataset using the Phikon model, which is also openly available on Hugging Face.

      Dataset Creation
    
    
    
    
    
    
    
      Initial Data Collection and Normalization
    

    The initial collection of the Camelyon16 Whole Slide Images… See the full description on the dataset page: https://huggingface.co/datasets/owkin/camelyon16-features.

  3. f

    Detection performance comparison with Camelyon16.

    • plos.figshare.com
    xls
    Updated Jun 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jun Ruan; Zhikui Zhu; Chenchen Wu; Guanglu Ye; Jingfan Zhou; Junqiu Yue (2023). Detection performance comparison with Camelyon16. [Dataset]. http://doi.org/10.1371/journal.pone.0251521.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Jun Ruan; Zhikui Zhu; Chenchen Wu; Guanglu Ye; Jingfan Zhou; Junqiu Yue
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Detection performance comparison with Camelyon16.

  4. h

    Camelyon16_MIL

    • huggingface.co
    Updated Jul 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    torchmil (2025). Camelyon16_MIL [Dataset]. https://huggingface.co/datasets/torchmil/Camelyon16_MIL
    Explore at:
    Dataset updated
    Jul 24, 2025
    Dataset authored and provided by
    torchmil
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    CAMELYON16 - Multiple Instance Learning (MIL)

    Important. This dataset is part of the torchmil library. This repository provides an adapted version of the CAMELYON16 dataset tailored for Multiple Instance Learning (MIL). It is designed for use with the CAMELYON16Dataset class from the torchmil library. CAMELYON16 is a widely used benchmark in MIL research, making this adaptation particularly valuable for developing and evaluating MIL models.

      About the Original CAMELYON16… See the full description on the dataset page: https://huggingface.co/datasets/torchmil/Camelyon16_MIL.
    
  5. CAMELYON16_MASKS_ANNOTATION

    • kaggle.com
    Updated Feb 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ForcewithMe (2024). CAMELYON16_MASKS_ANNOTATION [Dataset]. https://www.kaggle.com/datasets/forcewithme/camelyon16-masks-annotation/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 22, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ForcewithMe
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Overview

    CAMELYON16 contains 270 WSIs for training and 129 WSIs for test. This dataset is only a tiny part of the whole CAMELYON16. Please check the following links for other parts.

    Test

    @buttermint has uploaded the test set of CAMELYON 16. 1-20 21-40 41-60 61-80 81-100 101-130

    Training

    Tumor

    p1 p2 p3 p4 p5

    normal

    p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12

    label and meta information

    The authors of CAMELYON16 have manually annotated the region of cancer in high quality. And the order of the slides in normal part is a bit massive. All the information is in this dataset.

  6. S

    Camelyon+

    • scidb.cn
    Updated Nov 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ling Xitong; Lei Yuanyuan; Li Jiawen; Cheng Junru; Huang Wenting; Guan Tian; Guan Jian; He Yonghong (2024). Camelyon+ [Dataset]. http://doi.org/10.57760/sciencedb.16442
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 8, 2024
    Dataset provided by
    Science Data Bank
    Authors
    Ling Xitong; Lei Yuanyuan; Li Jiawen; Cheng Junru; Huang Wenting; Guan Tian; Guan Jian; He Yonghong
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The Camelyon+ dataset is accessible through ScienceDB. The original WSI data is available from the official Camelyon16 and Camelyon-17 websites, so it has not been uploaded to the database. Slide-level labels are included in XLSX files. We provide corrected versions of the Camelyon-16 and Camelyon-17 datasets, as well as a combined version of Camelyon+ with four classification labels (negative, micro, macro, ITC) and two classification labels (negative, tumor) to support different downstream tasks.To ensure unbiased data correction by pathologists, the original training dataset from Camelyon-16, originally named "tumor," "normal," and ID, has been renamed. The mapping to the original naming will be recorded and shared in an XLSX file. For positive WSIs, pixel-level annotations are provided in XML format.To enable future comparative experiments using various feature extractors on the Camelyon+ dataset, feature files extracted at 20X magnification using ResNet-50, VIT-S, PLIP, CONCH, UNI, and Gigapath are also available. These feature files are provided in PT format for easy use.

  7. CAMELYON_NORMAL1

    • kaggle.com
    Updated Feb 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ForcewithMe (2024). CAMELYON_NORMAL1 [Dataset]. https://www.kaggle.com/datasets/forcewithme/camelyon-normal1/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 16, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ForcewithMe
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Overview

    CAMELYON16 contains 270 WSIs for training and 129 WSIs for test. This dataset is only a tiny part of the whole CAMELYON16. Please check the following links for other parts.

    Test

    @buttermint has uploaded the test set of CAMELYON 16. 1-20 21-40 41-60 61-80 81-100 101-130

    Training

    Tumor

    p1 p2 p3 p4 p5

    normal

    p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12

    label and meta information

    The authors of CAMELYON16 have manually annotated the region of cancer in high quality. And the order of the slides in normal part is a bit massive. All the information is in this dataset.

  8. h

    histopathology-1024

    • huggingface.co
    Updated Feb 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cilem Afacan (2025). histopathology-1024 [Dataset]. https://huggingface.co/datasets/Cilem/histopathology-1024
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 12, 2025
    Authors
    Cilem Afacan
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for Histopathology Dataset

      Dataset Summary
    

    This dataset contains 1024x1024 patches of a group of histopathology images taken from the CAMELYON16 dataset and embedding vectors extracted from these patches using the Google Path Foundation model.

      Thumbnail of Main Slide
    
    
    
    
    
    
    
    
      Usage
    

    CAMELYON16: List of images taken from CAMELYON16 dataset: test_001.tiftest_002.tif test_003.tif test_004.tif test_005.tif test_006.tif test_007.tif… See the full description on the dataset page: https://huggingface.co/datasets/Cilem/histopathology-1024.

  9. [CAMELYON16]Tumor-896rz224-again

    • kaggle.com
    Updated Feb 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ForcewithMe (2024). [CAMELYON16]Tumor-896rz224-again [Dataset]. https://www.kaggle.com/datasets/forcewithme/camelyon16tumor-896rz224-again/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 23, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ForcewithMe
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by ForcewithMe

    Released under Apache 2.0

    Contents

  10. h

    mixed-histopathology-512

    • huggingface.co
    Updated Feb 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cilem Afacan (2025). mixed-histopathology-512 [Dataset]. https://huggingface.co/datasets/Cilem/mixed-histopathology-512
    Explore at:
    Dataset updated
    Feb 20, 2025
    Authors
    Cilem Afacan
    Description

    Dataset Card for Mixed Histopathology Dataset

      Dataset Summary
    

    This dataset contains 512x512 patches of a group of histopathology images taken from the CAMELYON16 , CANCER IMAGING ARCHIVE-KIDNEY, CANCER IMAGING ARCHIVE-COLON, CANCER IMAGING ARCHIVE-LUNG datasets and embedding vectors extracted from these patches using the Google Path Foundation model.

      Thumbnail of Main Slide
    
    
    
    
    
      Usage
    

    CAMELYON16: List of images taken from CAMELYON16 dataset:… See the full description on the dataset page: https://huggingface.co/datasets/Cilem/mixed-histopathology-512.

  11. [CAMELYON16]256-Tumor-P2-2

    • kaggle.com
    Updated Feb 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ForcewithMe (2024). [CAMELYON16]256-Tumor-P2-2 [Dataset]. https://www.kaggle.com/datasets/forcewithme/camelyon16256-tumor-p2-2
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 29, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ForcewithMe
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by ForcewithMe

    Released under Apache 2.0

    Contents

  12. [CAMELYON16]256-Test-P6-1

    • kaggle.com
    Updated Mar 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ForcewithMe (2024). [CAMELYON16]256-Test-P6-1 [Dataset]. https://www.kaggle.com/datasets/forcewithme/camelyon16256-test-p6-1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 1, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ForcewithMe
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by ForcewithMe

    Released under Apache 2.0

    Contents

  13. f

    Mean individual patch AUC and WSI averaged AUC.

    • plos.figshare.com
    xls
    Updated Apr 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jens Rahnfeld; Mehdi Naouar; Gabriel Kalweit; Joschka Boedecker; Estelle Dubruc; Maria Kalweit (2025). Mean individual patch AUC and WSI averaged AUC. [Dataset]. http://doi.org/10.1371/journal.pdig.0000792.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Apr 15, 2025
    Dataset provided by
    PLOS Digital Health
    Authors
    Jens Rahnfeld; Mehdi Naouar; Gabriel Kalweit; Joschka Boedecker; Estelle Dubruc; Maria Kalweit
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Recent advancements in deep learning have shown promise in enhancing the performance of medical image analysis. In pathology, automated whole slide imaging has transformed clinical workflows by streamlining routine tasks and diagnostic and prognostic support. However, the lack of transparency of deep learning models, often described as black boxes, poses a significant barrier to their clinical adoption. This study evaluates various explainability methods for Vision Transformers, assessing their effectiveness in explaining the rationale behind their classification predictions on histopathological images. Using a Vision Transformer trained on the publicly available CAMELYON16 dataset comprising of 399 whole slide images of lymph node metastases of patients with breast cancer, we conducted a comparative analysis of a diverse range of state-of-the-art techniques for generating explanations through heatmaps, including Attention Rollout, Integrated Gradients, RISE, and ViT-Shapley. Our findings reveal that Attention Rollout and Integrated Gradients are prone to artifacts, while RISE and particularly ViT-Shapley generate more reliable and interpretable heatmaps. ViT-Shapley also demonstrated faster runtime and superior performance in insertion and deletion metrics. These results suggest that integrating ViT-Shapley-based heatmaps into pathology reports could enhance trust and scalability in clinical workflows, facilitating the adoption of explainable artificial intelligence in pathology.

  14. [CAMELYON16]256-Test-P3-1

    • kaggle.com
    Updated Mar 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ForcewithMe (2024). [CAMELYON16]256-Test-P3-1 [Dataset]. https://www.kaggle.com/datasets/forcewithme/camelyon16256-test-p3-1/suggestions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 1, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ForcewithMe
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by ForcewithMe

    Released under Apache 2.0

    Contents

  15. The classifier detection performance.

    • plos.figshare.com
    xls
    Updated Jun 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jun Ruan; Zhikui Zhu; Chenchen Wu; Guanglu Ye; Jingfan Zhou; Junqiu Yue (2023). The classifier detection performance. [Dataset]. http://doi.org/10.1371/journal.pone.0251521.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 11, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jun Ruan; Zhikui Zhu; Chenchen Wu; Guanglu Ye; Jingfan Zhou; Junqiu Yue
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The classifier detection performance.

  16. [CAMELYON16]Normal-896rz224

    • kaggle.com
    Updated Feb 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ForcewithMe (2024). [CAMELYON16]Normal-896rz224 [Dataset]. https://www.kaggle.com/datasets/forcewithme/camelyon16normal-896rz224/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 23, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ForcewithMe
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by ForcewithMe

    Released under Apache 2.0

    Contents

  17. g

    Supporting data for "1399 H&E-stained sentinel lymph node sections of breast...

    • gigadb.org
    • aspera.gigadb.org
    Updated May 21, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). Supporting data for "1399 H&E-stained sentinel lymph node sections of breast cancer patients: the CAMELYON dataset" [Dataset]. http://doi.org/10.5524/100439
    Explore at:
    Dataset updated
    May 21, 2018
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    The presence of lymph node metastases is one of the most important factors in breast cancer prognosis. The most common strategy to assess the regional lymph node status is the sentinel lymph node procedure. The sentinel lymph node is the most likely lymph node to contain metastasized cancer cells and is excised, histopathologically processed and examined by the pathologist. This tedious examination process is time-consuming and can lead to small metastases being missed. However, recent advances in whole-slide imaging and machine learning have opened an avenue for analysis of digitized lymph node sections with computer algorithms. For example, convolutional neural networks, a type of machine learning algorithm, are able to automatically detect cancer metastases in lymph nodes with high accuracy. To train machine learning models, large, well-curated datasets are needed. We released a dataset of 1399 annotated whole-slide images of lymph nodes, both with and without metastases, in total three terabytes of data in the context of the CAMELYON16 and CAMELYON17 Grand Challenges. Slides were collected from five different medical centers to cover a broad range of image appearance and staining variations. Each whole-slide image has a slide-level label indicating whether it contains no metastases, macro-metastases, micro-metastases or isolated tumor cells. Furthermore, for 209 whole-slide images, detailed hand-drawn contours for all metastases are provided. Last, open-source software tools to visualize and interact with the data have been made available. A unique dataset of annotated, whole-slide digital histopathology images has been provided with high potential for re-use.

  18. Z

    Linking provenance and its metadata for an AI-based computation using CPM...

    • data.niaid.nih.gov
    • zenodo.org
    • +1more
    Updated Dec 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wittner, Rudolf (2023). Linking provenance and its metadata for an AI-based computation using CPM and RO-Crate [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7924182
    Explore at:
    Dataset updated
    Dec 1, 2023
    Dataset provided by
    Soiland-Reyes, Stian
    Wittner, Rudolf
    Gallo, Matej
    Leo, Simone
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is a prototype implementation of a mechanism for linking provenance information and its metadata, also called provenance of provenance or meta-provenance. This dataset is an RO-Crate that bundles artifacts of an AI-based computational pipeline. The resulting RO-Crate contains (directly or by a reference) artifacts of the pipeline execution, such as input dataset, intermediate and final results, configuration files, pipeline implementation, log files, or provenance files. The RO-Crate is based on the CPM RO-Crate profile, which integrates the Common Provenance Model (CPM) and Process Run Crate profile. The description of the AI pipeline and an explanation of how the CPM RO-Crate profile is applied to bundle the pipeline execution artifacts is provided in our previous work.

    As this dataset aims to demonstrate the mechanism for linking provenance and meta-provenance, the input dataset used for the AI model training and testing is reduced only to a few images, as the size of the input dataset does not affect the mechanism. The images used in the input are from the Camelyon16 dataset.

  19. f

    The pixel-level detection performance on different sampling algorithms with...

    • plos.figshare.com
    xls
    Updated Jun 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jun Ruan; Zhikui Zhu; Chenchen Wu; Guanglu Ye; Jingfan Zhou; Junqiu Yue (2023). The pixel-level detection performance on different sampling algorithms with DMC classifier. [Dataset]. http://doi.org/10.1371/journal.pone.0251521.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Jun Ruan; Zhikui Zhu; Chenchen Wu; Guanglu Ye; Jingfan Zhou; Junqiu Yue
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The pixel-level detection performance on different sampling algorithms with DMC classifier.

  20. f

    Data Sheet 1_AI-augmented pathology: the experience of transfer learning and...

    • frontiersin.figshare.com
    pdf
    Updated Jun 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuel Cossio; Nina Wiedemann; Esther Sanfeliu Torres; Esther Barnadas Sole; Laura Igual (2025). Data Sheet 1_AI-augmented pathology: the experience of transfer learning and intra-domain data diversity in breast cancer metastasis detection.pdf [Dataset]. http://doi.org/10.3389/fonc.2025.1598289.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 11, 2025
    Dataset provided by
    Frontiers
    Authors
    Manuel Cossio; Nina Wiedemann; Esther Sanfeliu Torres; Esther Barnadas Sole; Laura Igual
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundMetastatic detection in sentinel lymph nodes remains a crucial prognostic factor in breast cancer management, with accurate and timely diagnosis directly impacting treatment decisions. While traditional histopathological assessment relies on microscopic examination of stained tissues, the digitization of slides as whole-slide images (WSI) has enabled the development of computer-aided diagnostic systems. These automated approaches offer potential improvements in detection consistency and efficiency compared to conventional methods.ResultsThis study leverages transfer learning on hematoxylin and eosin (HE) WSIs to achieve computationally efficient metastasis detection without compromising accuracy. We propose an approach for generating segmentation masks by transferring spatial annotations from immunohistochemistry (IHC) WSIs to corresponding H&E slides. Using these masks, four distinct datasets were constructed to fine-tune a pretrained ResNet50 model across eight different configurations, incorporating varied dataset combinations and data augmentation techniques. To enhance interpretability, we developed a visualization tool that employs color-coded probability maps to highlight tumor regions alongside their prediction confidence. Our experiments demonstrated that integrating an external dataset (Camelyon16) during training significantly improved model performance, surpassing the benefits of data augmentation alone. The optimal model, trained on both external and local data, achieved an accuracy and F1-score of 0.98, outperforming existing state-of-the-art methods.ConclusionsThis study demonstrates that transfer learning architectures, when enhanced with multi-source data integration and interpretability frameworks, can significantly improve metastatic detection in whole slide imaging. Our methodology achieves diagnostic performance comparable to gold-standard techniques while dramatically accelerating analytical workflows. The synergistic combination of external dataset incorporation and probabilistic visualization outputs provides a clinically actionable solution that maintains both computational efficiency and pathological interpretability.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Peter Bandi (2020). CAMELYON16 dataset [Dataset]. https://academictorrents.com/details/afc995065b3e7a48db6dbebf184b8dc9205103af

CAMELYON16 dataset

Explore at:
bittorrent(762000041626)Available download formats
Dataset updated
Jan 20, 2020
Dataset authored and provided by
Peter Bandi
License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Description

CAMELYON16 challenge dataset. The goal of CAMELYON16 challenge is to evaluate new and existing algorithms for automated detection of metastases in hematoxylin and eosin (H&E) stained whole-slide images (WSIs) of lymph node sections. The dataset contains 270 WSIs (159 normal slides, and 111 slides with tumor) for training, and 129 WSIs for testing. The dataset is a slightly updated version of the one available on GigaScience at . The changes are: 1. The test_114.tiff WSI was exhaustively annotated. 2. Generated mask files were added for each WSI with value 1 for normal tissue, and 2 for tumor areas in the corresponding WSI.

Search
Clear search
Close search
Google apps
Main menu