44 datasets found
  1. Forgery Detection: Segment Anything with SAM2

    • kaggle.com
    zip
    Updated Nov 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jirka Borovec (2025). Forgery Detection: Segment Anything with SAM2 [Dataset]. https://www.kaggle.com/datasets/jirkaborovec/forgery-segment-sam2-npz
    Explore at:
    zip(40460512 bytes)Available download formats
    Dataset updated
    Nov 26, 2025
    Authors
    Jirka Borovec
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The Segment Anything Model (SAM) and its successor, SAM2, are highly influential foundation models in the field of computer vision, specifically designed for promptable image segmentation.

    What SAM Is

    • Foundation Model for Segmentation: SAM is a large, general-purpose model trained on a massive dataset to perform image segmentation. Its goal is to segment any object in any image, even those it hasn't seen before.
    • Prompt-Based Segmentation: Its core feature is the ability to generate a high-quality segmentation mask based on a minimal user "prompt." These prompts can include:
      • Clicks: The user clicks a point on an object, and SAM generates the mask for that entire object.
      • Bounding Boxes: The user draws a box around an area, and SAM refines it into a precise mask.
      • Text or Scribbles: (In more advanced versions like SAM2) The model can generate masks based on more complex inputs.

    Context in the Forgery Detection Competition

    In the context of the Kaggle competition, the use of SAM/SAM2 is for Candidate Generation:

    • The Problem: The competition, "Recod.ai/LUC - Scientific Image Forgery Detection," requires models to not only detect a forgery (like a copy-move manipulation in a biomedical image) but also to segment the manipulated region at the pixel level.
    • SAM/SAM2's Role: SAM is a powerful tool for generating segmentation masks for any prominent object or region in an image. The notebook you linked, "Forgery using SAM2 for Candidate Generation," leverages this ability to:
      1. Run SAM2 over the forged images to automatically generate a large collection of general segmentation masks (the "candidates").
      2. The idea is that the actual forged regions, which often involve copying and pasting distinct objects or image patches, are likely to be contained within or closely align with one of these automatically generated SAM/SAM2 masks.
    • Aiding Forgery Detection: By using SAM2, the notebook transforms the problem from searching for a forgery anywhere in the image to classifying and refining a smaller, more manageable set of high-quality object masks. This can significantly simplify the task for the final forgery-specific model.
  2. R

    Sam2 Instance Seg Dataset

    • universe.roboflow.com
    zip
    Updated Nov 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Land project (2025). Sam2 Instance Seg Dataset [Dataset]. https://universe.roboflow.com/land-project/sam2-instance-seg-ycirl/model/2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 14, 2025
    Dataset authored and provided by
    Land project
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Building Polygons
    Description

    Sam2 Instance Seg

    ## Overview
    
    Sam2 Instance Seg is a dataset for instance segmentation tasks - it contains Building annotations for 275 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  3. R

    Sam2 Yolo11 Dataset

    • universe.roboflow.com
    zip
    Updated Nov 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yolospoof (2025). Sam2 Yolo11 Dataset [Dataset]. https://universe.roboflow.com/yolospoof/sam2-yolo11
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 4, 2025
    Dataset authored and provided by
    Yolospoof
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Beet Carrot Polygons
    Description

    Sam2 Yolo11

    ## Overview
    
    Sam2 Yolo11 is a dataset for instance segmentation tasks - it contains Beet Carrot annotations for 335 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  4. h

    SAM2-based-plant-disease-lesion-segmentation-model-DATASET

    • huggingface.co
    Updated Nov 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sangjin Go (2025). SAM2-based-plant-disease-lesion-segmentation-model-DATASET [Dataset]. https://huggingface.co/datasets/godoldol99/SAM2-based-plant-disease-lesion-segmentation-model-DATASET
    Explore at:
    Dataset updated
    Nov 29, 2025
    Authors
    Sangjin Go
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    godoldol99/SAM2-based-plant-disease-lesion-segmentation-model-DATASET dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. D

    Data from: Unlocking the Power of SAM 2 for Few-Shot Segmentation

    • researchdata.ntu.edu.sg
    Updated May 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qianxiong Xu; Qianxiong Xu; Lanyun Zhu; Lanyun Zhu; Xuanyi Liu; Xuanyi Liu; Guosheng Lin; Guosheng Lin; Cheng Long; Cheng Long; Ziyue Li; Ziyue Li; Rui Zhao; Rui Zhao (2025). Unlocking the Power of SAM 2 for Few-Shot Segmentation [Dataset]. http://doi.org/10.21979/N9/XIDXVT
    Explore at:
    Dataset updated
    May 22, 2025
    Dataset provided by
    DR-NTU (Data)
    Authors
    Qianxiong Xu; Qianxiong Xu; Lanyun Zhu; Lanyun Zhu; Xuanyi Liu; Xuanyi Liu; Guosheng Lin; Guosheng Lin; Cheng Long; Cheng Long; Ziyue Li; Ziyue Li; Rui Zhao; Rui Zhao
    License

    https://researchdata.ntu.edu.sg/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.21979/N9/XIDXVThttps://researchdata.ntu.edu.sg/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.21979/N9/XIDXVT

    Dataset funded by
    RIE2020 Industry Alignment Fund– Industry Collaboration Projects (IAF-ICP) Funding Initiative
    Description

    Few-Shot Segmentation (FSS) aims to learn class-agnostic segmentation on few classes to segment arbitrary classes, but at the risk of overfitting. To address this, some methods use the well-learned knowledge of foundation models (e.g., SAM) to simplify the learning process. Recently, SAM 2 has extended SAM by supporting video segmentation, whose class-agnostic matching ability is useful to FSS. A simple idea is to encode support foreground (FG) features as memory, with which query FG features are matched and fused. Unfortunately, the FG objects in different frames of SAM 2's video data are always the same identity, while those in FSS are different identities, i.e., the matching step is incompatible. Therefore, we design Pseudo Prompt Generator to encode pseudo query memory, matching with query features in a compatible way. However, the memories can never be as accurate as the real ones, i.e., they are likely to contain incomplete query FG, but some unexpected query background (BG) features, leading to wrong segmentation. Hence, we further design Iterative Memory Refinement to fuse more query FG features into the memory, and devise a Support-Calibrated Memory Attention to suppress the unexpected query BG features in memory. Extensive experiments have been conducted on PASCAL-5i and COCO-20i to validate the effectiveness of our design, e.g., the 1-shot mIoU can be 4.2% better than the best baseline.

  6. SAM2 segmentation test and comparison with manual segmentation

    • figshare.com
    png
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Killian Verlingue (2025). SAM2 segmentation test and comparison with manual segmentation [Dataset]. http://doi.org/10.6084/m9.figshare.29136194.v1
    Explore at:
    pngAvailable download formats
    Dataset updated
    May 23, 2025
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Killian Verlingue
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Visual comparison of 100 human annotations (labels) compared with Segment Anything Model 2 (SAM2) segmentation.

  7. R

    Sam2 Labelling Dataset

    • universe.roboflow.com
    zip
    Updated Oct 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SAM2 Labelling (2025). Sam2 Labelling Dataset [Dataset]. https://universe.roboflow.com/sam2-labelling/sam2-labelling/model/2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 28, 2025
    Dataset authored and provided by
    SAM2 Labelling
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Bleeding Crakcs Polygons
    Description

    SAM2 Labelling

    ## Overview
    
    SAM2 Labelling is a dataset for instance segmentation tasks - it contains Bleeding Crakcs annotations for 1,916 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  8. Leaf Segmentation Dataset - SAM2 Format

    • kaggle.com
    zip
    Updated Jan 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ankan Ghosh (2025). Leaf Segmentation Dataset - SAM2 Format [Dataset]. https://www.kaggle.com/datasets/ankanghosh651/leaf-sengmentation-dataset-sam2-format
    Explore at:
    zip(53973530 bytes)Available download formats
    Dataset updated
    Jan 24, 2025
    Authors
    Ankan Ghosh
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The dataset contains:

    .
    β”œβ”€β”€ images
    β”œβ”€β”€ masks
    └── train.csv
    
    • Images – This folder contains 588 RGB images showcasing various types of leaf diseases.
    • Masks – This folder holds 588 RGBA segmentation masks, where the diseased regions of the leaves are annotated.
    • train.csv – A CSV file that maps each image to its corresponding segmentation mask, ensuring proper indexing for SAM2 training.

    Original Dataset - kaggle.com/datasets/sovitrath/leaf-disease-segmentation

  9. R

    Sam2 Vegatables Dataset

    • universe.roboflow.com
    zip
    Updated Nov 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yolospoof (2024). Sam2 Vegatables Dataset [Dataset]. https://universe.roboflow.com/yolospoof/sam2-vegatables
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 12, 2024
    Dataset authored and provided by
    Yolospoof
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Vegatable 10 Polygons
    Description

    Sam2 Vegatables

    ## Overview
    
    Sam2 Vegatables is a dataset for instance segmentation tasks - it contains Vegatable 10 annotations for 319 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  10. h

    VIRESET

    • huggingface.co
    Updated Mar 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HaojieZheng (2025). VIRESET [Dataset]. https://huggingface.co/datasets/suimu/VIRESET
    Explore at:
    Dataset updated
    Mar 11, 2025
    Authors
    HaojieZheng
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    VIRESET

    VIRESET is a high-quality video instance editing dataset that provides temporally consistent and precise instance masks. Built upon the foundation of SA-V, VIRESET leverages the pretrained SAM-2 model to enhance the mask annotations from 6 FPS to 24 FPS, further enriched with detailed prompt-based annotations using PLLaVA. This dataset is used in the paper VIRES: Video Instance Repainting with Sketch and Text Guidance. Project page Code: https://github.com/suimuc/VIRES The… See the full description on the dataset page: https://huggingface.co/datasets/suimu/VIRESET.

  11. Sam2 Sar Dataset

    • universe.roboflow.com
    zip
    Updated Oct 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FZU (2025). Sam2 Sar Dataset [Dataset]. https://universe.roboflow.com/fzu-kmle9/sam2-sar-dujhu
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 25, 2025
    Dataset provided by
    Institute of Physics of the Czech Academy of Scienceshttps://www.fzu.cz/
    Authors
    FZU
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Ship Polygons
    Description

    Sam2 Sar

    ## Overview
    
    Sam2 Sar is a dataset for instance segmentation tasks - it contains Ship annotations for 232 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  12. R

    Sam2 Food Dataset

    • universe.roboflow.com
    zip
    Updated Aug 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    1 (2025). Sam2 Food Dataset [Dataset]. https://universe.roboflow.com/1-sbhel/sam2-food-dataset-tnjhv/model/3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 24, 2025
    Dataset authored and provided by
    1
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Food Bounding Boxes
    Description

    Sam2 Food Dataset

    ## Overview
    
    Sam2 Food Dataset is a dataset for object detection tasks - it contains Food annotations for 1,017 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  13. h

    segment-images-sam2

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wanghaonan, segment-images-sam2 [Dataset]. https://huggingface.co/datasets/stark2000s/segment-images-sam2
    Explore at:
    Authors
    wanghaonan
    Description

    stark2000s/segment-images-sam2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. SAM2-UNet_file

    • kaggle.com
    zip
    Updated Nov 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    liukunshan2 (2024). SAM2-UNet_file [Dataset]. https://www.kaggle.com/datasets/liukunshan2/sam2-unet-file/code
    Explore at:
    zip(611 bytes)Available download formats
    Dataset updated
    Nov 30, 2024
    Authors
    liukunshan2
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by liukunshan2

    Released under MIT

    Contents

  15. R

    Sam2.1 Dataset

    • universe.roboflow.com
    zip
    Updated Sep 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    workplace (2025). Sam2.1 Dataset [Dataset]. https://universe.roboflow.com/workplace-da74u/sam2.1-otsrv/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 1, 2025
    Dataset authored and provided by
    workplace
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Segmenatation_scratch Polygons
    Description

    Sam2.1

    ## Overview
    
    Sam2.1 is a dataset for instance segmentation tasks - it contains Segmenatation_scratch annotations for 276 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  16. Results of AI segmentations and cell files research Part.1

    • figshare.com
    png
    Updated May 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Killian Verlingue (2025). Results of AI segmentations and cell files research Part.1 [Dataset]. http://doi.org/10.6084/m9.figshare.29108438.v1
    Explore at:
    pngAvailable download formats
    Dataset updated
    May 20, 2025
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Killian Verlingue
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These figures are the graphical results of my Master 2 internship on automatic segmentation using SAM2(Segment Anything Model 2)an artificial intelligence. The red line represents the best cell line from which anatomical measurements were made.

  17. e

    GRIME AI Water Segmentation Model for the USGS Monitoring Site East Branch...

    • portal.edirepository.org
    csv, zip
    Updated Sep 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Troy Gilmore; John Stranzl, Jr.; Mary Harner; Keegan Johnson; Chris Terry; Maggie Wells; Mackenzie Smith; Dawson Kosmicki; Jamila Bajelan; Jahir Uddin; Pavan Guggilla (2025). GRIME AI Water Segmentation Model for the USGS Monitoring Site East Branch Brandywine Creek below Downingtown, PA, 2023-2024 [Dataset]. http://doi.org/10.6073/pasta/23719fc153c42199cba32fafcd010ed8
    Explore at:
    zip(975568999 byte), csv(1531 byte)Available download formats
    Dataset updated
    Sep 24, 2025
    Dataset provided by
    EDI
    Authors
    Troy Gilmore; John Stranzl, Jr.; Mary Harner; Keegan Johnson; Chris Terry; Maggie Wells; Mackenzie Smith; Dawson Kosmicki; Jamila Bajelan; Jahir Uddin; Pavan Guggilla
    Time period covered
    Feb 9, 2023 - Dec 29, 2024
    Area covered
    Description

    Ground-based observations from fixed-mount cameras have the potential to fill an important role in environmental sensing, including direct measurement of water levels and qualitative observation of ecohydrological research sites. All of this is theoretically possible for anyone who can install a trail camera. Easy acquisition of ground-based imagery has resulted in millions of environmental images stored, some of which are public data, and many of which contain information that has yet to be used for scientific purposes. The goal of this project was to develop and document key image processing and machine learning workflows, primarily related to semi-automated image labeling, to increase the use and value of existing and emerging archives of imagery that is relevant to ecohydrological processes.

       This data package includes imagery, annotation files, water segmentation model and model performance plots, and model test results (overlay images and masks) for USGS Monitoring Site East Branch Brandywine Creek below Downingtown, PA. All imagery was acquired from the USGS Hydrologic Imagery Visualization and Information System (HIVIS; see https://apps.usgs.gov/hivis/camera/PA_East_Branch_Brandywine_Creek_below_Downingtown for this specific data set) and/or the National Imagery Management System (NIMS) API.
    
       Water segmentation models were created by tuning the open-source Segment Anything Model 2 (SAM2, https://github.com/facebookresearch/sam2) using images that were annotated by team members on this project. The models were trained on the "water" annotations, but annotation files may include additional labels, such as "snow", "sky", and "unknown". Image annotation was done in Computer Vision Annotation Tool (CVAT) and exported in COCO format (.json).
    
       All model training and testing was completed in GaugeCam Remote Image Manager Educational Artificial Intelligence (GRIME AI, https://gaugecam.org/) software (Version: Beta 16). Model performance plots were automatically generated during this process.
    
       This project was conducted in 2023-2025 by collaborators at the University of Nebraska-Lincoln, University of Nebraska at Kearney, and the U.S. Geological Survey.
    
       This material is based upon work supported by the U.S. Geological Survey under Grant/Cooperative Agreement No. G23AC00141-00. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the opinions or policies of the U.S. Geological Survey. Mention of trade names or commercial products does not constitute their endorsement by the U.S. Geological Survey. We gratefully acknowledge graduate student support from Daugherty Water for Food Global Institute at the University of Nebraska.
    
  18. R

    Traffic Signs Sam2 Dataset

    • universe.roboflow.com
    zip
    Updated Jan 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IdeassionMani (2025). Traffic Signs Sam2 Dataset [Dataset]. https://universe.roboflow.com/ideassionmani/traffic-signs-sam2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 15, 2025
    Dataset authored and provided by
    IdeassionMani
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Traffic Signs Polygons
    Description

    Traffic Signs Sam2

    ## Overview
    
    Traffic Signs Sam2 is a dataset for instance segmentation tasks - it contains Traffic Signs annotations for 2,031 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  19. Kits23 Dataset

    • kaggle.com
    Updated Feb 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pawankumar1246 (2025). Kits23 Dataset [Dataset]. https://www.kaggle.com/datasets/pawankumar1246/sample
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 10, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Pawankumar1246
    Description

    Dataset

    This dataset was created by Pawankumar1246

    Contents

  20. sam2-fixtures

    • huggingface.co
    Updated Jun 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugging Face Internal Testing Organization (2025). sam2-fixtures [Dataset]. https://huggingface.co/datasets/hf-internal-testing/sam2-fixtures
    Explore at:
    Dataset updated
    Jun 24, 2025
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face Internal Testing Organization
    Description

    hf-internal-testing/sam2-fixtures dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jirka Borovec (2025). Forgery Detection: Segment Anything with SAM2 [Dataset]. https://www.kaggle.com/datasets/jirkaborovec/forgery-segment-sam2-npz
Organization logo

Forgery Detection: Segment Anything with SAM2

Generated instance segmentation of all object in the image...

Explore at:
zip(40460512 bytes)Available download formats
Dataset updated
Nov 26, 2025
Authors
Jirka Borovec
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

The Segment Anything Model (SAM) and its successor, SAM2, are highly influential foundation models in the field of computer vision, specifically designed for promptable image segmentation.

What SAM Is

  • Foundation Model for Segmentation: SAM is a large, general-purpose model trained on a massive dataset to perform image segmentation. Its goal is to segment any object in any image, even those it hasn't seen before.
  • Prompt-Based Segmentation: Its core feature is the ability to generate a high-quality segmentation mask based on a minimal user "prompt." These prompts can include:
    • Clicks: The user clicks a point on an object, and SAM generates the mask for that entire object.
    • Bounding Boxes: The user draws a box around an area, and SAM refines it into a precise mask.
    • Text or Scribbles: (In more advanced versions like SAM2) The model can generate masks based on more complex inputs.

Context in the Forgery Detection Competition

In the context of the Kaggle competition, the use of SAM/SAM2 is for Candidate Generation:

  • The Problem: The competition, "Recod.ai/LUC - Scientific Image Forgery Detection," requires models to not only detect a forgery (like a copy-move manipulation in a biomedical image) but also to segment the manipulated region at the pixel level.
  • SAM/SAM2's Role: SAM is a powerful tool for generating segmentation masks for any prominent object or region in an image. The notebook you linked, "Forgery using SAM2 for Candidate Generation," leverages this ability to:
    1. Run SAM2 over the forged images to automatically generate a large collection of general segmentation masks (the "candidates").
    2. The idea is that the actual forged regions, which often involve copying and pasting distinct objects or image patches, are likely to be contained within or closely align with one of these automatically generated SAM/SAM2 masks.
  • Aiding Forgery Detection: By using SAM2, the notebook transforms the problem from searching for a forgery anywhere in the image to classifying and refining a smaller, more manageable set of high-quality object masks. This can significantly simplify the task for the final forgery-specific model.
Search
Clear search
Close search
Google apps
Main menu