100+ datasets found
  1. Monochrome Image dataset

    • kaggle.com
    zip
    Updated Dec 5, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sarthak1799 (2020). Monochrome Image dataset [Dataset]. https://www.kaggle.com/datasets/sarthak1799/monochrome-image-dataset
    Explore at:
    zip(129142623 bytes)Available download formats
    Dataset updated
    Dec 5, 2020
    Authors
    Sarthak1799
    Description

    Context

    Richard Zhang's image colourizer model trained on the ImageNet dataset which converts grayscale images to colour images using the L channel of the Lab colour space. The dataset contains necessary files for loading the model and Grayscale images for Image colourization.

    Content

    Dataset contains necessary files for loading the model and Grayscale images for Image colourization.

    Acknowledgements

    Richard Zhang who created the model back in 2016 has made this dataset possible.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  2. Image Data (Object Detection and Captioning)

    • kaggle.com
    Updated Apr 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arunesh (2024). Image Data (Object Detection and Captioning) [Dataset]. https://www.kaggle.com/datasets/aruneshhh/object-detection-images
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 15, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Arunesh
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    🌟 Unlock the potential of advanced computer vision tasks with our comprehensive dataset comprising 15,000 high-quality images. Whether you're delving into segmentation, object detection, or image captioning, our dataset offers a diverse array of visual data to fuel your machine learning models.

    🔍 Our dataset is meticulously curated to encompass a wide range of streams, ensuring versatility and applicability across various domains. From natural landscapes to urban environments, from wildlife to everyday objects, our collection captures the richness and diversity of visual content.

    📊 Dataset Overview:

    Total ImagesTraining Set (70%)Testing Set (30%)
    15,00010,5004,500

    🔢 Image Details:

    • Format: JPG
    • Size Range: Approximately 150 to 300 KB per image

    Embark on your computer vision journey and leverage our dataset to develop cutting-edge algorithms, advance research, and push the boundaries of what's possible in visual recognition tasks. Join us in shaping the future of AI-powered image analysis.

  3. h

    new-image-dataset

    • huggingface.co
    Updated Oct 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yusuf Ansari (2023). new-image-dataset [Dataset]. https://huggingface.co/datasets/yusuf802/new-image-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 21, 2023
    Authors
    Yusuf Ansari
    Description

    Dataset Card for "new-image-dataset"

    More Information needed

  4. T

    open_images_v4

    • tensorflow.org
    • opendatalab.com
    Updated Jun 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). open_images_v4 [Dataset]. https://www.tensorflow.org/datasets/catalog/open_images_v4
    Explore at:
    Dataset updated
    Jun 1, 2024
    Description

    Open Images is a dataset of ~9M images that have been annotated with image-level labels and object bounding boxes.

    The training set of V4 contains 14.6M bounding boxes for 600 object classes on 1.74M images, making it the largest existing dataset with object location annotations. The boxes have been largely manually drawn by professional annotators to ensure accuracy and consistency. The images are very diverse and often contain complex scenes with several objects (8.4 per image on average). Moreover, the dataset is annotated with image-level labels spanning thousands of classes.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('open_images_v4', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/open_images_v4-original-2.0.0.png" alt="Visualization" width="500px">

  5. i

    Stop sign Image Classification Dataset

    • images.cv
    zip
    Updated Dec 19, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Stop sign Image Classification Dataset [Dataset]. https://images.cv/dataset/stop-sign-image-classification-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 19, 2021
    License

    https://images.cv/licensehttps://images.cv/license

    Description

    Labeled Stop sign images suitable for training and evaluating computer vision and deep learning models.

  6. o

    The Massively Multilingual Image Dataset (MMID)

    • registry.opendata.aws
    Updated Jan 23, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Penn NLP (2019). The Massively Multilingual Image Dataset (MMID) [Dataset]. https://registry.opendata.aws/mmid/
    Explore at:
    Dataset updated
    Jan 23, 2019
    Dataset provided by
    <a href="https://github.com/penn-nlp">Penn NLP</a>
    Description

    MMID is a large-scale, massively multilingual dataset of images paired with the words they represent collected at the University of Pennsylvania. The dataset is doubly parallel: for each language, words are stored parallel to images that represent the word, and parallel to the word's translation into English (and corresponding images.)

  7. Low-quality image dataset

    • kaggle.com
    zip
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Po-Chih Wu (2025). Low-quality image dataset [Dataset]. https://www.kaggle.com/datasets/pochihwu/low-quality-image-dataset
    Explore at:
    zip(11113072986 bytes)Available download formats
    Dataset updated
    May 15, 2025
    Authors
    Po-Chih Wu
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    Due to the scarcity of suitable image datasets online related to low-quality images, we created a new dataset specifically for this purpose. The dataset can be used to develop or train models aimed at improving image quality, or serve as a benchmark dataset for evaluating the performance of computer vision on low-quality images. The image image processing code in this dataset is available at https://github.com/pochih-code/Low-quality-image-dataset

    Content

    Low-quality image dataset is based on the MS COCO 2017 validation images, with images processed into four categories, including lossy compression, image intensity, image noise and image blur. In total, the dataset comprises 100,000 processed images and is modified by humans to ensure that images are valid in the real world.

    Format

    • coco2017val image blur (25K images)
      • coco_average_4 (5K images)
      • coco_average_6 (5K images)
      • coco_average_8 (5K images)
      • coco_average_10 (5K images)
      • coco_average_12 (5K images)
    • coco2017val lossy compression (25K images)
      • coco_compressed_0 (5K images)
      • coco_compressed_20 (5K images)
      • coco_compressed_40 (5K images)
      • coco_compressed_60 (5K images)
      • coco_compressed_80 (5K images)
    • coco2017val gamma correction (25K images)
      • coco_gamma_4 (5K images)
      • coco_gamma_8 (5K images)
      • coco_gamma_12 (5K images)
      • coco_gamma_16 (5K images)
      • coco_gamma_20 (5K images)
    • coco2017val image noise (25K images)
      • coco_gaussian_10 (5K images)
      • coco_gaussian_20 (5K images)
      • coco_gaussian_30 (5K images)
      • coco_gaussian_40 (5K images)
      • coco_gaussian_50 (5K images) # Acknowledgements "https://cocodataset.org/#home">Microsoft COCO: Common Objects in Context
  8. h

    AI-Generated-vs-Real-Images-Datasets

    • huggingface.co
    Updated Aug 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hem Bahadur Gurung (2025). AI-Generated-vs-Real-Images-Datasets [Dataset]. https://huggingface.co/datasets/Hemg/AI-Generated-vs-Real-Images-Datasets
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 19, 2025
    Authors
    Hem Bahadur Gurung
    Description

    Dataset Card for "AI-Generated-vs-Real-Images-Datasets"

    More Information needed

  9. i

    Lighter Labeled Image Dataset

    • images.cv
    zip
    Updated Dec 19, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Lighter Labeled Image Dataset [Dataset]. https://images.cv/dataset/lighter-image-classification-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 19, 2021
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Labeled Lighter images suitable for AI and computer vision.

  10. i

    Pencil Image Classification Dataset

    • images.cv
    zip
    Updated Dec 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Pencil Image Classification Dataset [Dataset]. https://images.cv/dataset/pencil-image-classification-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 24, 2021
    License

    https://images.cv/licensehttps://images.cv/license

    Description

    Labeled Pencil images suitable for training and evaluating computer vision and deep learning models.

  11. h

    face-recognition-image-dataset

    • huggingface.co
    Updated Apr 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata (2025). face-recognition-image-dataset [Dataset]. https://huggingface.co/datasets/UniDataPro/face-recognition-image-dataset
    Explore at:
    Dataset updated
    Apr 15, 2025
    Authors
    Unidata
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Image Dataset of face images for compuer vision tasks

    Dataset comprises 500,600+ images of individuals representing various races, genders, and ages, with each person having a single face image. It is designed for facial recognition and face detection research, supporting the development of advanced recognition systems. By leveraging this dataset, researchers and developers can enhance deep learning models, improve face verification and face identification techniques, and refine… See the full description on the dataset page: https://huggingface.co/datasets/UniDataPro/face-recognition-image-dataset.

  12. g

    TIME – Image Dataset – Classification

    • gts.ai
    json
    Updated Apr 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GTS (2024). TIME – Image Dataset – Classification [Dataset]. https://gts.ai/dataset-download/time-image-dataset-classification/
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Apr 21, 2024
    Dataset provided by
    GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
    Authors
    GTS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    A diverse image dataset containing clock faces with varying styles, angles, and hand positions, split into training, testing, and validation subsets for accurate time recognition and image classification tasks.

  13. s

    Landmark Image Dataset

    • shaip.com
    • lb.shaip.com
    • +4more
    json
    Updated Nov 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shaip (2024). Landmark Image Dataset [Dataset]. https://www.shaip.com/offerings/environment-scene-segmentation-datasets/
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Nov 26, 2024
    Dataset authored and provided by
    Shaip
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Images of landmarks within the context of their environment

  14. g

    Ships Image Dataset

    • gts.ai
    json
    Updated Jul 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GTS (2024). Ships Image Dataset [Dataset]. https://gts.ai/dataset-download/ships-image-dataset/
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Jul 2, 2024
    Dataset provided by
    GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
    Authors
    GTS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Explore our Ships Image Dataset, featuring 8,506 high-quality images and YOLO v5 annotations. Ideal for AI model training in ship detection and classification.

  15. CSIRO Sentinel-1 SAR image dataset of oil- and non-oil features for machine...

    • data.csiro.au
    • researchdata.edu.au
    Updated Dec 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David Blondeau-Patissier; Thomas Schroeder; Foivos Diakogiannis; Zhibin Li (2022). CSIRO Sentinel-1 SAR image dataset of oil- and non-oil features for machine learning ( Deep Learning ) [Dataset]. http://doi.org/10.25919/4v55-dn16
    Explore at:
    Dataset updated
    Dec 15, 2022
    Dataset provided by
    CSIROhttp://www.csiro.au/
    Authors
    David Blondeau-Patissier; Thomas Schroeder; Foivos Diakogiannis; Zhibin Li
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Time period covered
    May 1, 2015 - Aug 31, 2022
    Area covered
    Dataset funded by
    ESA
    CSIROhttp://www.csiro.au/
    Description

    What this collection is: A curated, binary-classified image dataset of grayscale (1 band) 400 x 400-pixel size, or image chips, in a JPEG format extracted from processed Sentinel-1 Synthetic Aperture Radar (SAR) satellite scenes acquired over various regions of the world, and featuring clear open ocean chips, look-alikes (wind or biogenic features) and oil slick chips.

    This binary dataset contains chips labelled as: - "0" for chips not containing any oil features (look-alikes or clean seas)
    - "1" for those containing oil features.

    This binary dataset is imbalanced, and biased towards "0" labelled chips (i.e., no oil features), which correspond to 66% of the dataset. Chips containing oil features, labelled "1", correspond to 34% of the dataset.

    Why: This dataset can be used for training, validation and/or testing of machine learning, including deep learning, algorithms for the detection of oil features in SAR imagery. Directly applicable for algorithm development for the European Space Agency Sentinel-1 SAR mission (https://sentinel.esa.int/web/sentinel/missions/sentinel-1 ), it may be suitable for the development of detection algorithms for other SAR satellite sensors.

    Overview of this dataset: Total number of chips (both classes) is N=5,630 Class 0 1 Total 3,725 1,905

    Further information and description is found in the ReadMe file provided (ReadMe_Sentinel1_SAR_OilNoOil_20221215.txt)

  16. Chemistry Lab Image Dataset Covering 25 Apparatus Categories

    • figshare.com
    application/x-rar
    Updated Aug 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Md. Sakhawat Hossain; Md. Sadman Haque; Md. Mostafizur Rahman; Md. Mosaddik Mashrafi Mousum; Zobaer Ibn Razzaque; Robiul Awoul Robin (2025). Chemistry Lab Image Dataset Covering 25 Apparatus Categories [Dataset]. http://doi.org/10.6084/m9.figshare.29110433.v3
    Explore at:
    application/x-rarAvailable download formats
    Dataset updated
    Aug 3, 2025
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Md. Sakhawat Hossain; Md. Sadman Haque; Md. Mostafizur Rahman; Md. Mosaddik Mashrafi Mousum; Zobaer Ibn Razzaque; Robiul Awoul Robin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains 4,599 high-quality, annotated images of 25 commonly used chemistry lab apparatuses. The images, each containing structures in real-world settings, have been captured from different angles, backgrounds, and distances, while also undergoing variations in lighting to aid in the robustness of object detection models. Every image has been labeled using bounding box annotation in TXT (YOLO) format, alongside the class IDs and normalized bounding box coordinates, making object detection more precise. The annotations and bounding boxes have been built using the Roboflow platform.To achieve a better learning procedure, the dataset has been split into three sub-datasets: training, validation, and testing. The training dataset constitutes 70% of the entire dataset, with validation and testing at 20% and 10% respectively. In addition, all images undergo scaling to a standard of 640x640 pixels while being auto-oriented to rectify rotation discrepancies brought about by the EXIF metadata. The dataset is structured in three main folders - train, valid, and test, and each contains images/ and labels/ subfolders. Every image contains a label file containing class and bounding box data corresponding to each detected object.The whole dataset features 6,960 labeled instances per 25 apparatus categories including beakers, conical flasks, measuring cylinders, test tubes, among others. The dataset can be utilized for the development of automation systems, real-time monitoring and tracking systems, tools for safety monitoring, alongside AI educational tools.

  17. Truck Image Dataset

    • zenodo.org
    zip
    Updated Mar 3, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Leandro Arab Marcomini; Leandro Arab Marcomini; Andre Luiz Cunha; Andre Luiz Cunha (2023). Truck Image Dataset [Dataset]. http://doi.org/10.5281/zenodo.5744737
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 3, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Leandro Arab Marcomini; Leandro Arab Marcomini; Andre Luiz Cunha; Andre Luiz Cunha
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Collection of truck images, from a side point view, used to extract information about truck axles, collected on a highway in the State of São Paulo, Brazil. This is still a work in progress dataset and will be updated regularly, as new images are acquired. More info can be found on: Researchgate Lab Page, OrcID Profiles, or ITS Lab page on Github.

    The dataset includes 725 cropped images of trucks, taken with three different cameras, on five different locations.

    • 725 images
    • Format: JPG
    • Resolution: 1920xVarious, 96dpi, 24bits
    • Naming pattern:

    If this dataset helps in any way your research, please feel free to contact the authors. We really enjoy knowing about other researcher's projects and how everybody is making use of the images on this dataset. We are also open for collaborations and to answer any questions. We also have a paper that uses this dataset, so if you want to officially cite us in your research, please do so! We appreciate it!

    Marcomini, Leandro Arab, and André Luiz Cunha. "Truck Axle Detection with Convolutional Neural Networks." arXiv preprint arXiv:2204.01868 (2022).

  18. VegeNet - Image datasets and Codes

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Oct 27, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jo Yen Tan; Jo Yen Tan (2022). VegeNet - Image datasets and Codes [Dataset]. http://doi.org/10.5281/zenodo.7254508
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 27, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jo Yen Tan; Jo Yen Tan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Compilation of python codes for data preprocessing and VegeNet building, as well as image datasets (zip files).

    Image datasets:

    1. vege_original : Images of vegetables captured manually in data acquisition stage
    2. vege_cropped_renamed : Images in (1) cropped to remove background areas and image labels renamed
    3. non-vege images : Images of non-vegetable foods for CNN network to recognize other-than-vegetable foods
    4. food_image_dataset : Complete set of vege (2) and non-vege (3) images for architecture building.
    5. food_image_dataset_split : Image dataset (4) split into train and test sets
    6. process : Images created when cropping (pre-processing step) to create dataset (2).
  19. RGB Image Dataset

    • figshare.com
    zip
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ling-Qi Zhang (2023). RGB Image Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.14542434.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Ling-Qi Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Curated RGB image dataset for our analysis, splited into training and evalutaion set. Based on ImageNet ILSVRC dataset (Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, 2015).

  20. h

    open-images-v7

    • huggingface.co
    Updated Mar 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BitMind (2025). open-images-v7 [Dataset]. https://huggingface.co/datasets/bitmind/open-images-v7
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 13, 2025
    Dataset authored and provided by
    BitMind
    Description

    Dataset Card for Open Images Dataset

    This dataset contains images from the Open Images dataset. It includes image URLs, split into training, validation, and test sets.

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    Open Images is a dataset of approximately 9 million URLs to images that have been annotated with image-level labels, bounding boxes, object segmentation masks, and visual relationships.

    Curated by: Google LLC License: Images: CC BY 2.0 license… See the full description on the dataset page: https://huggingface.co/datasets/bitmind/open-images-v7.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sarthak1799 (2020). Monochrome Image dataset [Dataset]. https://www.kaggle.com/datasets/sarthak1799/monochrome-image-dataset
Organization logo

Monochrome Image dataset

Monochrome (grayscale) images for Image colourization.

Explore at:
zip(129142623 bytes)Available download formats
Dataset updated
Dec 5, 2020
Authors
Sarthak1799
Description

Context

Richard Zhang's image colourizer model trained on the ImageNet dataset which converts grayscale images to colour images using the L channel of the Lab colour space. The dataset contains necessary files for loading the model and Grayscale images for Image colourization.

Content

Dataset contains necessary files for loading the model and Grayscale images for Image colourization.

Acknowledgements

Richard Zhang who created the model back in 2016 has made this dataset possible.

Inspiration

Your data will be in front of the world's largest data science community. What questions do you want to see answered?

Search
Clear search
Close search
Google apps
Main menu