100+ datasets found
  1. Butterfly Image Classification

    • kaggle.com
    zip
    Updated Jun 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DePie (2025). Butterfly Image Classification [Dataset]. https://www.kaggle.com/datasets/phucthaiv02/butterfly-image-classification
    Explore at:
    zip(236814249 bytes)Available download formats
    Dataset updated
    Jun 26, 2025
    Authors
    DePie
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The dataset features 75 different classes of Butterflies. The dataset contains about 1000+ labelled images including the validation images. Each image belongs to only one butterfly category.

    The label of each image are saved in Training_set.csv.

    The Testing_set.csv contains names of image in test folder, which you need to predict the label and submit to Data Sprint 107 - Butterfly Image Classification.

  2. Cards Image Dataset-Classification

    • kaggle.com
    zip
    Updated Nov 17, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gerry (2022). Cards Image Dataset-Classification [Dataset]. https://www.kaggle.com/datasets/gpiosenka/cards-image-datasetclassification
    Explore at:
    zip(403866125 bytes)Available download formats
    Dataset updated
    Nov 17, 2022
    Authors
    Gerry
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This is a very high quality dataset of playing card images. All images are 224 X 224 X 3 in jpg format. All images in the dataset have been cropped so that only the image of a single card is present and the card occupies well over 50% of the pixels in the image. There are 7624 training images, 265 test images and 265 validation images. The train, test and validation directories are partitioned into 53 sub directories , one for each of the 53 types of cards. The dataset also includes a csv file which can be used to load the datasets.

  3. Crop Disease Image Classification Dataset

    • kaggle.com
    zip
    Updated Mar 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bekhzod Olimov (2024). Crop Disease Image Classification Dataset [Dataset]. https://www.kaggle.com/datasets/killa92/crop-disease-image-classification-dataset
    Explore at:
    zip(2146518300 bytes)Available download formats
    Dataset updated
    Mar 17, 2024
    Authors
    Bekhzod Olimov
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This dataset contains images and meta data for crop disease classification. For training purposes, it should be split into three sets necessary for Machine Learning and Deep Learning tasks, namely train, validation, and test splits.

    The images are located in the "images" folder and labels can be obtained from the meta data in the csv file. Also, dataset class names are given in the class_names json file.

    Good luck!

  4. Cats and Dogs Classification Dataset

    • kaggle.com
    zip
    Updated Oct 7, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhavik Jikadara (2023). Cats and Dogs Classification Dataset [Dataset]. https://www.kaggle.com/datasets/bhavikjikadara/dog-and-cat-classification-dataset
    Explore at:
    zip(812748137 bytes)Available download formats
    Dataset updated
    Oct 7, 2023
    Authors
    Bhavik Jikadara
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Cat and Dog Classification dataset is a standard computer vision dataset that involves classifying photos as either containing a dog or a cat. This dataset is provided as a subset of photos from a much larger dataset of approximately 25 thousands.

    The dataset contains 24,998 images, split into 12,499 Cat images and 12,499 Dog images. The training images are divided equally between cat and dog images, while the test images are not labeled. This allows users to evaluate their models on unseen data.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F7367057%2F498b0fc0a7a8cf40ac4337da82a4ebc5%2Fhow-to-introduce-a-dog-to-a-cat-blog-cover.webp?generation=1696702214010539&alt=media" alt="">

  5. Cats and Dogs image classification

    • kaggle.com
    zip
    Updated Dec 20, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Cortinhas (2022). Cats and Dogs image classification [Dataset]. https://www.kaggle.com/datasets/samuelcortinhas/cats-and-dogs-image-classification
    Explore at:
    zip(67566406 bytes)Available download formats
    Dataset updated
    Dec 20, 2022
    Authors
    Samuel Cortinhas
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Over a 1000 images of cats and dogs scraped off of google images. The problem statement is to build a model that can classify between a cat and a dog in an image as accurately as possible.

    Image sizes range from roughly 100x100 pixels to 2000x1000 pixels.

    Image format is jpeg.

    Duplicates have been removed.

  6. Scientific Image Classification Dataset

    • kaggle.com
    Updated Apr 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rushil Prajapati (2024). Scientific Image Classification Dataset [Dataset]. https://www.kaggle.com/datasets/rushilprajapati/data-final
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 13, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Rushil Prajapati
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Overview

    This dataset is a comprehensive collection of scientific images curated for the advancement of image classification algorithms in the scientific domain. It comprises a diverse set of images across six distinct classes, providing a unique challenge for machine learning enthusiasts and researchers. The base source of the data is derived from the Biofors dataset, with additional images incorporated to enhance variety and complexity. All images are either in .JPG or .PNG formats.

    Dataset Description

    The dataset is organized into six primary classes, each representing a different aspect of scientific imaging:

    Blot-Gel: Images of various blotting techniques and gel electrophoresis results used in molecular biology.

    FACS (Fluorescence-Activated Cell Sorting): Flow cytometry images showcasing cell populations based on fluorescent labeling.

    Histopathology: High-resolution images of tissue sections stained to reveal cellular structures and patterns indicative of pathological states.

    Macroscopy: Images captured without magnification, highlighting the gross features and details of biological specimens.

    Microscopy: A collection of microscopic images that reveal the intricate details of cells and microorganisms.

    Non-scientific: A control group of images unrelated to scientific inquiry, included to test the robustness of classification models. It mainly consists images from ImageNet dataset.

    Use Cases

    This dataset is ideal for developing and benchmarking image classification models that can be applied to:

    Image Falsification and Fabrication Detection: This dataset serves as a foundation for developing forensic tools to combat image falsification and fabrication in scientific publications. With the Biofors dataset as a base, participants have the opportunity to create models that can detect unethical manipulations, thereby safeguarding the credibility of scientific research. The challenge lies in identifying subtle alterations that may indicate misconduct, such as duplicated, spliced, or artificially enhanced images. Success in this area has far-reaching implications, potentially preventing the spread of misinformation and preserving the integrity of scientific literature.

    Automated Analysis of Scientific Experiments: The dataset facilitates the development of models for automated analysis in scientific experiments, which can significantly accelerate the pace of discovery. Automated research workflows, integrating computation, laboratory automation, and AI tools, are transforming how experiments are designed, conducted, and analyzed.

    Diagnostic Tools in Medicine: In the medical field, diagnostic tools are essential for achieving diagnostic excellence, which involves making correct and timely diagnoses while maximizing patient experience and managing uncertainty. AI in healthcare is revolutionizing diagnostics, from analyzing medical images to identifying disease patterns and predicting patient outcomes.

    References

    [1] https://ieeexplore.ieee.org/document/9710731

    [2] https://github.com/vimal-isi-edu/BioFors

    [3] https://link.springer.com/chapter/10.1007/978-3-031-53085-2_26

    [4] https://www.nationalacademies.org/news/2022/05/automated-research-workflows-are-speeding-pace-of-scientific-discovery-new-report-offers-recommendations-to-advance-their-development

    [5] https://warwick.ac.uk/fac/cross_fac/tia/data/pannuke (Histopathology images)

    [6] https://www.kaggle.com/datasets/chopinforest/esophageal-endoscopy-images (Macroscopy)

    [7] https://www.kaggle.com/datasets/safurahajiheidari/kidney-stone-images (Macroscopy)

    [8] https://www.kaggle.com/datasets/alifrahman/covid19-chest-xray-image-dataset (Macroscopy)

    [9] https://www.kaggle.com/datasets/vitaliykinakh/stable-imagenet1k (Non-scientific images)

    [10] https://www.kaggle.com/datasets/nodoubttome/skin-cancer9-classesisic (Macroscopy)

    [11] https://www.kaggle.com/datasets/sunedition/graphs-dataset (Non–scientific images)

  7. Vehicle Image Classification

    • kaggle.com
    zip
    Updated Aug 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohamed Maher (2024). Vehicle Image Classification [Dataset]. https://www.kaggle.com/datasets/mohamedmaher5/vehicle-classification
    Explore at:
    zip(866783573 bytes)Available download formats
    Dataset updated
    Aug 9, 2024
    Authors
    Mohamed Maher
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Overview: This dataset is designed for vehicle classification tasks and contains a total of 5,600 images distributed across seven categories. Each category represents a different type of vehicle.

    Structure:

    • Main Folder: Vehicles
    • Subfolders:
      • Auto Rickshaws (800 images)
      • Bikes (800 images)
      • Cars (800 images)
      • Motorcycles (800 images)
      • Planes (800 images)
      • Ships (800 images)
      • Trains (800 images)

    Image Format: All images are in JPEG format with the .jpg extension.

    Size: 5,600 images in total.

    Usage: Ideal for building and testing image classification models to distinguish between different types of vehicles.

  8. Vegetable Image Dataset

    • kaggle.com
    zip
    Updated Dec 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    M Israk Ahmed (2021). Vegetable Image Dataset [Dataset]. https://www.kaggle.com/datasets/misrakahmed/vegetable-image-dataset
    Explore at:
    zip(560031432 bytes)Available download formats
    Dataset updated
    Dec 24, 2021
    Authors
    M Israk Ahmed
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    The following must be cited when using this dataset:

    https://www.researchgate.net/publication/352846889_DCNN-Based_Vegetable_Image_Classification_Using_Transfer_Learning_A_Comparative_Study

    🎯Citation in bibtex>>>

    @inproceedings{ahmed2021dcnn, title={DCNN-based vegetable image classification using transfer learning: A comparative study}, author={Ahmed, M Israk and Mamun, Shahriyar Mahmud and Asif, Asif Uz Zaman}, booktitle={2021 5th International Conference on Computer, Communication and Signal Processing (ICCCSP)}, pages={235--243}, year={2021}, organization={IEEE} }

    Context

    The initial experiment is done with 15 types of common vegetables that are found throughout the world. The vegetables that are chosen for the experimentation are- bean, bitter gourd, bottle gourd, brinjal, broccoli, cabbage, capsicum, carrot, cauliflower, cucumber, papaya, potato, pumpkin, radish and tomato. A total of 21000 images from 15 classes are used where each class contains 1400 images of size 224×224 and in *.jpg format. The dataset split 70% for training, 15% for validation, and 15% for testing purpose.

    Content

    This dataset contains three folders:

    • train (15000 images)
    • test (3000 images)
    • validation (3000 images) each of the above folders contains subfolders for different vegetables wherein the images for respective vegetables are present.

    Data Collection

    The images in this dataset were collected by us from vegetable farm and market for a project.

    Acknowledgements

    We would like to give thanks to the people who helped us regarding data collection.

    Inspiration

    From vegetable production to delivery, several common steps are operated manually. Like picking, and sorting vegetables. Therefore, we decided to solve this problem using deep neural architecture, by developing a model that can detect and classify vegetables. That model can be implemented in different types of devices and can also solve other problems related to the identification of vegetables, like labeling the vegetables automatically without any need for human work.

  9. Big Cats Image Classification Dataset 🦁

    • kaggle.com
    zip
    Updated Mar 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    iulia (2023). Big Cats Image Classification Dataset 🦁 [Dataset]. https://www.kaggle.com/datasets/patriciabrezeanu/big-cats-image-classification-dataset
    Explore at:
    zip(532304917 bytes)Available download formats
    Dataset updated
    Mar 29, 2023
    Authors
    iulia
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset, contains a curated collection of images featuring four distinct big cat species: lions, tigers, leopards, and cheetahs. The images were sourced using the DuckDuckGo search engine and are organized into separate directories for each animal. This dataset is ideal for machine learning and computer vision projects focused on image classification and species recognition. With this dataset, you can train and validate your models to accurately differentiate between these majestic big cats.

  10. Fashion Apparel Image Classification Dataset

    • kaggle.com
    zip
    Updated Jun 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ShreyanshVerma27 (2024). Fashion Apparel Image Classification Dataset [Dataset]. https://www.kaggle.com/datasets/shreyanshverma27/new-data-fashion
    Explore at:
    zip(135989032 bytes)Available download formats
    Dataset updated
    Jun 24, 2024
    Authors
    ShreyanshVerma27
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Introducing the Fashion Apparel Image Classification Dataset for Convolutional Neural Networks (CNN), a carefully curated collection of clothing images specifically designed for CNN-based image classification tasks. This dataset features 5,413 high-quality images of various clothing items in two primary colors: black and blue. The images are categorized into 10 distinct classes:

    • black_dress: 450
    • black_pants: 871
    • black_shirt: 715
    • black_shoes: 766
    • black_shorts: 328
    • blue_dress: 502
    • blue_pants: 798
    • blue_shirt: 741
    • blue_shoes: 523
    • blue_shorts: 299

    Each category contains a substantial number of images, ranging from 299 to 871, ensuring a well-balanced and diverse dataset for robust model training and testing. The dataset showcases a wide variety of clothing styles, designs, and textures, making it an ideal resource for developing and refining CNN models for fashion apparel image classification.

    This Fashion Apparel Image Classification Dataset for CNN is perfect for researchers, developers, and students working on computer vision, image processing, and deep learning projects in the fashion and apparel domain. Use it to train and test your CNN models for object detection, image segmentation, and clothing classification tasks. Explore this dataset and elevate your fashion apparel image classification projects to new heights.

  11. RealWaste Image Classification

    • kaggle.com
    zip
    Updated Jan 19, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joakim Arvidsson (2024). RealWaste Image Classification [Dataset]. https://www.kaggle.com/datasets/joebeachcapital/realwaste
    Explore at:
    zip(688660611 bytes)Available download formats
    Dataset updated
    Jan 19, 2024
    Authors
    Joakim Arvidsson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Overview

    An image classification dataset of waste items across 9 major material types, collected within an authentic landfill environment.

    Dataset Information

    For what purpose was the dataset created? RealWaste was created as apart of an honors thesis researching how convolution neural networks could perform on authentic waste material when trained on objects in pure and unadulterated forms, when compared to training via real waste items.

    What do the instances in this dataset represent? Color images of waste items captured at the point of reception in a landfill environment. Images are released in 524x524 resolution in line with accompanying research paper. For full size resolution images, please contact the corresponding author.

    Additional Information

    The labels applied to the images represent the material type present, however further refinement of labelling may be performed given the moderate dataset size (i.e., splitting the plastic class in transparent and opaque components). Under the proposed labels, image counts are as follows: - Cardboard: 461 - Food Organics: 411 - Glass: 420 - Metal: 790 - Miscellaneous Trash: 495 - Paper: 500 - Plastic: 921 - Textile Trash: 318 - Vegetation: 436

    Has Missing Values?

    No

    Introductory Paper

    RealWaste: A Novel Real-Life Data Set for Landfill Waste Classification Using Deep Learning By Sam Single, Saeid Iranmanesh, Raad Raad. 2023 Published in Information

    Class Labels

    Cardboard, Food Organics, Glass, Metal, Miscellaneous Trash, Paper, Plastic, Textile Trash, and Vegetation

  12. Intel Image Classification

    • kaggle.com
    zip
    Updated Jan 30, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Puneet Bansal (2019). Intel Image Classification [Dataset]. https://www.kaggle.com/puneet6060/intel-image-classification
    Explore at:
    zip(363152213 bytes)Available download formats
    Dataset updated
    Jan 30, 2019
    Authors
    Puneet Bansal
    Description

    Context

    This is image data of Natural Scenes around the world.

    Content

    This Data contains around 25k images of size 150x150 distributed under 6 categories. {'buildings' -> 0, 'forest' -> 1, 'glacier' -> 2, 'mountain' -> 3, 'sea' -> 4, 'street' -> 5 }

    The Train, Test and Prediction data is separated in each zip files. There are around 14k images in Train, 3k in Test and 7k in Prediction. This data was initially published on https://datahack.analyticsvidhya.com by Intel to host a Image classification Challenge.

    Acknowledgements

    Thanks to https://datahack.analyticsvidhya.com for the challenge and Intel for the Data

    Photo by Jan Böttinger on Unsplash

    Inspiration

    Want to build powerful Neural network that can classify these images with more accuracy.

  13. food-11 Image Classification Dataset

    • kaggle.com
    zip
    Updated Jul 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bikram Saha (2022). food-11 Image Classification Dataset [Dataset]. https://www.kaggle.com/datasets/imbikramsaha/food11
    Explore at:
    zip(544294379 bytes)Available download formats
    Dataset updated
    Jul 7, 2022
    Authors
    Bikram Saha
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The main folder food11 contains two sub-folders : 1. train 2. test

    Both train and test folders have 11 folders named apple_pie cheesecake chicken_curry french_fries fried_rice hamburger hot_dog ice_cream omelette pizza sushi

    Each folders inside train-set(total=9900) contain 900 images, and folders inside test-set(total=1100) contain 100 images.

  14. Agricultural crops image classification

    • kaggle.com
    zip
    Updated Aug 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Md Waquar Azam (2022). Agricultural crops image classification [Dataset]. https://www.kaggle.com/datasets/mdwaquarazam/agricultural-crops-image-classification
    Explore at:
    zip(82830492 bytes)Available download formats
    Dataset updated
    Aug 26, 2022
    Authors
    Md Waquar Azam
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset contains 30 different types of crop images in separate folders

    Task To classify all types of agriculture crop images ( rice, sugarcane, maize ,lemon, banana,coconut , jute etc..) with better accuracy.

    Inspiration The question to be answered to classify crops in each type.

  15. Tree Nuts -Image Classification

    • kaggle.com
    zip
    Updated Jun 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gerry (2022). Tree Nuts -Image Classification [Dataset]. https://www.kaggle.com/datasets/gpiosenka/tree-nuts-image-classification
    Explore at:
    zip(151194165 bytes)Available download formats
    Dataset updated
    Jun 24, 2022
    Authors
    Gerry
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    dataset of 10 types of tree nuts. 1163train, 50 test,50 validation files 224 X 224 X 3 jpg format. Also includes a tensorflow trained model nuts)100.0.hs that achieved an F1 score of 100%. A csv file tree nuts.csv is also provided

  16. Concrete Crack Images for Classification

    • kaggle.com
    zip
    Updated Apr 28, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ArnavR (2022). Concrete Crack Images for Classification [Dataset]. https://www.kaggle.com/datasets/arnavr10880/concrete-crack-images-for-classification
    Explore at:
    zip(244476725 bytes)Available download formats
    Dataset updated
    Apr 28, 2022
    Authors
    ArnavR
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Description

    The dataset contains concrete images having cracks. The data is collected from various METU Campus Buildings. The dataset is divided into two as negative and positive crack images for image classification. Each class has 20000images with a total of 40000 images with 227 x 227 pixels with RGB channels. The dataset is generated from 458 high-resolution images (4032x3024 pixel) with the method proposed by Zhang et al (2016). High-resolution images have variance in terms of surface finish and illumination conditions. No data augmentation in terms of random rotation or flipping is applied.

    Acknowledgements

    Özgenel, Çağlar Fırat (2019), “Concrete Crack Images for Classification”, Mendeley Data, V2, doi: 10.17632/5y9wdsg2zt.2

  17. MOTHS IMAGE DATASET-CLASSIFICATION

    • kaggle.com
    Updated Aug 27, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gerry (2022). MOTHS IMAGE DATASET-CLASSIFICATION [Dataset]. https://www.kaggle.com/datasets/gpiosenka/moths-image-datasetclassification
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 27, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Gerry
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Gerry

    Released under CC0: Public Domain

    Contents

  18. 5-class weather status image classification

    • kaggle.com
    zip
    Updated Aug 12, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ammar Alfaifi (2022). 5-class weather status image classification [Dataset]. https://www.kaggle.com/datasets/ammaralfaifi/5class-weather-status-image-classification
    Explore at:
    zip(522431689 bytes)Available download formats
    Dataset updated
    Aug 12, 2022
    Authors
    Ammar Alfaifi
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    About

    I wanted to collect real fresh outdoors images with fire classes in a part of Misk Foundation Data Science Immersive project. With MS Bing API I collected and cleaned up to 1500 images for all classes. Further, I collected data from four kaggle datasets, their credits are below.

    Check my GitHub repo for my work https://github.com/ammar-faifi/Weather_Status_Predictor_From_Images

    Check the report here https://ammar-faifi.github.io/Weather_Status_Predictor_From_Images/

    Online predictor here https://dsi-weather-predictor.herokuapp.com

    Data Summary

    ClassFolderImages Count
    Sunnysunny6702
    Cloudycloudy6274
    Foggyfoggy1261
    Rainyrainy1927
    Snowysnowy1875
    TotalNan18039

    Sources

    1 - Manually from Bing API 2 - https://www.kaggle.com/datasets/jagadeesh23/weather-classification 3 - https://www.kaggle.com/datasets/polavr/twoclass-weather-classification 4 - https://www.kaggle.com/datasets/jehanbhathena/weather-dataset 5 - https://www.kaggle.com/datasets/pratik2901/multiclass-weather-dataset

  19. CNN Image Classification Dataset

    • kaggle.com
    zip
    Updated Mar 22, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David King_Rutgers (2022). CNN Image Classification Dataset [Dataset]. https://www.kaggle.com/datasets/davidkingrutgers/cnn-image-classification-dataset
    Explore at:
    zip(545101979 bytes)Available download formats
    Dataset updated
    Mar 22, 2022
    Authors
    David King_Rutgers
    Description

    Dataset

    This dataset was created by David King_Rutgers

    Contents

  20. Logo Images Dataset

    • kaggle.com
    zip
    Updated Mar 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Siddharth Sah (2023). Logo Images Dataset [Dataset]. https://www.kaggle.com/datasets/siddharthkumarsah/logo-dataset-2341-classes-and-167140-images
    Explore at:
    zip(2054788801 bytes)Available download formats
    Dataset updated
    Mar 23, 2023
    Authors
    Siddharth Sah
    Description

    The "Logo-2K+" dataset, published in the paper "Logo-2K+: Discriminative Region Navigation and Augmentation Network for Scalable Logo Classification", is a collection of 167,140 images of logos belonging to 2,341 sub-classes across 10 root-categories. The images were crawled from the Google and Baidu search engines.

    Before making the dataset available for public use, I have carefully cleaned the dataset to ensure that it can be loaded and used without any errors. I have removed all folders with special characters and spaces in their names, and only kept alphanumeric characters and underscores. This makes the dataset more accessible and easier to use for researchers and developers working on logo classification.

    The cleaned dataset is provided in three parts:

    "Logo-2K+.rar" contains the original 167,140 logo images, grouped into 10 root-categories and 2,341 sub-classes.

    a. "Logo-2K+classes.txt" provides labels for all sub-classes.

    b. "train_images_root.txt" lists the paths of training images starting with the root-category.

    c. "test_images_root.txt" lists the paths of testing images starting with the root-category.

    d. "train_images.txt" lists the relative paths of training images starting with the sub-class.

    e. "test_images.txt" lists the relative paths of testing images starting with the sub-class.

    The "Logo-2K+" dataset is a valuable resource for researchers and developers working on logo classification, as it contains a large and diverse set of logo images with well-defined sub-class labels. The provided training and testing images, along with the label files, can be used to train and evaluate logo classification models.

    The statistic comparison of 10 root categories from Logo-2K+ is shown as follows.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1885485%2F9473b590a6a770cf5b3cd42e9b66a13b%2FScreenshot%202023-03-23%20at%208.50.41%20PM.png?generation=1679575860470498&alt=media" alt="">

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
DePie (2025). Butterfly Image Classification [Dataset]. https://www.kaggle.com/datasets/phucthaiv02/butterfly-image-classification
Organization logo

Butterfly Image Classification

Identify the class to which each butterfly belongs to

Explore at:
zip(236814249 bytes)Available download formats
Dataset updated
Jun 26, 2025
Authors
DePie
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

The dataset features 75 different classes of Butterflies. The dataset contains about 1000+ labelled images including the validation images. Each image belongs to only one butterfly category.

The label of each image are saved in Training_set.csv.

The Testing_set.csv contains names of image in test folder, which you need to predict the label and submit to Data Sprint 107 - Butterfly Image Classification.

Search
Clear search
Close search
Google apps
Main menu