100+ datasets found
  1. PASCALRAW-derived Classification Dataset

    • kaggle.com
    zip
    Updated Oct 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mathias Viborg (2023). PASCALRAW-derived Classification Dataset [Dataset]. https://www.kaggle.com/datasets/mathiasviborg/pascalraw-derived-object-cropped-dataset
    Explore at:
    zip(8714509767 bytes)Available download formats
    Dataset updated
    Oct 5, 2023
    Authors
    Mathias Viborg
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Further desciption of dataset and original useage can be found in the article: Enabling RAW Image Classification Using Existing RGB Classifiers.

    The dataset is consisting of unprocessed RAW sensor data cropped from the PASCALRAW dataset for RAW image classification.

    Data distribution is a follows: | Backpack |Bicycle |Car|Person| | --- | --- | --- | --- | | 3569| 5142 | 4856 | 4864 |



    To use the full-resolution RAW images from the datasets as inputs to a classifier, downscaling would be required, which would artificially process the original image, resulting in a lower-quality representation. This would be counter-intuitive for this work since one of the primary reasons for using RAW images is that they contain unaltered capture information. For these reasons, a more relevant dataset for investigating the hypothesis consists of small, unaltered RAW images. Since such a dataset is not publicly available, we create a custom dataset based on classes within the PASCALRAW dataset. The PASCALRAW dataset has the added benefit of being captured using a Nikon D3200 DSLR camera and therefore also being stored in the same Nikon Raw Image file format (NEF), removing the need for considerations for differences in capture data and image formats.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F13804035%2Fec1031a81d4d46274c3db05ad31a087b%2FUnavngivet%20diagram.png?generation=1702328479100552&alt=media" alt="">

    Figure 2 is a grid of crops overlapping the semantic mask withat least 50 % of its pixels. Figure 3 is an overview of the first four extracted samples from the object presented in Figure 2. Shown in Original RGB and Original RAW (RAW displayed using Matplotlib).

    Our dataset is created by extracting 448x448-sized cropped images of people, bicycles, cars and backpacks that are present in each full-resolution RAW- and corresponding RGB images within the PASCALRAW dataset. The dataset-generating process utilizes YOLOv5 to apply instance segmentation to extract masks of each of the desired four classes that are present in each RGB image. By then placing a grid of cropped images of size 448x448 over each object's entire bounding box, each crop overlapping itself with a stride of 224 pixels along its x- and y-direction, see Figure 2, and only storing those crops that overlap the semantic mask with at least 50 \% of its pixels, data samples from both the RGB- and corresponding RAW image are extracted, see Figure 3.

  2. Custom Face Recognition Image Dataset

    • kaggle.com
    zip
    Updated Jul 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata (2025). Custom Face Recognition Image Dataset [Dataset]. https://www.kaggle.com/datasets/unidpro/face-recognition-image-dataset
    Explore at:
    zip(27609695 bytes)Available download formats
    Dataset updated
    Jul 3, 2025
    Authors
    Unidata
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Image Dataset of face images for compuer vision tasks

    Dataset comprises 500,600+ images of individuals representing various races, genders, and ages, with each person having a single face image. It is designed for facial recognition and face detection research, supporting the development of advanced recognition systems.

    By leveraging this dataset, researchers and developers can enhance deep learning models, improve face verification and face identification techniques, and refine detection algorithms for more accurate recognizing faces in real-world scenarios. - Get the data

    Metadata for the dataset

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22059654%2F87acb75b060abcd7838e8a9fad21fb79%2FFrame%201%20(8).png?generation=1743153407873743&alt=media" alt=""> All images come with rigorously verified metadata annotations (age, gender, ethnicity), achieving ≥95% labeling accuracy. Also images are captured under different lighting conditions and resolutions, enhancing the dataset's utility for computer vision tasks and image classifications.

    💵 Buy the Dataset: This is a limited preview of the data. To access the full dataset, please contact us at https://unidata.pro to discuss your requirements and pricing options.

    Researchers can leverage this dataset to improve recognition technology and develop learning models that enhance the accuracy of face detections. The dataset also supports projects focused on face anti-spoofing and deep learning applications, making it an essential tool for those studying biometric security and liveness detection technologies.

    🌐 UniData provides high-quality datasets, content moderation, data collection and annotation for your AI/ML projects

  3. Google Scraped Image Dataset

    • kaggle.com
    zip
    Updated Sep 24, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Debadri Dutta (2018). Google Scraped Image Dataset [Dataset]. https://www.kaggle.com/duttadebadri/image-classification
    Explore at:
    zip(2514186873 bytes)Available download formats
    Dataset updated
    Sep 24, 2018
    Authors
    Debadri Dutta
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    So I have a knack of photography and travelling. I wanted to create a model for myself which can classify my own pictures. But to be honest, a Data Scientist should always know how to collect data. So I scraped data from google images using a Python Script and using other open-source data sources from MIT, Kaggle itself, etc. Request everyone to give a try. I'll update the no. of images in validation set as time goes on.

    The link to the scripting file is here: https://github.com/debadridtt/Scraping-Google-Images-using-Python

    The images belong typically to 4 classes:

    • Art & Culture
    • Architecture
    • Food and Drinks
    • Travel and Adventure
  4. F

    English Product Image OCR Dataset

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). English Product Image OCR Dataset [Dataset]. https://www.futurebeeai.com/dataset/ocr-dataset/english-product-image-ocr-dataset
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introducing the English Product Image Dataset - a diverse and comprehensive collection of images meticulously curated to propel the advancement of text recognition and optical character recognition (OCR) models designed specifically for the English language.

    Dataset Contain & Diversity

    Containing a total of 2000 images, this English OCR dataset offers diverse distribution across different types of front images of Products. In this dataset, you'll find a variety of text that includes product names, taglines, logos, company names, addresses, product content, etc. Images in this dataset showcase distinct fonts, writing formats, colors, designs, and layouts.

    To ensure the diversity of the dataset and to build a robust text recognition model we allow limited (less than five) unique images from a single resource. Stringent measures have been taken to exclude any personally identifiable information (PII) and to ensure that in each image a minimum of 80% of space contains visible English text.

    Images have been captured under varying lighting conditions – both day and night – along with different capture angles and backgrounds, to build a balanced OCR dataset. The collection features images in portrait and landscape modes.

    All these images were captured by native English people to ensure the text quality, avoid toxic content and PII text. We used the latest iOS and Android mobile devices above 5MP cameras to click all these images to maintain the image quality. In this training dataset images are available in both JPEG and HEIC formats.

    Metadata

    Along with the image data, you will also receive detailed structured metadata in CSV format. For each image, it includes metadata like image orientation, county, language, and device information. Each image is properly renamed corresponding to the metadata.

    The metadata serves as a valuable tool for understanding and characterizing the data, facilitating informed decision-making in the development of English text recognition models.

    Update & Custom Collection

    We're committed to expanding this dataset by continuously adding more images with the assistance of our native English crowd community.

    If you require a custom product image OCR dataset tailored to your guidelines or specific device distribution, feel free to contact us. We're equipped to curate specialized data to meet your unique needs.

    Furthermore, we can annotate or label the images with bounding box or transcribe the text in the image to align with your specific project requirements using our crowd community.

    License

    This Image dataset, created by FutureBeeAI, is now available for commercial use.

    Conclusion:

    Leverage the power of this product image OCR dataset to elevate the training and performance of text recognition, text detection, and optical character recognition models within the realm of the English language. Your journey to enhanced language understanding and processing starts here.

  5. F

    Spanish Product Image OCR Dataset

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Spanish Product Image OCR Dataset [Dataset]. https://www.futurebeeai.com/dataset/ocr-dataset/spanish-product-image-ocr-dataset
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introducing the Spanish Product Image Dataset - a diverse and comprehensive collection of images meticulously curated to propel the advancement of text recognition and optical character recognition (OCR) models designed specifically for the Spanish language.

    Dataset Contain & Diversity

    Containing a total of 2000 images, this Spanish OCR dataset offers diverse distribution across different types of front images of Products. In this dataset, you'll find a variety of text that includes product names, taglines, logos, company names, addresses, product content, etc. Images in this dataset showcase distinct fonts, writing formats, colors, designs, and layouts.

    To ensure the diversity of the dataset and to build a robust text recognition model we allow limited (less than five) unique images from a single resource. Stringent measures have been taken to exclude any personally identifiable information (PII) and to ensure that in each image a minimum of 80% of space contains visible Spanish text.

    Images have been captured under varying lighting conditions – both day and night – along with different capture angles and backgrounds, to build a balanced OCR dataset. The collection features images in portrait and landscape modes.

    All these images were captured by native Spanish people to ensure the text quality, avoid toxic content and PII text. We used the latest iOS and Android mobile devices above 5MP cameras to click all these images to maintain the image quality. In this training dataset images are available in both JPEG and HEIC formats.

    Metadata

    Along with the image data, you will also receive detailed structured metadata in CSV format. For each image, it includes metadata like image orientation, county, language, and device information. Each image is properly renamed corresponding to the metadata.

    The metadata serves as a valuable tool for understanding and characterizing the data, facilitating informed decision-making in the development of Spanish text recognition models.

    Update & Custom Collection

    We're committed to expanding this dataset by continuously adding more images with the assistance of our native Spanish crowd community.

    If you require a custom product image OCR dataset tailored to your guidelines or specific device distribution, feel free to contact us. We're equipped to curate specialized data to meet your unique needs.

    Furthermore, we can annotate or label the images with bounding box or transcribe the text in the image to align with your specific project requirements using our crowd community.

    License

    This Image dataset, created by FutureBeeAI, is now available for commercial use.

    Conclusion:

    Leverage the power of this product image OCR dataset to elevate the training and performance of text recognition, text detection, and optical character recognition models within the realm of the Spanish language. Your journey to enhanced language understanding and processing starts here.

  6. d

    340K+ Jewelry Images | AI Training Data | Object Detection Data | Annotated...

    • datarade.ai
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Seeds, 340K+ Jewelry Images | AI Training Data | Object Detection Data | Annotated imagery data | Global Coverage [Dataset]. https://datarade.ai/data-products/200k-jewelry-images-ai-training-data-object-detection-da-data-seeds
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset authored and provided by
    Data Seeds
    Area covered
    Norway, Colombia, Chile, Jordan, Vietnam, Brunei Darussalam, Sri Lanka, Mongolia, Nicaragua, Korea (Democratic People's Republic of)
    Description

    This dataset features over 340,000 high-quality images of jewelry sourced from photographers worldwide. Designed to support AI and machine learning applications, it provides a richly detailed and carefully annotated collection of jewelry imagery across styles, materials, and contexts.

    Key Features: 1. Comprehensive Metadata: the dataset includes full EXIF data, detailing camera settings such as aperture, ISO, shutter speed, and focal length. Each image is pre-annotated with object and scene detection metadata, including jewelry type, material, and context—ideal for tasks like object detection, style classification, and fine-grained visual analysis. Popularity metrics, derived from engagement on our proprietary platform, are also included.

    1. Unique Sourcing Capabilities: the images are collected through a proprietary gamified platform for photographers. Competitions focused on jewelry photography ensure high-quality, well-lit, and visually appealing submissions. Custom datasets can be sourced on-demand within 72 hours to meet specific requirements such as jewelry category (rings, necklaces, bracelets, etc.), material type, or presentation style (worn vs. product shots).

    2. Global Diversity: photographs have been submitted by contributors in over 100 countries, offering an extensive range of cultural styles, design traditions, and jewelry aesthetics. The dataset includes handcrafted and luxury items, traditional and contemporary pieces, and representations across diverse ethnic and regional fashions.

    3. High-Quality Imagery: the dataset includes high-resolution images suitable for detailed product analysis. Both studio-lit commercial shots and lifestyle/editorial photography are included, allowing models to learn from various presentation styles and settings.

    4. Popularity Scores: each image is assigned a popularity score based on its performance in GuruShots competitions. This metric offers insight into aesthetic appeal and global consumer preferences, aiding AI models focused on trend analysis or user engagement.

    5. AI-Ready Design: this dataset is optimized for training AI in jewelry classification, attribute tagging, visual search, and recommendation systems. It integrates easily into retail AI workflows and supports model development for e-commerce and fashion platforms.

    6. Licensing & Compliance: the dataset complies fully with data privacy and IP standards, offering transparent licensing for commercial and academic purposes.

    Use Cases: 1. Training AI for visual search and recommendation engines in jewelry e-commerce. 2. Enhancing product recognition, classification, and tagging systems. 3. Powering AR/VR applications for virtual try-ons and 3D visualization. 4. Supporting fashion analytics, trend forecasting, and cultural design research.

    This dataset offers a diverse, high-quality resource for training AI and ML models in the jewelry and fashion space. Customizations are available to meet specific product or market needs. Contact us to learn more!

  7. Plain T-shirt Dataset for Color Classification

    • kaggle.com
    zip
    Updated Jun 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Priitom (2025). Plain T-shirt Dataset for Color Classification [Dataset]. https://www.kaggle.com/datasets/priitom/plain-t-shirt-dataset-for-color-classification
    Explore at:
    zip(1337098 bytes)Available download formats
    Dataset updated
    Jun 16, 2025
    Authors
    Priitom
    Description

    This dataset includes 200 AI-generated images of plain T-shirts, each in a unique color and shown on a white background. It's designed for training deep learning models — especially CNNs — to classify clothing based on color.

    Each image is labeled with a color name in the accompanying CSV file (plain_tshirt_lab.csv). This makes it a great resource for projects involving color recognition, fashion tech, or learning image classification with a clean, custom dataset.

  8. R

    Building Image Dataset

    • universe.roboflow.com
    zip
    Updated Jun 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ayush Wattal Ritu Bhamrah Ritanjali Jena Sai Kumar Sannidhi (2023). Building Image Dataset [Dataset]. https://universe.roboflow.com/ayush-wattal-ritu-bhamrah-ritanjali-jena-sai-kumar-sannidhi/building-image/model/17
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 23, 2023
    Dataset authored and provided by
    Ayush Wattal Ritu Bhamrah Ritanjali Jena Sai Kumar Sannidhi
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Variables measured
    Building Roof Bounding Boxes
    Description

    Custom Dataset for Rooftop Segmentation and Rooftop Classification

  9. A

    Artificial Intelligence Training Dataset Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated May 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Artificial Intelligence Training Dataset Report [Dataset]. https://www.datainsightsmarket.com/reports/artificial-intelligence-training-dataset-1958994
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    May 3, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global Artificial Intelligence (AI) Training Dataset market is experiencing robust growth, driven by the increasing adoption of AI across diverse sectors. The market's expansion is fueled by the burgeoning need for high-quality data to train sophisticated AI algorithms capable of powering applications like smart campuses, autonomous vehicles, and personalized healthcare solutions. The demand for diverse dataset types, including image classification, voice recognition, natural language processing, and object detection datasets, is a key factor contributing to market growth. While the exact market size in 2025 is unavailable, considering a conservative estimate of a $10 billion market in 2025 based on the growth trend and reported market sizes of related industries, and a projected CAGR (Compound Annual Growth Rate) of 25%, the market is poised for significant expansion in the coming years. Key players in this space are leveraging technological advancements and strategic partnerships to enhance data quality and expand their service offerings. Furthermore, the increasing availability of cloud-based data annotation and processing tools is further streamlining operations and making AI training datasets more accessible to businesses of all sizes. Growth is expected to be particularly strong in regions with burgeoning technological advancements and substantial digital infrastructure, such as North America and Asia Pacific. However, challenges such as data privacy concerns, the high cost of data annotation, and the scarcity of skilled professionals capable of handling complex datasets remain obstacles to broader market penetration. The ongoing evolution of AI technologies and the expanding applications of AI across multiple sectors will continue to shape the demand for AI training datasets, pushing this market toward higher growth trajectories in the coming years. The diversity of applications—from smart homes and medical diagnoses to advanced robotics and autonomous driving—creates significant opportunities for companies specializing in this market. Maintaining data quality, security, and ethical considerations will be crucial for future market leadership.

  10. d

    25M+ Images | AI Training Data | Annotated imagery data for AI | Object &...

    • datarade.ai
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Seeds, 25M+ Images | AI Training Data | Annotated imagery data for AI | Object & Scene Detection | Global Coverage [Dataset]. https://datarade.ai/data-products/15m-images-ai-training-data-annotated-imagery-data-for-a-data-seeds
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset authored and provided by
    Data Seeds
    Area covered
    Cabo Verde, Bulgaria, Honduras, Macedonia (the former Yugoslav Republic of), Iraq, Botswana, Venezuela (Bolivarian Republic of), China, Sierra Leone, United Republic of
    Description

    This dataset features over 25,000,000 high-quality general-purpose images sourced from photographers worldwide. Designed to support a wide range of AI and machine learning applications, it offers a richly diverse and extensively annotated collection of everyday visual content.

    Key Features: 1. Comprehensive Metadata: the dataset includes full EXIF data, detailing camera settings such as aperture, ISO, shutter speed, and focal length. Additionally, each image is pre-annotated with object and scene detection metadata, making it ideal for tasks like classification, detection, and segmentation. Popularity metrics, derived from engagement on our proprietary platform, are also included.

    2.Unique Sourcing Capabilities: the images are collected through a proprietary gamified platform for photographers. Competitions spanning various themes ensure a steady influx of diverse, high-quality submissions. Custom datasets can be sourced on-demand within 72 hours, allowing for specific requirements—such as themes, subjects, or scenarios—to be met efficiently.

    1. Global Diversity: photographs have been sourced from contributors in over 100 countries, covering a wide range of human experiences, cultures, environments, and activities. The dataset includes images of people, nature, objects, animals, urban and rural life, and more—captured across different times of day, seasons, and lighting conditions.

    2. High-Quality Imagery: the dataset includes images with resolutions ranging from standard to high-definition to meet the needs of various projects. Both professional and amateur photography styles are represented, offering a balance of realism and creativity across visual domains.

    3. Popularity Scores: each image is assigned a popularity score based on its performance in GuruShots competitions. This unique metric reflects how well the image resonates with a global audience, offering an additional layer of insight for AI models focused on aesthetics, engagement, or content curation.

    4. AI-Ready Design: this dataset is optimized for AI applications, making it ideal for training models in general image recognition, multi-label classification, content filtering, and scene understanding. It integrates easily with leading machine learning frameworks and pipelines.

    5. Licensing & Compliance: the dataset complies fully with data privacy regulations and offers transparent licensing for both commercial and academic use.

    Use Cases: 1. Training AI models for general-purpose image classification and tagging. 2. Enhancing content moderation and visual search systems. 3. Building foundational datasets for large-scale vision-language models. 4. Supporting research in computer vision, multimodal AI, and generative modeling.

    This dataset offers a comprehensive, diverse, and high-quality resource for training AI and ML models across a wide array of domains. Customizations are available to suit specific project needs. Contact us to learn more!

  11. F

    Bahasa Product Image OCR Dataset

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Bahasa Product Image OCR Dataset [Dataset]. https://www.futurebeeai.com/dataset/ocr-dataset/bahasa-product-image-ocr-dataset
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introducing the Bahasa Product Image Dataset - a diverse and comprehensive collection of images meticulously curated to propel the advancement of text recognition and optical character recognition (OCR) models designed specifically for the Bahasa language.

    Dataset Contain & Diversity

    Containing a total of 2000 images, this Bahasa OCR dataset offers diverse distribution across different types of front images of Products. In this dataset, you'll find a variety of text that includes product names, taglines, logos, company names, addresses, product content, etc. Images in this dataset showcase distinct fonts, writing formats, colors, designs, and layouts.

    To ensure the diversity of the dataset and to build a robust text recognition model we allow limited (less than five) unique images from a single resource. Stringent measures have been taken to exclude any personally identifiable information (PII) and to ensure that in each image a minimum of 80% of space contains visible Bahasa text.

    Images have been captured under varying lighting conditions – both day and night – along with different capture angles and backgrounds, to build a balanced OCR dataset. The collection features images in portrait and landscape modes.

    All these images were captured by native Bahasa people to ensure the text quality, avoid toxic content and PII text. We used the latest iOS and Android mobile devices above 5MP cameras to click all these images to maintain the image quality. In this training dataset images are available in both JPEG and HEIC formats.

    Metadata

    Along with the image data, you will also receive detailed structured metadata in CSV format. For each image, it includes metadata like image orientation, county, language, and device information. Each image is properly renamed corresponding to the metadata.

    The metadata serves as a valuable tool for understanding and characterizing the data, facilitating informed decision-making in the development of Bahasa text recognition models.

    Update & Custom Collection

    We're committed to expanding this dataset by continuously adding more images with the assistance of our native Bahasa crowd community.

    If you require a custom product image OCR dataset tailored to your guidelines or specific device distribution, feel free to contact us. We're equipped to curate specialized data to meet your unique needs.

    Furthermore, we can annotate or label the images with bounding box or transcribe the text in the image to align with your specific project requirements using our crowd community.

    License

    This Image dataset, created by FutureBeeAI, is now available for commercial use.

    Conclusion:

    Leverage the power of this product image OCR dataset to elevate the training and performance of text recognition, text detection, and optical character recognition models within the realm of the Bahasa language. Your journey to enhanced language understanding and processing starts here.

  12. T

    sun397

    • tensorflow.org
    • huggingface.co
    Updated Jun 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). sun397 [Dataset]. https://www.tensorflow.org/datasets/catalog/sun397
    Explore at:
    Dataset updated
    Jun 1, 2024
    Description

    The database contains 108,753 images of 397 categories, used in the Scene UNderstanding (SUN) benchmark. The number of images varies across categories, but there are at least 100 images per category.

    Several configs of the dataset are made available through TFDS:

    • A custom (random) partition of the whole dataset with 76,128 training images, 10,875 validation images and 21,750 test images. Images have been resized to have at most 120,000 pixels, and encoded as JPEG with quality of 72.

    • "standard-part1-120k", "standard-part2-120k", ..., "standard-part10-120k": Each of the 10 official train/test partitions with 50 images per class in each split. Images have been resized to have at most 120,000 pixels, and encoded as JPEG with quality of 72.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('sun397', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/sun397-tfds-4.0.0.png" alt="Visualization" width="500px">

  13. Custom Images for Model Evaluation

    • kaggle.com
    zip
    Updated Nov 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Evil Spirit05 (2025). Custom Images for Model Evaluation [Dataset]. https://www.kaggle.com/datasets/evilspirit05/custom-image-dataset/code
    Explore at:
    zip(536062498 bytes)Available download formats
    Dataset updated
    Nov 15, 2025
    Authors
    Evil Spirit05
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description
    This dataset consists of a diverse collection of custom-downloaded images, including various categories such as dogs, cats, and other objects. It is designed to facilitate the performance evaluation and benchmarking of image classification models. With a rich assortment of images, this dataset aims to provide a comprehensive resource for testing and improving model accuracy in different scenarios.
    

    Dataset Details

    • Categories: Dogs, Cats, and Other Objects
    • Image Formats: JPG, PNG, etc.

    Features

    • Diverse Collection: Includes a wide range of images to ensure robust model performance evaluation.
    • Balanced Categories: A well-distributed number of images across primary categories to prevent bias in model testing.
    • High-Quality Images: Images are sourced and curated to maintain high quality for accurate performance assessment.

    Use Cases

    • Model Benchmarking: Evaluate and compare the performance of image classification algorithms.
    • Algorithm Tuning: Fine-tune models using a varied set of images to improve accuracy and generalization.
    • Feature Testing: Test different image features and preprocessing techniques to optimize model performance.

    Access

    You can download and explore the dataset from Kaggle. Please ensure to provide feedback and share any insights or improvements that can enhance the dataset’s utility for the community.
    
  14. F

    Polish Product Image OCR Dataset

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Polish Product Image OCR Dataset [Dataset]. https://www.futurebeeai.com/dataset/ocr-dataset/polish-product-image-ocr-dataset
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introducing the Polish Product Image Dataset - a diverse and comprehensive collection of images meticulously curated to propel the advancement of text recognition and optical character recognition (OCR) models designed specifically for the Polish language.

    Dataset Contain & Diversity

    Containing a total of 2000 images, this Polish OCR dataset offers diverse distribution across different types of front images of Products. In this dataset, you'll find a variety of text that includes product names, taglines, logos, company names, addresses, product content, etc. Images in this dataset showcase distinct fonts, writing formats, colors, designs, and layouts.

    To ensure the diversity of the dataset and to build a robust text recognition model we allow limited (less than five) unique images from a single resource. Stringent measures have been taken to exclude any personally identifiable information (PII) and to ensure that in each image a minimum of 80% of space contains visible Polish text.

    Images have been captured under varying lighting conditions – both day and night – along with different capture angles and backgrounds, to build a balanced OCR dataset. The collection features images in portrait and landscape modes.

    All these images were captured by native Polish people to ensure the text quality, avoid toxic content and PII text. We used the latest iOS and Android mobile devices above 5MP cameras to click all these images to maintain the image quality. In this training dataset images are available in both JPEG and HEIC formats.

    Metadata

    Along with the image data, you will also receive detailed structured metadata in CSV format. For each image, it includes metadata like image orientation, county, language, and device information. Each image is properly renamed corresponding to the metadata.

    The metadata serves as a valuable tool for understanding and characterizing the data, facilitating informed decision-making in the development of Polish text recognition models.

    Update & Custom Collection

    We're committed to expanding this dataset by continuously adding more images with the assistance of our native Polish crowd community.

    If you require a custom product image OCR dataset tailored to your guidelines or specific device distribution, feel free to contact us. We're equipped to curate specialized data to meet your unique needs.

    Furthermore, we can annotate or label the images with bounding box or transcribe the text in the image to align with your specific project requirements using our crowd community.

    License

    This Image dataset, created by FutureBeeAI, is now available for commercial use.

    Conclusion:

    Leverage the power of this product image OCR dataset to elevate the training and performance of text recognition, text detection, and optical character recognition models within the realm of the Polish language. Your journey to enhanced language understanding and processing starts here.

  15. t

    Xu Ma, Xiyang Dai, Yue Bai, Yizhou Wang, Yun Fu (2024). Dataset: StarNet....

    • service.tib.eu
    Updated Dec 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Xu Ma, Xiyang Dai, Yue Bai, Yizhou Wang, Yun Fu (2024). Dataset: StarNet. https://doi.org/10.57702/5vwcq7nw [Dataset]. https://service.tib.eu/ldmservice/dataset/starnet
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    The dataset used in the paper is a custom dataset for image classification tasks, featuring images with varying sizes and resolutions.

  16. c

    Food Image Classification Dataset

    • cubig.ai
    zip
    Updated Aug 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2024). Food Image Classification Dataset [Dataset]. https://cubig.ai/store/products/40/food-image-classification-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 1, 2024
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction • This dataset provides images and information on 10 types of foods commonly consumed by Koreans. The data is classified into 10 food classes: ""Ramen"", ""Gimbap"", ""Curry"", ""Army Stew (Budae Jjigae)"", ""Corn Soup (Corn Soup)"", ""Jajangmyeon (Black Bean Noodles)"", ""Boiled Dumplings (Mul Mandu)"", ""Fried Dumplings (Gun Mandu)"", ""White Rice (Baekmi)"", and ""Red Bean Porridge (Patjuk)".

    2) Data Utilization (1) Food Data has characteristics that: • The dataset includes images of each food item, which are useful for understanding and analyzing the characteristics of these foods. • It provides information that helps in understanding the diverse types and features of Korean cuisine. (2) Food Data can be used to: • Food analysis and trend research: Professionals in the food industry can utilize this data to analyze current food trends and develop new menus. • Development of recommendation systems: It can be used to develop personalized food recommendation systems on e-commerce platforms and food delivery apps, providing tailored recommendations based on customer preferences. • Machine learning and computer vision research: It can be utilized to train and evaluate machine learning models in areas such as food image classification and recognition. • Cultural research: Analyzing various Korean foods can be utilized for cultural research and educational materials. • Digital content creation: It can be used to create digital content for blogs, social media, online magazines, etc., to introduce and promote various Korean foods."

  17. Adapting face recognition to the masked world: leveraging deep attention...

    • tandf.figshare.com
    txt
    Updated Sep 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Himani Trivedi; Mahesh Goyani (2024). Adapting face recognition to the masked world: leveraging deep attention networks with a custom dataset [Dataset]. http://doi.org/10.6084/m9.figshare.25400212.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Sep 10, 2024
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    Himani Trivedi; Mahesh Goyani
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In the era of COVID-19, past face recognition algorithms' performance is downgraded due to the partial occlusion of face mask. A new Indian face image dataset has been proposed titled, Handcrafted Indian Face (HIF) dataset, addressing the issues, viz. variegated illumination, pose, and partial occlusion conditions. It bridges the gap between the performance of the DL models used, pre and post COVID-19 effect. A novel idea for choosing the train-test sample has been presented in the paper, which improves the accuracy on existing state of art DL models. In this paper, a new DL architecture has been proposed named the InceptBlock Enhanced Attention Fusion Network (IBEAFNet) which consists of the combination of ECBAM (Enhanced Convolution Block Attention Module) and InceptionV3 architecture. The proposed architecture's attention layer placement allows it to suppress less relevant mask regions of face, while emphasizing on significant fine and coarse level features with reduced complexity. IBEAFNet is trained and tested on two existing datasets, viz. Casia & Yale (including simulated masked images) and the proposed HIF dataset. The performance of IBEAFNet is compared with the results fetched by changing the attention layers in IBEAFNet with the blocks of SENet and CBAM. IBEAFNet outperformed the state-of-art models with the accuracy of 91.00%, 89.5%, and 93.00% on CASIA, Yale, and HIF dataset, respectively.

  18. d

    Textile Microtexture Dataset: 100+ Annotated Images & 40+ Quantitative...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Oct 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aruswamy, Nithin (2025). Textile Microtexture Dataset: 100+ Annotated Images & 40+ Quantitative Classification Features [Dataset]. http://doi.org/10.7910/DVN/KUDCQX
    Explore at:
    Dataset updated
    Oct 28, 2025
    Dataset provided by
    Harvard Dataverse
    Authors
    Aruswamy, Nithin
    Description

    This dataset comprises high-resolution images of diverse textile fabrics alongside a comprehensive CSV file of microtexture features extracted for machine learning analysis. Each image is paired with quantitative descriptors, including Haralick, LBP, fractal, wavelet, and edge-based metrics. This enables detailed classification and defect detection tasks. The dataset includes multiple fabric types (ex., cotton, wool, polyester, silk) and covers a range of microtexture patterns, with all images captured using a custom imaging rig. Custom imaging rig consists of Raspberry Pi High Quality Camera with a 100X Lens. Feature extraction methods and parameters are documented for reproducibility. This resource supports research in computer vision, textile engineering, and automated quality assessment.

  19. The ORBIT (Object Recognition for Blind Image Training)-India Dataset

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    Updated Apr 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gesu India; Gesu India; Martin Grayson; Martin Grayson; Daniela Massiceti; Daniela Massiceti; Cecily Morrison; Cecily Morrison; Simon Robinson; Simon Robinson; Jennifer Pearson; Jennifer Pearson; Matt Jones; Matt Jones (2025). The ORBIT (Object Recognition for Blind Image Training)-India Dataset [Dataset]. http://doi.org/10.5281/zenodo.12608444
    Explore at:
    Dataset updated
    Apr 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Gesu India; Gesu India; Martin Grayson; Martin Grayson; Daniela Massiceti; Daniela Massiceti; Cecily Morrison; Cecily Morrison; Simon Robinson; Simon Robinson; Jennifer Pearson; Jennifer Pearson; Matt Jones; Matt Jones
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    India
    Description

    The ORBIT (Object Recognition for Blind Image Training) -India Dataset is a collection of 105,243 images of 76 commonly used objects, collected by 12 individuals in India who are blind or have low vision. This dataset is an "Indian subset" of the original ORBIT dataset [1, 2], which was collected in the UK and Canada. In contrast to the ORBIT dataset, which was created in a Global North, Western, and English-speaking context, the ORBIT-India dataset features images taken in a low-resource, non-English-speaking, Global South context, a home to 90% of the world’s population of people with blindness. Since it is easier for blind or low-vision individuals to gather high-quality data by recording videos, this dataset, like the ORBIT dataset, contains images (each sized 224x224) derived from 587 videos. These videos were taken by our data collectors from various parts of India using the Find My Things [3] Android app. Each data collector was asked to record eight videos of at least 10 objects of their choice.

    Collected between July and November 2023, this dataset represents a set of objects commonly used by people who are blind or have low vision in India, including earphones, talking watches, toothbrushes, and typical Indian household items like a belan (rolling pin), and a steel glass. These videos were taken in various settings of the data collectors' homes and workspaces using the Find My Things Android app.

    The image dataset is stored in the ‘Dataset’ folder, organized by folders assigned to each data collector (P1, P2, ...P12) who collected them. Each collector's folder includes sub-folders named with the object labels as provided by our data collectors. Within each object folder, there are two subfolders: ‘clean’ for images taken on clean surfaces and ‘clutter’ for images taken in cluttered environments where the objects are typically found. The annotations are saved inside a ‘Annotations’ folder containing a JSON file per video (e.g., P1--coffee mug--clean--231220_084852_coffee mug_224.json) that contains keys corresponding to all frames/images in that video (e.g., "P1--coffee mug--clean--231220_084852_coffee mug_224--000001.jpeg": {"object_not_present_issue": false, "pii_present_issue": false}, "P1--coffee mug--clean--231220_084852_coffee mug_224--000002.jpeg": {"object_not_present_issue": false, "pii_present_issue": false}, ...). The ‘object_not_present_issue’ key is True if the object is not present in the image, and the ‘pii_present_issue’ key is True, if there is a personally identifiable information (PII) present in the image. Note, all PII present in the images has been blurred to protect the identity and privacy of our data collectors. This dataset version was created by cropping images originally sized at 1080 × 1920; therefore, an unscaled version of the dataset will follow soon.

    This project was funded by the Engineering and Physical Sciences Research Council (EPSRC) Industrial ICASE Award with Microsoft Research UK Ltd. as the Industrial Project Partner. We would like to acknowledge and express our gratitude to our data collectors for their efforts and time invested in carefully collecting videos to build this dataset for their community. The dataset is designed for developing few-shot learning algorithms, aiming to support researchers and developers in advancing object-recognition systems. We are excited to share this dataset and would love to hear from you if and how you use this dataset. Please feel free to reach out if you have any questions, comments or suggestions.

    REFERENCES:

    1. Daniela Massiceti, Lida Theodorou, Luisa Zintgraf, Matthew Tobias Harris, Simone Stumpf, Cecily Morrison, Edward Cutrell, and Katja Hofmann. 2021. ORBIT: A real-world few-shot dataset for teachable object recognition collected from people who are blind or low vision. DOI: https://doi.org/10.25383/city.14294597

    2. microsoft/ORBIT-Dataset. https://github.com/microsoft/ORBIT-Dataset

    3. Linda Yilin Wen, Cecily Morrison, Martin Grayson, Rita Faia Marques, Daniela Massiceti, Camilla Longden, and Edward Cutrell. 2024. Find My Things: Personalized Accessibility through Teachable AI for People who are Blind or Low Vision. In Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems (CHI EA '24). Association for Computing Machinery, New York, NY, USA, Article 403, 1–6. https://doi.org/10.1145/3613905.3648641

  20. d

    200K+ Landmark Images | AI Training Data | Annotated imagery data for AI |...

    • datarade.ai
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Seeds, 200K+ Landmark Images | AI Training Data | Annotated imagery data for AI | Object & Scene Detection | Global Coverage [Dataset]. https://datarade.ai/data-products/120k-landmark-images-ai-training-data-annotated-imagery-data-seeds
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset authored and provided by
    Data Seeds
    Area covered
    Rwanda, French Guiana, Jamaica, Kosovo, Egypt, Wallis and Futuna, Lesotho, Greece, American Samoa, Niger
    Description

    This dataset features over 200,000 high-quality images of historical and cultural landmarks sourced from photographers worldwide. Designed to support AI and machine learning applications, it provides a diverse and richly annotated collection of landmark imagery.

    Key Features: 1. Comprehensive Metadata: the dataset includes full EXIF data, detailing camera settings such as aperture, ISO, shutter speed, and focal length. Additionally, each image is pre-annotated with object and scene detection metadata, making it ideal for tasks like classification, detection, and segmentation. Popularity metrics, derived from engagement on our proprietary platform, are also included.

    1. Unique Sourcing Capabilities: the images are collected through a proprietary gamified platform for photographers. Competitions focused on landmark photography ensure fresh, relevant, and high-quality submissions. Custom datasets can be sourced on-demand within 72 hours, allowing for specific requirements such as particular landmarks or geographic regions to be met efficiently.

    2. Global Diversity: photographs have been sourced from contributors in over 100 countries, ensuring a vast array of landmark types and cultural contexts. The images feature varied settings, including historical monuments, iconic structures, natural landmarks, and urban architecture, providing an unparalleled level of diversity.

    3. High-Quality Imagery: the dataset includes images with resolutions ranging from standard to high-definition to meet the needs of various projects. Both professional and amateur photography styles are represented, offering a mix of artistic and practical perspectives suitable for a variety of applications.

    4. Popularity Scores: each image is assigned a popularity score based on its performance in GuruShots competitions. This unique metric reflects how well the image resonates with a global audience, offering an additional layer of insight for AI models focused on user preferences or engagement trends.

    5. AI-Ready Design: this dataset is optimized for AI applications, making it ideal for training models in tasks such as image recognition, classification, and segmentation. It is compatible with a wide range of machine learning frameworks and workflows, ensuring seamless integration into your projects.

    6. Licensing & Compliance: the dataset complies fully with data privacy regulations and offers transparent licensing for both commercial and academic use.

    Use Cases: 1. Training AI systems for landmark recognition and geolocation. 2. Enhancing navigation and travel AI applications. 3. Building datasets for educational, tourism, and augmented reality tools. 4.Supporting cultural heritage preservation and analysis through AI-powered solutions.

    This dataset offers a comprehensive, diverse, and high-quality resource for training AI and ML models, tailored to deliver exceptional performance for your projects. Customizations are available to suit specific project needs. Contact us to learn more!

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Mathias Viborg (2023). PASCALRAW-derived Classification Dataset [Dataset]. https://www.kaggle.com/datasets/mathiasviborg/pascalraw-derived-object-cropped-dataset
Organization logo

PASCALRAW-derived Classification Dataset

Dataset: Enabling RAW Image Classification Using Existing RGB Classifiers

Explore at:
zip(8714509767 bytes)Available download formats
Dataset updated
Oct 5, 2023
Authors
Mathias Viborg
License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Description

Further desciption of dataset and original useage can be found in the article: Enabling RAW Image Classification Using Existing RGB Classifiers.

The dataset is consisting of unprocessed RAW sensor data cropped from the PASCALRAW dataset for RAW image classification.

Data distribution is a follows: | Backpack |Bicycle |Car|Person| | --- | --- | --- | --- | | 3569| 5142 | 4856 | 4864 |



To use the full-resolution RAW images from the datasets as inputs to a classifier, downscaling would be required, which would artificially process the original image, resulting in a lower-quality representation. This would be counter-intuitive for this work since one of the primary reasons for using RAW images is that they contain unaltered capture information. For these reasons, a more relevant dataset for investigating the hypothesis consists of small, unaltered RAW images. Since such a dataset is not publicly available, we create a custom dataset based on classes within the PASCALRAW dataset. The PASCALRAW dataset has the added benefit of being captured using a Nikon D3200 DSLR camera and therefore also being stored in the same Nikon Raw Image file format (NEF), removing the need for considerations for differences in capture data and image formats.

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F13804035%2Fec1031a81d4d46274c3db05ad31a087b%2FUnavngivet%20diagram.png?generation=1702328479100552&alt=media" alt="">

Figure 2 is a grid of crops overlapping the semantic mask withat least 50 % of its pixels. Figure 3 is an overview of the first four extracted samples from the object presented in Figure 2. Shown in Original RGB and Original RAW (RAW displayed using Matplotlib).

Our dataset is created by extracting 448x448-sized cropped images of people, bicycles, cars and backpacks that are present in each full-resolution RAW- and corresponding RGB images within the PASCALRAW dataset. The dataset-generating process utilizes YOLOv5 to apply instance segmentation to extract masks of each of the desired four classes that are present in each RGB image. By then placing a grid of cropped images of size 448x448 over each object's entire bounding box, each crop overlapping itself with a stride of 224 pixels along its x- and y-direction, see Figure 2, and only storing those crops that overlap the semantic mask with at least 50 \% of its pixels, data samples from both the RGB- and corresponding RAW image are extracted, see Figure 3.

Search
Clear search
Close search
Google apps
Main menu