100+ datasets found
  1. yolo format dataset

    • kaggle.com
    zip
    Updated Oct 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    meer atif magsi (2023). yolo format dataset [Dataset]. https://www.kaggle.com/datasets/meeratif/yolo-format-data
    Explore at:
    zip(256907308 bytes)Available download formats
    Dataset updated
    Oct 16, 2023
    Authors
    meer atif magsi
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset Highlights:

    1. Diverse Image Collection: Our dataset encompasses a wide range of general images covering various categories such as objects, scenes, people, and more. The images are carefully curated to offer a rich source of visual data.

    2. Sindhi Language Titles: One of the distinctive features of our dataset is the inclusion of Sindhi language titles for each image.

    3. Annotations in YOLO Format: To facilitate your object detection tasks, we have meticulously annotated the images in YOLO format, making it compatible with the YOLOv3 or YOLOv4 models. This ensures that you can jump right into training your model without the hassle of converting annotations.

    4. Comprehensive Metadata: Each image in the dataset is accompanied by a YAML file providing additional metadata, including information about the image source, date of capture, and any relevant context that may be useful for your research.

    By publishing this YOLO-style dataset with Sindhi language titles, we aim to contribute to the machine learning and computer vision community, fostering innovation and inclusivity in the field. We encourage you to explore, experiment, and create cutting-edge models using this dataset, and we look forward to seeing the incredible projects that emerge from it.

  2. object-detection-person-data

    • kaggle.com
    zip
    Updated Mar 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ritesh1420 (2024). object-detection-person-data [Dataset]. https://www.kaggle.com/datasets/ritesh1420/yolov8-person-data/discussion
    Explore at:
    zip(436540956 bytes)Available download formats
    Dataset updated
    Mar 19, 2024
    Authors
    ritesh1420
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The dataset is structured for person object detection tasks, containing separate directories for training, validation, and testing. Each split has an images folder with corresponding images and a labels folder with annotation files.

    Train Set: Contains images and annotations for model training.

    Validation Set: Includes images and labels for model evaluation during training.

    Test Set: Provides unseen images and labels for final model performance assessment.

    Each annotation file (TXT format) corresponds to an image and likely contains bounding box coordinates and class labels. This structure follows standard object detection dataset formats, ensuring easy integration with detection models like yolo,RT-DETR.

    Dataset Structure

    ๐Ÿ“‚ dataset/ โ”œโ”€โ”€ ๐Ÿ“ train/ โ”‚ โ”œโ”€โ”€ ๐Ÿ“‚ images/ โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ–ผ image1.jpg (Training image) โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ–ผ image2.jpg (Training image) โ”‚ โ”œโ”€โ”€ ๐Ÿ“‚ labels/ โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ“„ image1.txt (Annotation for image1.jpg) โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ“„ image2.txt (Annotation for image2.jpg) โ”‚ โ”œโ”€โ”€ ๐Ÿ“ val/ โ”‚ โ”œโ”€โ”€ ๐Ÿ“‚ images/ โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ–ผ image3.jpg (Validation image) โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ–ผ image4.jpg (Validation image) โ”‚ โ”œโ”€โ”€ ๐Ÿ“‚ labels/ โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ“„ image3.txt (Annotation for image3.jpg) โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ“„ image4.txt (Annotation for image4.jpg) โ”‚ โ”œโ”€โ”€ ๐Ÿ“ test/ โ”‚ โ”œโ”€โ”€ ๐Ÿ“‚ images/ โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ–ผ image5.jpg (Test image) โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ–ผ image6.jpg (Test image) โ”‚ โ”œโ”€โ”€ ๐Ÿ“‚ labels/ โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ“„ image5.txt (Annotation for image5.jpg) โ”‚ โ”‚ โ”œโ”€โ”€ ๐Ÿ“„ image6.txt (Annotation for image6.jpg)

  3. Image Annotation Services | Image Labeling for AI & ML |Computer Vision...

    • datarade.ai
    Updated Dec 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2023). Image Annotation Services | Image Labeling for AI & ML |Computer Vision Data| Annotated Imagery Data [Dataset]. https://datarade.ai/data-products/nexdata-image-annotation-services-ai-assisted-labeling-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Dec 29, 2023
    Dataset authored and provided by
    Nexdata
    Area covered
    El Salvador, Austria, Romania, Bulgaria, Latvia, Bosnia and Herzegovina, Japan, Hong Kong, Grenada, India
    Description
    1. Overview We provide various types of Annotated Imagery Data annotation services, including:
    2. Bounding box
    3. Polygon
    4. Segmentation
    5. Polyline
    6. Key points
    7. Image classification
    8. Image description ...
    9. Our Capacity
    10. Platform: Our platform supports human-machine interaction and semi-automatic labeling, increasing labeling efficiency by more than 30% per annotator.It has successfully been applied to nearly 5,000 projects.
    • Annotation Tools: Nexdata's platform integrates 30 sets of annotation templates, covering audio, image, video, point cloud and text.

    -Secure Implementation: NDA is signed to gurantee secure implementation and Annotated Imagery Data is destroyed upon delivery.

    -Quality: Multiple rounds of quality inspections ensures high quality data output, certified with ISO9001

    1. About Nexdata Nexdata has global data processing centers and more than 20,000 professional annotators, supporting on-demand data annotation services, such as speech, image, video, point cloud and Natural Language Processing (NLP) Data, etc. Please visit us at https://www.nexdata.ai/computerVisionTraining?source=Datarade
  4. R

    Yolo Coco Data Format Dataset

    • universe.roboflow.com
    zip
    Updated Oct 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Md Abdur Rob (2025). Yolo Coco Data Format Dataset [Dataset]. https://universe.roboflow.com/md-abdur-rob-x4zgr/yolo-coco-data-format/model/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 24, 2025
    Dataset authored and provided by
    Md Abdur Rob
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Variables measured
    Objects Bounding Boxes
    Description

    YOLO Coco Data Format

    ## Overview
    
    YOLO Coco Data Format is a dataset for object detection tasks - it contains Objects annotations for 692 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [Public Domain license](https://creativecommons.org/licenses/Public Domain).
    
  5. m

    Annotated UAV Image Dataset for Object Detection Using LabelImg and Roboflow...

    • data.mendeley.com
    Updated Aug 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anindita Das (2025). Annotated UAV Image Dataset for Object Detection Using LabelImg and Roboflow [Dataset]. http://doi.org/10.17632/fwg6pt6ckd.1
    Explore at:
    Dataset updated
    Aug 21, 2025
    Authors
    Anindita Das
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset consists of drone images that were obtained for agricultural field monitoring to detect weeds and crops through computer vision and machine learning approaches. The images were obtained through high-resolution UAVs and annotated using the LabelImg and Roboflow tool. Each image has a corresponding YOLO annotation file that contains bounding box information and class IDs for detected objects. The dataset includes:

    Original images in .jpg format with a resolution of 585 ร— 438 pixels.

    Annotation files (.txt) corresponding to each image, following the YOLO format: class_id x_center y_center width height.

    A classes.txt file listing the object categories used in labeling (e.g., Weed, Crop).

    The dataset is intended for use in machine learning model development, particularly for precision agriculture, weed detection, and plant health monitoring. It can be directly used for training YOLOv7 and other object detection models.

  6. u

    Data from: Variable Message Signal annotated images for object detection

    • portalcientifico.universidadeuropea.com
    • zenodo.org
    Updated 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    De Las Heras De Matรญas, Gonzalo; Sรกnchez-Soriano, Javier; Puertas, Enrique; De Las Heras De Matรญas, Gonzalo; Sรกnchez-Soriano, Javier; Puertas, Enrique (2022). Variable Message Signal annotated images for object detection [Dataset]. https://portalcientifico.universidadeuropea.com/documentos/668fc42eb9e7c03b01bd5af8
    Explore at:
    Dataset updated
    2022
    Authors
    De Las Heras De Matรญas, Gonzalo; Sรกnchez-Soriano, Javier; Puertas, Enrique; De Las Heras De Matรญas, Gonzalo; Sรกnchez-Soriano, Javier; Puertas, Enrique
    Description

    If you use this dataset, please cite this paper: Puertas, E.; De-Las-Heras, G.; Sรกnchez-Soriano, J.; Fernรกndez-Andrรฉs, J. Dataset: Variable Message Signal Annotated Images for Object Detection. Data 2022, 7, 41. https://doi.org/10.3390/data7040041 This dataset consists of Spanish road images taken from inside a vehicle, as well as annotations in XML files in PASCAL VOC format that indicate the location of Variable Message Signals within them. Also, a CSV file is attached with information regarding the geographic position, the folder where the image is located, and the text in Spanish. This can be used to train supervised learning computer vision algorithms, such as convolutional neural networks. Throughout this work, the process followed to obtain the dataset, image acquisition, and labeling, and its specifications are detailed. The dataset is constituted of 1216 instances, 888 positives, and 328 negatives, in 1152 jpg images with a resolution of 1280x720 pixels. These are divided into 576 real images and 576 images created from the data-augmentation technique. The purpose of this dataset is to help in road computer vision research since there is not one specifically for VMSs. The folder structure of the dataset is as follows: vms_dataset/ data.csv real_images/ imgs/ annotations/ data-augmentation/ imgs/ annotations/ In which: data.csv: Each row contains the following information separated by commas (,): image_name, x_min, y_min, x_max, y_max, class_name, lat, long, folder, text. real_images: Images extracted directly from the videos. data-augmentation: Images created using data-augmentation imgs: Image files in .jpg format. annotations: Annotation files in .xml format.

  7. Human Tracking & Object Detection Dataset

    • kaggle.com
    zip
    Updated Jul 27, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unique Data (2023). Human Tracking & Object Detection Dataset [Dataset]. https://www.kaggle.com/datasets/trainingdatapro/people-tracking
    Explore at:
    zip(46156442 bytes)Available download formats
    Dataset updated
    Jul 27, 2023
    Authors
    Unique Data
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    People Tracking & Object Detection dataset

    The dataset comprises of annotated video frames from positioned in a public space camera. The tracking of each individual in the camera's view has been achieved using the rectangle tool in the Computer Vision Annotation Tool (CVAT).

    The dataset is created on the basis of Real-Time Traffic Video Dataset

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2Fc5a8dc4f63fe85c64a5fead10fad3031%2Fpersons_gif.gif?generation=1690705558283123&alt=media" alt="">

    Dataset Structure

    • The images directory houses the original video frames, serving as the primary source of raw data.
    • The annotations.xml file provides the detailed annotation data for the images.
    • The boxes directory contains frames that visually represent the bounding box annotations, showing the locations of the tracked individuals within each frame. These images can be used to understand how the tracking has been implemented and to visualize the marked areas for each individual.

    Data Format

    The annotations are represented as rectangle bounding boxes that are placed around each individual. Each bounding box annotation contains the position ( xtl-ytl-xbr-ybr coordinates ) for the respective box within the frame. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F4f274551e10db2754c4d8a16dff97b33%2Fcarbon%20(10).png?generation=1687776281548084&alt=media" alt="">

    ๐Ÿ‘‰ Legally sourced datasets and carefully structured for AI training and model development. Explore samples from our dataset of 95,000+ human images & videos - Full dataset

    ๐Ÿš€ You can learn more about our high-quality unique datasets here

    keywords: multiple people tracking, human detection dataset, object detection dataset, people tracking dataset, tracking human object interactions, human Identification tracking dataset, people detection annotations, detecting human in a crowd, human trafficking dataset, deep learning object tracking, multi-object tracking dataset, labeled web tracking dataset, large-scale object tracking dataset

  8. Microsoft COCO 2017 Object Detection Dataset - raw

    • public.roboflow.com
    zip
    Updated Feb 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Microsoft (2025). Microsoft COCO 2017 Object Detection Dataset - raw [Dataset]. https://public.roboflow.com/object-detection/microsoft-coco-subset/2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Feb 1, 2025
    Dataset authored and provided by
    Microsofthttp://microsoft.com/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Bounding Boxes of coco-objects
    Description

    This is the full 2017 COCO object detection dataset (train and valid), which is a subset of the most recent 2020 COCO object detection dataset.

    COCO is a large-scale object detection, segmentation, and captioning dataset of many object types easily recognizable by a 4-year-old. The data is initially collected and published by Microsoft. The original source of the data is here and the paper introducing the COCO dataset is here.

  9. Vehicle Detection Image Dataset

    • kaggle.com
    zip
    Updated Apr 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Parisa Karimi Darabi (2024). Vehicle Detection Image Dataset [Dataset]. https://www.kaggle.com/datasets/pkdarabi/vehicle-detection-image-dataset
    Explore at:
    zip(274761684 bytes)Available download formats
    Dataset updated
    Apr 9, 2024
    Authors
    Parisa Karimi Darabi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Vehicle Detection Image Dataset

    Introduction

    Welcome to the Vehicle Detection Image Dataset! This dataset is meticulously curated for object detection and tracking tasks, with a specific focus on vehicle detection. It serves as a valuable resource for researchers, developers, and enthusiasts seeking to advance the capabilities of computer vision systems.

    Objective

    The primary aim of this dataset is to facilitate precise object detection tasks, particularly in identifying and tracking vehicles within images. Whether you are engaged in academic research, developing commercial applications, or exploring the frontiers of computer vision, this dataset provides a solid foundation for your projects.

    Preprocessing and Augmentation

    Both versions of the dataset undergo essential preprocessing steps, including resizing and orientation adjustments. Additionally, the Apply_Grayscale version undergoes augmentation to introduce grayscale variations, thereby enriching the dataset and improving model robustness.

    1. Apply_Grayscale

    • This version comprises grayscale images and is further augmented to enhance the diversity of training data.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F14850461%2F4f23bd8094c892d1b6986c767b42baf4%2Fv2.png?generation=1712264632232641&alt=media" alt="">

    2. No_Apply_Grayscale

    • This version includes images without applying grayscale augmentation.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F14850461%2Fbfb10eb2a4db31a62eb4615da824c387%2Fdetails_v1.png?generation=1712264660626280&alt=media" alt="">

    Data Formats

    To ensure compatibility with a wide range of object detection frameworks and tools, each version of the dataset is available in multiple formats:

    1. COCO
    2. YOLOv8
    3. YOLOv9
    4. TensorFlow

    These formats facilitate seamless integration into various machine learning frameworks and libraries, empowering users to leverage their preferred development environments.

    Real-Time Object Detection

    In addition to image datasets, we also provide a video for real-time object detection evaluation. This video allows users to test the performance of their models in real-world scenarios, providing invaluable insights into the effectiveness of their detection algorithms.

    Getting Started

    To begin exploring the Vehicle Detection Image Dataset, simply download the version and format that best suits your project requirements. Whether you are an experienced practitioner or just embarking on your journey in computer vision, this dataset offers a valuable resource for advancing your understanding and capabilities in object detection and tracking tasks.

    Citation

    If you utilize this dataset in your work, we kindly request that you cite the following:

    Parisa Karimi Darabi. (2024). Vehicle Detection Image Dataset: Suitable for Object Detection and tracking Tasks. Retrieved from https://www.kaggle.com/datasets/pkdarabi/vehicle-detection-image-dataset/

    Feedback and Contributions

    I welcome feedback and contributions from the Kaggle community to continually enhance the quality and usability of this dataset. Please feel free to reach out if you have suggestions, questions, or additional data and annotations to contribute. Together, we can drive innovation and progress in computer vision.

  10. R

    Microsoft Coco Dataset

    • universe.roboflow.com
    zip
    Updated Jul 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Microsoft (2025). Microsoft Coco Dataset [Dataset]. https://universe.roboflow.com/microsoft/coco/model/3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 23, 2025
    Dataset authored and provided by
    Microsoft
    Variables measured
    Object Bounding Boxes
    Description

    Microsoft Common Objects in Context (COCO) Dataset

    The Common Objects in Context (COCO) dataset is a widely recognized collection designed to spur object detection, segmentation, and captioning research. Created by Microsoft, COCO provides annotations, including object categories, keypoints, and more. The model it a valuable asset for machine learning practitioners and researchers. Today, many model architectures are benchmarked against COCO, which has enabled a standard system by which architectures can be compared.

    While COCO is often touted to comprise over 300k images, it's pivotal to understand that this number includes diverse formats like keypoints, among others. Specifically, the labeled dataset for object detection stands at 123,272 images.

    The full object detection labeled dataset is made available here, ensuring researchers have access to the most comprehensive data for their experiments. With that said, COCO has not released their test set annotations, meaning the test data doesn't come with labels. Thus, this data is not included in the dataset.

    The Roboflow team has worked extensively with COCO. Here are a few links that may be helpful as you get started working with this dataset:

  11. R

    Data Format Dataset

    • universe.roboflow.com
    zip
    Updated Jul 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artefact (2025). Data Format Dataset [Dataset]. https://universe.roboflow.com/artefact-l8hot/data-format-qp550/model/3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 26, 2025
    Dataset authored and provided by
    Artefact
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Defects Bounding Boxes
    Description

    Data Format

    ## Overview
    
    Data Format is a dataset for object detection tasks - it contains Defects annotations for 1,502 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  12. R

    Data from: Product Types Dataset

    • universe.roboflow.com
    zip
    Updated Mar 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matija workspace (2025). Product Types Dataset [Dataset]. https://universe.roboflow.com/matija-workspace/product-types-ueubb
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 27, 2025
    Dataset authored and provided by
    Matija workspace
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Square Circle Bounding Boxes
    Description

    Product Types

    ## Overview
    
    Product Types is a dataset for object detection tasks - it contains Square Circle annotations for 4,788 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  13. Nexdata | Handwriting OCR Data On Board of 8 Languages | 9,574 Images

    • datarade.ai
    Updated Nov 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2025). Nexdata | Handwriting OCR Data On Board of 8 Languages | 9,574 Images [Dataset]. https://datarade.ai/data-products/nexdata-handwriting-ocr-data-on-board-of-8-languages-9-57-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Nov 15, 2025
    Dataset authored and provided by
    Nexdata
    Area covered
    Japan, United Kingdom, Portugal, Netherlands, United States of America, France, Spain, Germany, Italy
    Description

    9,574 Images โ€“ Handwriting OCR Data On Board of 8 Languages. The language distribution includes English, Spanish and Portuguese, etc. The data diversity includes multiple collecting scenes, multiple text carriers, multiple languages, multiple photographic angles. The collecting angeles are looking up angle, eye-level angle and looking down angle. In terms of annotation, row-level quadrilateral (polygon) annotation and content transcription are performed on the text. The dataset can be used for tasks such as handwriting OCR.

    Data size

    9,574 images, 243,240 bounding boxes

    Language distribution

    English, Spanish, Portuguese, French, German, Japanese, Italian and Dutch

    Collecting environment

    black boards, white boards, green boards

    Device

    cellphone

    Photographic angle

    eye-level angle, looking down angle, looking up angle

    Data format

    the image data format is .jpg and other common image formats, the annotation file data format is.json

    Annotation content

    line-level quadrilateral (polygon) bounding box annotation and transcription for the texts

    Accuracy rate

    the error bound of each vertex of quadrilateral bounding box is within 5 pixels, which is a qualified annotation, the accuracy of bounding boxes is not less than 95%; the texts transcription accuracy is not less than 95%

  14. R

    Tfrecord Data Format Dataset

    • universe.roboflow.com
    zip
    Updated Jun 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TFRecord data format (2022). Tfrecord Data Format Dataset [Dataset]. https://universe.roboflow.com/tfrecord-data-format/tfrecord-data-format/model/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 1, 2022
    Dataset authored and provided by
    TFRecord data format
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Weapons And Animals Bounding Boxes
    Description

    Tfrecord Data Format

    ## Overview
    
    Tfrecord Data Format is a dataset for object detection tasks - it contains Weapons And Animals annotations for 1,333 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  15. Multi-race Human Body Data | 300,000 ID | Computer Vision Data| Image/Video...

    • datarade.ai
    Updated Mar 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2024). Multi-race Human Body Data | 300,000 ID | Computer Vision Data| Image/Video Deep Learning (DL) Data [Dataset]. https://datarade.ai/data-products/nexdata-multi-race-human-body-data-300-000-id-image-vi-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Mar 16, 2024
    Dataset authored and provided by
    Nexdata
    Area covered
    Albania, Japan, El Salvador, Dominican Republic, Armenia, Latvia, State of, Macedonia (the former Yugoslav Republic of), Vietnam, Peru
    Description
    1. Specifications Data size : 200,000 ID

    Race distribution : Asians, Caucasians, black people

    Gender distribution : gender balance

    Age distribution : ranging from teenager to the elderly, the middle-aged and young people are the majorities

    Collecting environment : including indoor and outdoor scenes

    Data diversity : different shooting heights, different ages, different light conditions, different collecting environment, clothes in different seasons, multiple human poses

    Device : cameras

    Data format : the data format is .jpg/mp4, the annotation file format is .json, the camera parameter file format is .json, the point cloud file format is .pcd

    Accuracy : based on the accuracy of the poses, the accuracy exceeds 97%;the accuracy of labels of gender, race, age, collecting environment and clothes are more than 97%

    1. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 3 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go machine learning (ML) data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at hhttps://www.nexdata.ai/datasets/computervision?source=Datarade
  16. f

    ID's photo Dataset | 67 countries | 11 types of documents | Document...

    • data.filemarket.ai
    Updated Jul 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FileMarket (2025). ID's photo Dataset | 67 countries | 11 types of documents | Document Recognition | OCR Training | Computer Vision [Dataset]. https://data.filemarket.ai/products/id-s-photo-dataset-67-countries-11-types-of-documents-d-filemarket
    Explore at:
    Dataset updated
    Jul 26, 2025
    Dataset authored and provided by
    FileMarket
    Area covered
    United States, Brazil, France
    Description

    Dataset of 3623 images from 1661 users (~2.18/user), mainly front/back ID documents, ideal for OCR training, document recognition, and automated identity verification tasks.

  17. Nexdata | Document OCR&Parsing Data | 10 Million Pages

    • datarade.ai
    Updated Nov 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2025). Nexdata | Document OCR&Parsing Data | 10 Million Pages [Dataset]. https://datarade.ai/data-products/nexdata-document-ocr-parsing-data-10-million-pages-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Nov 7, 2025
    Dataset authored and provided by
    Nexdata
    Area covered
    Singapore, Iraq, Liechtenstein, Finland, Denmark, Mexico, Belarus, Russian Federation, Paraguay, United States of America
    Description

    Million Level Document OCR and Structured Analysis Data, Including textbooks, e-books, teaching aids, patents, theses, etc. The annotated files include OCR annotations and structured analysis.

    Data Size

    10 Million Pages

    Data Types

    Textbooks, Chinese E-books, Teaching Reference Books, Paper, Chinese Journals, English Journals

    Data Format

    The original document file format is PDF, the document image file format is. png, the OCR annotation file format is JSON, and the structured parsing file format is markdown(Tables and formulas are in Latex format or screenshot links)

    Data Accuracy

    If the transcription of the text is basically accurate, the markings are basically aligned, and there are no obvious typos, it is considered correct annotation. Divided by punctuation marks, the number of correctly annotated sentences should not be less than 90%.

  18. d

    FileMarket | Diverse Human Face Data | 20,000 IDs | Face Recognition Data |...

    • datarade.ai
    Updated Jul 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FileMarket (2024). FileMarket | Diverse Human Face Data | 20,000 IDs | Face Recognition Data | Image/Video AI Training Data | Biometric Data [Dataset]. https://datarade.ai/data-products/filemarket-diverse-human-face-data-20-000-ids-face-reco-filemarket
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Jul 5, 2024
    Dataset authored and provided by
    FileMarket
    Area covered
    Georgia, United Kingdom, Martinique, Oman, Iceland, Hong Kong, Libya, Curaรงao, Sri Lanka, Kyrgyzstan
    Description

    Biometric Data

    FileMarket provides a comprehensive Biometric Data set, ideal for enhancing AI applications in security, identity verification, and more. In addition to Biometric Data, we offer specialized datasets across Object Detection Data, Machine Learning (ML) Data, Large Language Model (LLM) Data, and Deep Learning (DL) Data. Each dataset is meticulously crafted to support the development of cutting-edge AI models.

    Data Size: 20,000 IDs

    Race Distribution: The dataset encompasses individuals from diverse racial backgrounds, including Black, Caucasian, Indian, and Asian groups.

    Gender Distribution: The dataset equally represents all genders, ensuring a balanced and inclusive collection.

    Age Distribution: The data spans a broad age range, including young, middle-aged, and senior individuals, providing comprehensive age coverage.

    Collection Environment: Data has been gathered in both indoor and outdoor environments, ensuring variety and relevance for real-world applications.

    Data Diversity: This dataset includes a rich variety of face poses, racial backgrounds, age groups, lighting conditions, and scenes, making it ideal for robust biometric model training.

    Device: All data has been collected using mobile phones, reflecting common real-world usage scenarios.

    Data Format: The data is provided in .jpg and .png formats, ensuring compatibility with various processing tools and systems.

    Accuracy: The labels for face pose, race, gender, and age are highly accurate, exceeding 95%, making this dataset reliable for training high-performance biometric models.

  19. Re-ID Data | 600,000 ID | CCTV Data |Computer Vision Data| Identity Data|...

    • datarade.ai
    Updated Dec 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2023). Re-ID Data | 600,000 ID | CCTV Data |Computer Vision Data| Identity Data| Imagery Data [Dataset]. https://datarade.ai/data-products/nexdata-re-id-data-60-000-id-image-video-ai-ml-train-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Dec 8, 2023
    Dataset authored and provided by
    Nexdata
    Area covered
    Luxembourg, Ecuador, Cuba, Bolivia (Plurinational State of), Trinidad and Tobago, Portugal, Turkmenistan, Sri Lanka, Russian Federation, United Arab Emirates
    Description
    1. Specifications Data size : 60,000 ID

    Population distribution : the race distribution is Asians, Caucasians and black people, the gender distribution is male and female, the age distribution is from children to the elderly

    Collecting environment : including indoor and outdoor scenes (such as supermarket, mall and residential area, etc.)

    Data diversity : different ages, different time periods, different cameras, different human body orientations and postures, different ages collecting environment

    Device : surveillance cameras, the image resolution is not less than 1,9201,080

    Data format : the image data format is .jpg, the annotation file format is .json

    Annotation content : human body rectangular bounding boxes, 15 human body attributes

    Quality Requirements : A rectangular bounding box of human body is qualified when the deviation is not more than 3 pixels, and the qualified rate of the bounding boxes shall not be lower than 97%;Annotation accuracy of attributes is over 97%

    1. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 3 million hours of Speech Data and 800TB of Imagery Data.These ready-to-go Identity Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/computervision?source=Datarade
  20. m

    Data from: Tracking Plant Growth Using Image Sequence Analysis- Datasets

    • data.mendeley.com
    Updated Jan 10, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yiftah Szoke (2025). Tracking Plant Growth Using Image Sequence Analysis- Datasets [Dataset]. http://doi.org/10.17632/z2fp5kbgbh.1
    Explore at:
    Dataset updated
    Jan 10, 2025
    Authors
    Yiftah Szoke
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset consists of five subsets with annotated images in COCO format, designed for object detection and tracking plant growth: 1. Cucumber_Train Dataset (for Faster R-CNN) - Includes training, validation, and test images of cucumbers from different angles. - Annotations: Bounding boxes in COCO format for object detection tasks.

    1. Tomato Dataset
    2. Contains images of tomato plants for 24 hours at hourly intervals from a fixed angle.
    3. Annotations: Bounding boxes in COCO format.

    4. Pepper Dataset

    5. Contains images of pepper plants for 24 hours at hourly intervals from a fixed angle.

    6. Annotations: Bounding boxes in COCO format.

    7. Cannabis Dataset

    8. Contains images of cannabis plants for 24 hours at hourly intervals from a fixed angle.

    9. Annotations: Bounding boxes in COCO format.

    10. Cucumber Dataset

    11. Contains images of cucumber plants for 24 hours at hourly intervals from a fixed angle.

    12. Annotations: Bounding boxes in COCO format.

    This dataset supports training and evaluation of object detection models across diverse crops.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
meer atif magsi (2023). yolo format dataset [Dataset]. https://www.kaggle.com/datasets/meeratif/yolo-format-data
Organization logo

yolo format dataset

๐Ÿค™ yolo format data || sindhi language title

Explore at:
zip(256907308 bytes)Available download formats
Dataset updated
Oct 16, 2023
Authors
meer atif magsi
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Dataset Highlights:

  1. Diverse Image Collection: Our dataset encompasses a wide range of general images covering various categories such as objects, scenes, people, and more. The images are carefully curated to offer a rich source of visual data.

  2. Sindhi Language Titles: One of the distinctive features of our dataset is the inclusion of Sindhi language titles for each image.

  3. Annotations in YOLO Format: To facilitate your object detection tasks, we have meticulously annotated the images in YOLO format, making it compatible with the YOLOv3 or YOLOv4 models. This ensures that you can jump right into training your model without the hassle of converting annotations.

  4. Comprehensive Metadata: Each image in the dataset is accompanied by a YAML file providing additional metadata, including information about the image source, date of capture, and any relevant context that may be useful for your research.

By publishing this YOLO-style dataset with Sindhi language titles, we aim to contribute to the machine learning and computer vision community, fostering innovation and inclusivity in the field. We encourage you to explore, experiment, and create cutting-edge models using this dataset, and we look forward to seeing the incredible projects that emerge from it.

Search
Clear search
Close search
Google apps
Main menu