100+ datasets found

R
Microsoft Coco Dataset
universe.roboflow.com
zip
Updated Jul 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Microsoft (2025). Microsoft Coco Dataset [Dataset]. https://universe.roboflow.com/microsoft/coco/model/3
Explore at:
zipAvailable download formats
Dataset updated
Jul 23, 2025
Dataset authored and provided by
Microsoft
Variables measured
Object Bounding Boxes
Description
Microsoft Common Objects in Context (COCO) Dataset

The Common Objects in Context (COCO) dataset is a widely recognized collection designed to spur object detection, segmentation, and captioning research. Created by Microsoft, COCO provides annotations, including object categories, keypoints, and more. The model it a valuable asset for machine learning practitioners and researchers. Today, many model architectures are benchmarked against COCO, which has enabled a standard system by which architectures can be compared.

While COCO is often touted to comprise over 300k images, it's pivotal to understand that this number includes diverse formats like keypoints, among others. Specifically, the labeled dataset for object detection stands at 123,272 images.

The full object detection labeled dataset is made available here, ensuring researchers have access to the most comprehensive data for their experiments. With that said, COCO has not released their test set annotations, meaning the test data doesn't come with labels. Thus, this data is not included in the dataset.

The Roboflow team has worked extensively with COCO. Here are a few links that may be helpful as you get started working with this dataset:

An introduction to the COCO dataset

Weird images in COCO, and what that tells us about the utility and limits of COCO
COCO Dataset 2017
kaggle.com
gts.ai
Updated Mar 18, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Saba Hesaraki (2023). COCO Dataset 2017 [Dataset]. https://www.kaggle.com/datasets/sabahesaraki/2017-2017
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 18, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Saba Hesaraki
Description
The MS COCO (Microsoft Common Objects in Context) dataset is a large-scale object detection, segmentation, key-point detection, and captioning dataset. The dataset consists of 328K images.

Splits: The first version of MS COCO dataset was released in 2014. It contains 164K images split into training (83K), validation (41K) and test (41K) sets. In 2015 additional test set of 81K images was released, including all the previous test images and 40K new images.

Based on community feedback, in 2017 the training/validation split was changed from 83K/41K to 118K/5K. The new split uses the same images and annotations. The 2017 test set is a subset of 41K images of the 2015 test set. Additionally, the 2017 release contains a new unannotated dataset of 123K images.
R
Coco Limited (person Only) Dataset
universe.roboflow.com
zip
Updated May 31, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
shreks swamp (2022). Coco Limited (person Only) Dataset [Dataset]. https://universe.roboflow.com/shreks-swamp/coco-dataset-limited--person-only/model/1
Explore at:
zipAvailable download formats
Dataset updated
May 31, 2022
Dataset authored and provided by
shreks swamp
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
People Bounding Boxes
Description
COCO Dataset Limited (Person Only)

## Overview COCO Dataset Limited (Person Only) is a dataset for object detection tasks - it contains People annotations for 5,438 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
R
Coco_car Dataset
universe.roboflow.com
zip
Updated Apr 5, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
yolo10 (2025). Coco_car Dataset [Dataset]. https://universe.roboflow.com/yolo10-srfz1/coco_car/model/6
Explore at:
zipAvailable download formats
Dataset updated
Apr 5, 2025
Dataset authored and provided by
yolo10
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Coco_car Bounding Boxes
Description
Coco_car

## Overview Coco_car is a dataset for object detection tasks - it contains Coco_car annotations for 2,000 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Coco Dataset for Multi-label Image Classification
kaggle.com
zip
Updated Apr 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shubham Sharma (2024). Coco Dataset for Multi-label Image Classification [Dataset]. https://www.kaggle.com/datasets/shubham2703/coco-dataset-for-multi-label-image-classification
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Apr 19, 2024
Authors
Shubham Sharma
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Dataset Overview

This page contains a modified Cocos dataset along with details about the dataset used.

File Descriptions

imgs.zip - Train: 🚂 This folder contains the training set, which can be split into train/validation data for model training. - Test: 🧪 Your trained models should be used to produce predictions on the test set.

labels.zip - categories.csv: 📝 This file lists all the object classes in the dataset, ordered according to the column ordering in the train labels file. - train_labels.csv: 📊 This file contains data regarding which image contains which categories.
f
Comparison of the state-of-the-art model on MS COCO data.
plos.figshare.com
xls
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hong Li; Qian Zhou; Yao Mao; Bing Zhang; Chao Liu (2023). Comparison of the state-of-the-art model on MS COCO data. [Dataset]. http://doi.org/10.1371/journal.pone.0276581.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0276581.t002
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Hong Li; Qian Zhou; Yao Mao; Bing Zhang; Chao Liu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Comparison of the state-of-the-art model on MS COCO data.
Person-Collecting-Waste COCO Dataset
kaggle.com
Updated Mar 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ashutosh Sharma (2025). Person-Collecting-Waste COCO Dataset [Dataset]. https://www.kaggle.com/datasets/ashu009/person-collecting-waste-coco-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 31, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ashutosh Sharma
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset: COCO-Formatted Object Detection Dataset

Overview

This dataset is designed for object detection tasks and follows the COCO format. It contains 300 images and corresponding annotation files in JSON format. The dataset is split into training, validation, and test sets, ensuring a balanced distribution for model evaluation.

Dataset Structure

The dataset is organized into three main folders:

train/ (70% - 210 images)

valid/ (15% - 45 images)

test/ (15% - 45 images)

Each folder contains:

Images in JPEG/PNG format.

A corresponding _annotations.coco.json file that includes bounding box annotations.

Preprocessing & Augmentations

The dataset has undergone several preprocessing and augmentation steps to enhance model generalization:

Image Preprocessing:

Auto-orientation applied

Resized to 640x640 pixels (stretched)

Augmentation Techniques:

Flip: Horizontal flipping

Crop: 0% minimum zoom, 5% maximum zoom

Rotation: Between -5° and +5°

Saturation: Adjusted between -4% and +4%

Brightness: Adjusted between -10% and +10%

Blur: Up to 0px

Noise: Up to 0.1% of pixels

Bounding Box Augmentations:

Flipping, cropping, rotation, brightness adjustments, blur, and noise applied accordingly to maintain annotation consistency.

Annotation Format

The dataset follows the COCO (Common Objects in Context) format, which includes:

images section: Contains image metadata such as filename, width, and height.

annotations section: Includes bounding boxes, category IDs, and segmentation masks (if applicable).

categories section: Defines class labels.
g
Coco Damage Detection Trained Models
gts.ai
json
Updated Nov 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GTS (2023). Coco Damage Detection Trained Models [Dataset]. https://gts.ai/dataset-download/coco-damage-detection-trained-models/
Explore at:
jsonAvailable download formats
Dataset updated
Nov 20, 2023
Dataset provided by
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
Authors
GTS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Enhance your AI-powered damage detection with our Coco Damage Detection Trained Models. Designed for precision and efficiency, these models are versatile and easily integrated into various applications..
Z
COCO, LVIS, Open Images V4 classes mapping
data.niaid.nih.gov
zenodo.org
Updated Oct 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Giuseppe Amato (2022). COCO, LVIS, Open Images V4 classes mapping [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7194299
Explore at:
Dataset updated
Oct 13, 2022
Dataset provided by
Lucia Vadicamo
Claudio Gennaro
Fabio Carrara
Claudio Vairo
Nicola Messina
Fabrizio Falchi
Paolo Bolettieri
Giuseppe Amato
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This repository contains a mapping between the classes of COCO, LVIS, and Open Images V4 datasets into a unique set of 1460 classes.

COCO [Lin et al 2014] contains 80 classes, LVIS [gupta2019lvis] contains 1460 classes, Open Images V4 [Kuznetsova et al. 2020] contains 601 classes.

We built a mapping of these classes using a semi-automatic procedure in order to have a unique final list of 1460 classes. We also generated a hierarchy for each class, using wordnet

This repository contains the following files:

coco_classes_map.txt, contains the mapping for the 80 coco classes

lvis_classes_map.txt, contains the mapping for the 1460 coco classes

openimages_classes_map.txt, contains the mapping for the 601 coco classes

classname_hyperset_definition.csv, contains the final set of 1460 classes, their definition and hierarchy

all-classnames.xlsx, contains a side-by-side view of all classes considered

This mapping was used in VISIONE [Amato et al. 2021, Amato et al. 2022] that is a content-based retrieval system that supports various search functionalities (text search, object/color-based search, semantic and visual similarity search, temporal search). For the object detection VISIONE uses three pre-trained models: VfNet Zhang et al. 2021, Mask R-CNN He et al. 2017, and a Faster R-CNN+Inception ResNet (trained on the Open Images V4).

This is repository is released under a Creative Commons Attribution license, please cite the following paper if you use it in your work in any form:

@inproceedings{amato2021visione, title={The visione video search system: exploiting off-the-shelf text search engines for large-scale video retrieval}, author={Amato, Giuseppe and Bolettieri, Paolo and Carrara, Fabio and Debole, Franca and Falchi, Fabrizio and Gennaro, Claudio and Vadicamo, Lucia and Vairo, Claudio}, journal={Journal of Imaging}, volume={7}, number={5}, pages={76}, year={2021}, publisher={Multidisciplinary Digital Publishing Institute} }

References:

[Amato et al. 2022] Amato, G. et al. (2022). VISIONE at Video Browser Showdown 2022. In: , et al. MultiMedia Modeling. MMM 2022. Lecture Notes in Computer Science, vol 13142. Springer, Cham. https://doi.org/10.1007/978-3-030-98355-0_52

[Amato et al. 2021] Amato, G., Bolettieri, P., Carrara, F., Debole, F., Falchi, F., Gennaro, C., Vadicamo, L. and Vairo, C., 2021. The visione video search system: exploiting off-the-shelf text search engines for large-scale video retrieval. Journal of Imaging, 7(5), p.76.

[Gupta et al.2019] Gupta, A., Dollar, P. and Girshick, R., 2019. Lvis: A dataset for large vocabulary instance segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 5356-5364).

[He et al. 2017] He, K., Gkioxari, G., Dollár, P. and Girshick, R., 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision (pp. 2961-2969).

[Kuznetsova et al. 2020] Kuznetsova, A., Rom, H., Alldrin, N., Uijlings, J., Krasin, I., Pont-Tuset, J., Kamali, S., Popov, S., Malloci, M., Kolesnikov, A. and Duerig, T., 2020. The open images dataset v4. International Journal of Computer Vision, 128(7), pp.1956-1981.

[Lin et al. 2014] Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P. and Zitnick, C.L., 2014, September. Microsoft coco: Common objects in context. In European conference on computer vision (pp. 740-755). Springer, Cham.

[Zhang et al. 2021] Zhang, H., Wang, Y., Dayoub, F. and Sunderhauf, N., 2021. Varifocalnet: An iou-aware dense object detector. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 8514-8523).
h
coco-clip-vit-l-14
huggingface.co
Updated Nov 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Simeon Emanuilov (2023). coco-clip-vit-l-14 [Dataset]. http://doi.org/10.57967/hf/3225
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57967/hf/3225
Dataset updated
Nov 30, 2023
Authors
Simeon Emanuilov
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
COCO Dataset Processed with CLIP ViT-L/14

Overview

This dataset represents a processed version of the '2017 Unlabeled images' subset of the COCO dataset (COCO Dataset), utilizing the CLIP ViT-L/14 model from OpenAI. The original dataset comprises 123K images, approximately 19GB in size, which have been processed to generate 786-dimensional vectors. These vectors can be utilized for various applications like semantic search systems, image similarity assessments, and more.… See the full description on the dataset page: https://huggingface.co/datasets/s-emanuilov/coco-clip-vit-l-14.
R
Yolov8 Coco Dataset
universe.roboflow.com
zip
Updated Jan 8, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NDT test (2024). Yolov8 Coco Dataset [Dataset]. https://universe.roboflow.com/ndt-test/yolov8-coco/dataset/2
Explore at:
zipAvailable download formats
Dataset updated
Jan 8, 2024
Dataset authored and provided by
NDT test
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
All Bounding Boxes
Description
Yolov8 Coco

## Overview Yolov8 Coco is a dataset for object detection tasks - it contains All annotations for 5,000 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
h
depth_coco
huggingface.co
Updated Jul 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neil de la fuente (2024). depth_coco [Dataset]. http://doi.org/10.57967/hf/2704
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57967/hf/2704
Dataset updated
Jul 11, 2024
Authors
Neil de la fuente
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Dataset Card for MS COCO Depth Maps

This dataset is a collection of depth maps generated from the MS COCO dataset images using the Depth-Anything-V2 model, along with the original MS COCO images.

Dataset Details Dataset Description

This dataset contains depth maps generated from the MS COCO (Common Objects in Context) dataset images using the Depth-Anything-V2 model. It provides depth information for each image in the original MS COCO dataset, offering a new… See the full description on the dataset page: https://huggingface.co/datasets/neildlf/depth_coco.
Common Object Detection
sdiinnovation-geoplatform.hub.arcgis.com
hub.arcgis.com
Updated Feb 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Esri (2023). Common Object Detection [Dataset]. https://sdiinnovation-geoplatform.hub.arcgis.com/content/a91bed8bc0fe4e1bb8db45c23959e5f1
Explore at:
Dataset updated
Feb 28, 2023
Dataset authored and provided by
Esrihttp://esri.com/
Description
This is an open source object detection model by TensorFlow in TensorFlow Lite format. While it is not recommended to use this model in production surveys, it can be useful for demonstration purposes and to get started with smart assistants in ArcGIS Survey123. You are responsible for the use of this model. When using Survey123, it is your responsibility to review and manually correct outputs.This object detection model was trained using the Common Objects in Context (COCO) dataset. COCO is a large-scale object detection dataset that is available for use under the Creative Commons Attribution 4.0 License.The dataset contains 80 object categories and 1.5 million object instances that include people, animals, food items, vehicles, and household items. For a complete list of common objects this model can detect, see Classes.The model can be used in ArcGIS Survey123 to detect common objects in photos that are captured with the Survey123 field app. Using the modelFollow the guide to use the model. You can use this model to detect or redact common objects in images captured with the Survey123 field app. The model must be configured for a survey in Survey123 Connect.Fine-tuning the modelThis model cannot be fine-tuned using ArcGIS tools.InputCamera feed (either low-resolution preview or high-resolution capture).OutputImage with common object detections written to its EXIF metadata or an image with detected objects redacted.Model architectureThis is an open source object detection model by TensorFlow in TensorFlow Lite format with MobileNet architecture. The model is available for use under the Apache License 2.0.Sample resultsHere are a few results from the model.
h
llava-bench-coco
huggingface.co
Updated Apr 21, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
LMMs-Lab (2024). llava-bench-coco [Dataset]. https://huggingface.co/datasets/lmms-lab/llava-bench-coco
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 21, 2024
Dataset authored and provided by
LMMs-Lab
Description
Large-scale Multi-modality Models Evaluation Suite

Accelerating the development of large-scale multi-modality models (LMMs) with lmms-eval

🏠 Homepage | 📚 Documentation | 🤗 Huggingface Datasets

This Dataset

This is a formatted version of LLaVA-Bench(COCO) that is used in LLaVA. It is used in our lmms-eval pipeline to allow for one-click evaluations of large multi-modality models. @misc{liu2023improvedllava, author={Liu, Haotian and Li, Chunyuan and… See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/llava-bench-coco.
Z
COCO dataset and neural network weights for micro-FTIR particle detection on...
data.niaid.nih.gov
Updated Aug 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Schowing, Thibault (2024). COCO dataset and neural network weights for micro-FTIR particle detection on filters. [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10839526
Explore at:
Dataset updated
Aug 13, 2024
Dataset authored and provided by
Schowing, Thibault
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The IMPTOX project has received funding from the EU's H2020 framework programme for research and innovation under grant agreement n. 965173. Imptox is part of the European MNP cluster on human health.

More information about the project here.

Description: This repository includes the trained weights and a custom COCO-formatted dataset used for developing and testing a Faster R-CNN R_50_FPN_3x object detector, specifically designed to identify particles in micro-FTIR filter images.

Contents:

Weights File (neuralNetWeights_V3.pth):

Format: .pth

Description: This file contains the trained weights for a Faster R-CNN model with a ResNet-50 backbone and a Feature Pyramid Network (FPN), trained for 3x schedule. These weights are specifically tuned for detecting particles in micro-FTIR filter images.

Custom COCO Dataset (uFTIR_curated_square.v5-uftir_curated_square_2024-03-14.coco-segmentation.zip):

Format: .zip

Description: This zip archive contains a custom COCO-formatted dataset, including JPEG images and their corresponding annotation file. The dataset consists of images of micro-FTIR filters with annotated particles.

Contents:

Images: JPEG format images of micro-FTIR filters.

Annotations: A JSON file in COCO format providing detailed annotations of the particles in the images.

Management: The dataset can be managed and manipulated using the Pycocotools library, facilitating easy integration with existing COCO tools and workflows.

Applications: The provided weights and dataset are intended for researchers and practitioners in the field of microscopy and particle detection. The dataset and model can be used for further training, validation, and fine-tuning of object detection models in similar domains.

Usage Notes:

The neuralNetWeights_V3.pth file should be loaded into a PyTorch model compatible with the Faster R-CNN architecture, such as Detectron2.

The contents of uFTIR_curated_square.v5-uftir_curated_square_2024-03-14.coco-segmentation.zip should be extracted and can be used with any COCO-compatible object detection framework for training and evaluation purposes.

Code can be found on the related Github repository.
MJ-COCO-2025 Dataset
kaggle.com
Updated May 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MJ-COCO-2025 (2025). MJ-COCO-2025 Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/11977654
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/11977654
Dataset updated
May 28, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
MJ-COCO-2025
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
MJ-COCO-2025 is a modified version of the MS-COCO-2017 dataset, in which the annotation errors have been automatically corrected using model-driven methods. The name "MJ" originates from the initials of Min Je Kim, the individual who updated the dataset. "MJ" also stands for "Modification & Justification," emphasizing that the modifications were not manually edited but were systematically validated through machine learning models to increase reliability and quality. Thus, MJ-COCO-2025 reflects both a personal identity and a commitment to improving the dataset through thoughtful modification, ensuring improved accuracy, reliability and consistency. The comparative results of MS-COCO and MJ-COCO datasets are presented in Table 1 and Figure 1. The MJ-COCO-2025 dataset features the improvements, including fixes for group annotations, addition of missing annotations, removal of redundant or overlapping labels, etc. These refinements aim to improve training and evaluation performance in object detection tasks.

Summary of Improvements:

The re-labeled MJ-COCO-2025 dataset exhibits notable improvements in annotation quality compared to the original MS-COCO-2017 dataset. As shown in Table 1, it includes substantial increases in categories such as previously missing annotations and group annotations. At the same time, the dataset has been refined by reducing annotation noise through the removal of duplicates, resolution of challenging or debatable cases, and elimination of non-existent object annotations.

Table 1: Comparison of Class Names | MS-COCO | MJ-COCO | ---------------------|---------|---------|----- Airplane | 5,135 | 5,810 | 675 | Kite Apple | 5,851 | 19,527 | 13,676 | Knife Backpack | 8,720 | 10,029 | 1,309 | Laptop Banana | 9,458 | 49,705 | 40,247 | Microwave Baseball Bat | 3,276 | 3,517 | 241 | Motorcycle Baseball Glove | 3,747 | 3,440 | -307 | Mouse Bear | 1,294 | 1,311 | 17 | Orange Bed | 4,192 | 4,177 | -15 | Oven Bench | 9,838 | 9,784 | -54 | Parking Meter Bicycle | 7,113 | 7,853 | 740 | Person Bird | 10,806 | 13,346 | 2,540 | Pizza Boat | 10,759 | 13,386 | 2,627 | Potted Plant Book | 24,715 | 35,712 | 10,997 | Refrigerator Bottle | 24,342 | 32,455 | 8,113 | Remote Bowl | 14,358 | 13,591 | -767 | Sandwich Broccoli | 7,308 | 14,275 | 6,967 | Scissors Bus | 6,069 | 7,132 | 1,063 | Sheep Cake | 6,353 | 8,968 | 2,615 | Sink Car | 43,867 | 51,662 | 7,795 | Skateboard Carrot | 7,852 | 15,411 | 7,559 | Skis Cat | 4,768 | 4,895 | 127 | Snowboard Cell Phone | 6,434 | 6,642 | 208 | Spoon Chair | 38,491 | 56,750 | 18,259 | Sports Ball Clock | 6,334 | 7,618 | 1,284 | Stop Sign Couch | 5,779 | 5,598 | -181 | Suitcase Cow | 8,147 | 8,990 | 843 | Surfboard Cup | 20,650 | 22,545 | 1,895 | Teddy Bear Dining Table | 15,714 | 16,569 | 855 Dog | 5,508 | 5,870 | 362 | Tie Donut | 7,179 | 11,622 | 4,443 ... Class-wise Annotations: MS-COCO-2017 and MJ-COCO-2025. Difference | Class Names | MS-COCO | MJ-COCO | Difference -------|----------------------|---------|---------|------------ | 9,076 | 15,092 | 6,016 | 7,770 | 6,697 | -1,073 | 4,970 | 5,280 | 310 | 1,673 | 1,755 | 82 | 8,725 | 10,045 | 1,320 | 2,262 | 2,377 | 115 | 6,399 | 18,416 | 12,017 | 3,334 | 4,310 | 976 | 1,285 | 1,355 | 70 | 262,465 | 435,252 | 172,787 | 5,821 | 6,049 | 228 | 8,652 | 11,252 | 2,600 | 2,637 | 2,728 | 91 | 5,703 | 5,428 | -275 | 4,373 | 3,925 | -448 | 1,481 | 1,558 | 77 | 9,509 | 12,813 | 3,304 | 5,610 | 5,969 | 359 | 5,543 | 5,761 | 218 | 6,646 | 8,945 | 2,299 | 2,685 | 2,565 | -120 | 6,165 | 6,156 | -9 | 6,347 | 6,060 | -287 | 1,983 | 2,684 | 701 | 6,192 | 7,447 | 1,255 | 6,126 | 6,175 | 49 | 4,793 | 6,432 | 1,639 | Tennis Racket | 4,812 | 4,932 | 120 | 6,496 | 6,048 | -448
f
Dataset-I-drinking-related-object-detection (in both YoloV8 and COCO format)...
kcl.figshare.com
Updated Feb 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Chen; Xinqi Bao; Ernest Kamavuako (2025). Dataset-I-drinking-related-object-detection (in both YoloV8 and COCO format) [Dataset]. http://doi.org/10.18742/26337085.v1
Explore at:
Unique identifier
https://doi.org/10.18742/26337085.v1
Dataset updated
Feb 27, 2025
Dataset provided by
King's College London
Authors
Xin Chen; Xinqi Bao; Ernest Kamavuako
License
https://www.kcl.ac.uk/researchsupport/assets/DataAccessAgreement-Description.pdfhttps://www.kcl.ac.uk/researchsupport/assets/DataAccessAgreement-Description.pdf
Description
This dataset contains annotated images for object detection for containers and hands in a first-person view (egocentric view) during drinking activities. Both YOLOV8 format and COCO format are provided.Please refer to our paper for more details.Purpose: Training and testing the object detection model.Content: Videos from Session 1 of Subjects 1-20.Images: Extracted from the videos of Subjects 1-20 Session 1.Additional Images:~500 hand/container images from Roboflow Open Source data.~1500 null (background) images from VOC Dataset and MIT Indoor Scene Recognition Dataset:1000 indoor scenes from 'MIT Indoor Scene Recognition'400 other unrelated objects from VOC DatasetData Augmentation:Horizontal flipping±15% brightness change±10° rotationFormats Provided:COCO formatPyTorch YOLOV8 formatImage Size: 416x416 pixelsTotal Images: 16,834Training: 13,862Validation: 1,975Testing: 997Instance Numbers:Containers: Over 10,000Hands: Over 8,000
image-caption-coco-model
kaggle.com
Updated Apr 26, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
lachonman2 (2020). image-caption-coco-model [Dataset]. https://www.kaggle.com/lachonman2/imagecaptioncocomodel
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 26, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
lachonman2
Description
Dataset

This dataset was created by lachonman2

Contents
h
SargeZT-coco-stuff-captioned
huggingface.co
Updated Apr 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AbstractPhila (2025). SargeZT-coco-stuff-captioned [Dataset]. https://huggingface.co/datasets/AbstractPhil/SargeZT-coco-stuff-captioned
Explore at:
Dataset updated
Apr 29, 2025
Authors
AbstractPhila
Description
⚠️ WARNING: COCO Dataset Contamination Risk The COCO dataset contains latent risks of inappropriate label associations when training adult-oriented models, particularly involving child-descriptive language. Despite its academic origin and wide usage, COCO embeds captions such as “little girl,” “young boy,” “baby,” and “child” across a range of depictions. When used as-is in diffusion model training, this poses a serious ethical and representational hazard, as tags can be wrongly associated… See the full description on the dataset page: https://huggingface.co/datasets/AbstractPhil/SargeZT-coco-stuff-captioned.
R
Coco New Dataset
universe.roboflow.com
zip
Updated Jul 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
object size measurement (2025). Coco New Dataset [Dataset]. https://universe.roboflow.com/object-size-measurement/coco-new-dataset/model/1
Explore at:
zipAvailable download formats
Dataset updated
Jul 21, 2025
Dataset authored and provided by
object size measurement
Variables measured
All Coco Class Plus Shapes Bounding Boxes
Description
Coco New Dataset

## Overview Coco New Dataset is a dataset for object detection tasks - it contains All Coco Class Plus Shapes annotations for 753 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.

Facebook

Twitter

Click to copy link

Link copied

Cite

Microsoft (2025). Microsoft Coco Dataset [Dataset]. https://universe.roboflow.com/microsoft/coco/model/3

Microsoft Coco Dataset

coco

microsoft-coco-dataset

Explore at:

zipAvailable download formats

Dataset updated

Jul 23, 2025

Dataset authored and provided by

Microsoft

Variables measured

Object Bounding Boxes

Description

Microsoft Common Objects in Context (COCO) Dataset

The Common Objects in Context (COCO) dataset is a widely recognized collection designed to spur object detection, segmentation, and captioning research. Created by Microsoft, COCO provides annotations, including object categories, keypoints, and more. The model it a valuable asset for machine learning practitioners and researchers. Today, many model architectures are benchmarked against COCO, which has enabled a standard system by which architectures can be compared.

While COCO is often touted to comprise over 300k images, it's pivotal to understand that this number includes diverse formats like keypoints, among others. Specifically, the labeled dataset for object detection stands at 123,272 images.

The full object detection labeled dataset is made available here, ensuring researchers have access to the most comprehensive data for their experiments. With that said, COCO has not released their test set annotations, meaning the test data doesn't come with labels. Thus, this data is not included in the dataset.

The Roboflow team has worked extensively with COCO. Here are a few links that may be helpful as you get started working with this dataset:

Clear search

Close search

Google apps

Main menu

Microsoft Coco Dataset

Microsoft Common Objects in Context (COCO) Dataset

COCO Dataset 2017

Coco Limited (person Only) Dataset

COCO Dataset Limited (Person Only)

Coco_car Dataset

Coco_car

Coco Dataset for Multi-label Image Classification

Dataset Overview

Comparison of the state-of-the-art model on MS COCO data.

Person-Collecting-Waste COCO Dataset

Dataset: COCO-Formatted Object Detection Dataset

Overview

Dataset Structure

The dataset is organized into three main folders:

Each folder contains:

Preprocessing & Augmentations

Image Preprocessing:

Augmentation Techniques:

Annotation Format

Coco Damage Detection Trained Models

COCO, LVIS, Open Images V4 classes mapping

coco-clip-vit-l-14

Yolov8 Coco Dataset

Yolov8 Coco

depth_coco

Common Object Detection

llava-bench-coco

COCO dataset and neural network weights for micro-FTIR particle detection on...

MJ-COCO-2025 Dataset

Summary of Improvements:

Dataset-I-drinking-related-object-detection (in both YoloV8 and COCO format)...

image-caption-coco-model

Dataset

Contents

SargeZT-coco-stuff-captioned

Coco New Dataset

Coco New Dataset

Microsoft Coco Dataset

coco

microsoft-coco-dataset

Microsoft Common Objects in Context (COCO) Dataset