53 datasets found

R
Microsoft Coco Dataset
universe.roboflow.com
zip
Updated Jul 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Microsoft (2025). Microsoft Coco Dataset [Dataset]. https://universe.roboflow.com/microsoft/coco/model/3
Explore at:
zipAvailable download formats
Dataset updated
Jul 23, 2025
Dataset authored and provided by
Microsoft
Variables measured
Object Bounding Boxes
Description
Microsoft Common Objects in Context (COCO) Dataset

The Common Objects in Context (COCO) dataset is a widely recognized collection designed to spur object detection, segmentation, and captioning research. Created by Microsoft, COCO provides annotations, including object categories, keypoints, and more. The model it a valuable asset for machine learning practitioners and researchers. Today, many model architectures are benchmarked against COCO, which has enabled a standard system by which architectures can be compared.

While COCO is often touted to comprise over 300k images, it's pivotal to understand that this number includes diverse formats like keypoints, among others. Specifically, the labeled dataset for object detection stands at 123,272 images.

The full object detection labeled dataset is made available here, ensuring researchers have access to the most comprehensive data for their experiments. With that said, COCO has not released their test set annotations, meaning the test data doesn't come with labels. Thus, this data is not included in the dataset.

The Roboflow team has worked extensively with COCO. Here are a few links that may be helpful as you get started working with this dataset:

An introduction to the COCO dataset

Weird images in COCO, and what that tells us about the utility and limits of COCO
Microsoft COCO 2017 Object Detection Dataset - raw
public.roboflow.com
zip
Updated Feb 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Microsoft (2025). Microsoft COCO 2017 Object Detection Dataset - raw [Dataset]. https://public.roboflow.com/object-detection/microsoft-coco-subset/2
Explore at:
zipAvailable download formats
Dataset updated
Feb 1, 2025
Dataset authored and provided by
Microsofthttp://microsoft.com/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Bounding Boxes of coco-objects
Description
This is the full 2017 COCO object detection dataset (train and valid), which is a subset of the most recent 2020 COCO object detection dataset.

COCO is a large-scale object detection, segmentation, and captioning dataset of many object types easily recognizable by a 4-year-old. The data is initially collected and published by Microsoft. The original source of the data is here and the paper introducing the COCO dataset is here.
g
COCO Dataset 2017
gts.ai
json
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GTS, COCO Dataset 2017 [Dataset]. https://gts.ai/dataset-download/coco-dataset-2017/
Explore at:
jsonAvailable download formats
Dataset provided by
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
Authors
GTS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The MS COCO (Microsoft Common Objects in Context) 2017 dataset is a large-scale benchmark for object detection, segmentation, key-point detection, and image captioning. It includes over 328K images with comprehensive annotations that drive advancements in computer vision research.
i
Microsoft COCO
ieee-dataport.org
Updated Jun 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Meng Shen (2025). Microsoft COCO [Dataset]. https://ieee-dataport.org/documents/microsoft-coco
Explore at:
Dataset updated
Jun 30, 2025
Authors
Meng Shen
Description
segmentation
t
T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P....
service.tib.eu
Updated Dec 17, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Doll´ar, C. L. Zitnick (2024). Dataset: Microsoft COCO (MS-COCO) Dataset. https://doi.org/10.57702/7dh4cogk [Dataset]. https://service.tib.eu/ldmservice/dataset/microsoft-coco--ms-coco--dataset
Explore at:
Dataset updated
Dec 17, 2024
Description
A large-scale dataset for object detection and instance segmentation.
R
Microsoft Coco 2017 Dataset
universe.roboflow.com
zip
Updated Feb 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jacob Solawetz (2025). Microsoft Coco 2017 Dataset [Dataset]. https://universe.roboflow.com/jacob-solawetz/microsoft-coco/model/9
Explore at:
zipAvailable download formats
Dataset updated
Feb 1, 2025
Dataset authored and provided by
Jacob Solawetz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Coco Objects Bounding Boxes
Description
This is the full 2017 COCO object detection dataset (train and valid), which is a subset of the most recent 2020 COCO object detection dataset.

COCO is a large-scale object detection, segmentation, and captioning dataset of many object types easily recognizable by a 4-year-old. The data is initially collected and published by Microsoft. The original source of the data is here and the paper introducing the COCO dataset is here.
coco dataset
kaggle.com
zip
Updated Jul 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ProgramerSalar (2025). coco dataset [Dataset]. https://www.kaggle.com/datasets/salargamer/coco-dataset
Explore at:
zip(20043918455 bytes)Available download formats
Dataset updated
Jul 5, 2025
Authors
ProgramerSalar
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
The COCO dataset is a foundational large-scale benchmark for object detection, segmentation, captioning, and keypoint analysis. Created by Microsoft, it features complex everyday scenes with common objects in their natural contexts. With over 330,000 images and 2.5 million labeled instances, it has become the gold standard for training and evaluating computer vision models.

File Information

images/ Contains 2 subdirectories split by usage: train2017/: Main training set (118K images) val2017/: Validation set (5K images) File Naming: 000000000009.jpg (12-digit zero-padded IDs) Formats: JPEG images with varying resolutions (average 640×480) annotations/ Contains task-specific JSON files with consistent naming: captions_*.json: 5 human-generated descriptions per image
a
COCO
datasets.activeloop.ai
huggingface.co
deeplake
Updated Feb 5, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tsung-Yi Lin (2022). COCO [Dataset]. https://datasets.activeloop.ai/docs/ml/datasets/coco-dataset/
Explore at:
deeplakeAvailable download formats
Dataset updated
Feb 5, 2022
Authors
Tsung-Yi Lin
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 2014 - Dec 31, 2015
Dataset funded by
Microsoft Research
Description
The COCO dataset is a large dataset of labeled images and annotations. It is a popular dataset for machine learning and artificial intelligence research. The dataset consists of 330,000 images and 500,000 object annotations. The annotations include the bounding boxes of objects in the images, as well as the labels of the objects.
Microsoft COCO (Zhao et al 2017)
kaggle.com
zip
Updated Oct 21, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rachael Tatman (2019). Microsoft COCO (Zhao et al 2017) [Dataset]. https://www.kaggle.com/rtatman/ms-coco
Explore at:
zip(19282796 bytes)Available download formats
Dataset updated
Oct 21, 2019
Authors
Rachael Tatman
Description
Context

This dataset contains pickled Python objects with data from the annotations of the Microsoft (MS) COCO dataset. COCO is a large-scale object detection, segmentation, and captioning dataset.

Content

Except for the objs file, which is a plain text file continuing a list of objects, the data in this dataset is all in the pickle format, a way of storing Python objects at binary data files.

Important: These pickles were pickled using Python 2. Since Kernels use Python 3, you will need to specify the encoding when unpickling these files. The Python utility scripts here have been updated to correctly unpickle these files.

# the correct syntax to read these pickled files into Python 3 pickle.load(open('file_path, 'rb'), encoding = "latin1")

Acknowledgements

As a derivative of the original COCO dataset, this dataset is distributed under a CC-BY 4.0 license. These files were distributed as part of the supporting materials for Zhao et al 2017. If you use these files in your work, please cite the following paper:

Zhao, J., Wang, T., Yatskar, M., Ordonez, V., & Chang, K. W. (2017). Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (pp. 2979-2989).
Coco Dataset
universe.roboflow.com
zip
Updated Oct 3, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Microsoft (2024). Coco Dataset [Dataset]. https://universe.roboflow.com/microsoft/coco-dataset-vdnr1/model/3
Explore at:
zipAvailable download formats
Dataset updated
Oct 3, 2024
Dataset authored and provided by
Microsofthttp://microsoft.com/
Variables measured
Common Objects Polygons
Description
COCO Dataset

## Overview COCO Dataset is a dataset for instance segmentation tasks - it contains Common Objects annotations for 123,272 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
g
COCO 2014 Dataset (for YOLOv3)
gts.ai
json
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED, COCO 2014 Dataset (for YOLOv3) [Dataset]. https://gts.ai/dataset-download/coco-2014-dataset-for-yolov3/
Explore at:
jsonAvailable download formats
Dataset authored and provided by
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The MS COCO (Microsoft Common Objects in Context) 2014 dataset is a large-scale benchmark for object detection, segmentation, and key-point detection. It contains 164,000+ annotated images across 80 object categories.
t
Tsung-Yi Lin, Michael Maire, Serge J. Belongie, Lubomir D. Bourdev, Ross B....
service.tib.eu
Updated Dec 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Tsung-Yi Lin, Michael Maire, Serge J. Belongie, Lubomir D. Bourdev, Ross B. Girshick, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, C. Lawrence Zitnick (2024). Dataset: Microsoft COCO: common objects in context. https://doi.org/10.57702/lydv1ylk [Dataset]. https://service.tib.eu/ldmservice/dataset/microsoft-coco--common-objects-in-context
Explore at:
Dataset updated
Dec 3, 2024
Description
The COCO dataset is a large-scale dataset for object detection and image classification.
COCO2017 Monochrome Greyscale YOLO annotations
kaggle.com
zip
Updated Aug 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alexander Y. (2024). COCO2017 Monochrome Greyscale YOLO annotations [Dataset]. https://www.kaggle.com/datasets/alexanderyyy/coco2017-monochrome-greyscale-yolo-annotations
Explore at:
zip(16332741878 bytes)Available download formats
Dataset updated
Aug 12, 2024
Authors
Alexander Y.
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Purpose: experiments with YOLO models in monochrome.

The original COCO2017 dataset has been processed: - added YOLO annotations for 80 classes; - all images are converted to monochrome (greyscale) with an equalized histogram.

The number of images: - training: 118,287; - validation: 5,000.

Links to the original COCO 2017 dataset https://cocodataset.org by Microsoft: url_images = 'http://images.cocodataset.org/zips/' url_annotations = 'http://images.cocodataset.org/annotations/annotations_trainval2017.zip'
h
coco
huggingface.co
Updated Mar 3, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Detection datasets (2023). coco [Dataset]. https://huggingface.co/datasets/detection-datasets/coco
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 3, 2023
Dataset authored and provided by
Detection datasets
Description
detection-datasets/coco dataset hosted on Hugging Face and contributed by the HF Datasets community
Z
COCO, LVIS, Open Images V4 classes mapping
data.niaid.nih.gov
zenodo.org
+1more
Updated Oct 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Giuseppe Amato; Paolo Bolettieri; Fabio Carrara; Fabrizio Falchi; Claudio Gennaro; Nicola Messina; Lucia Vadicamo; Claudio Vairo (2022). COCO, LVIS, Open Images V4 classes mapping [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7194299
Explore at:
Dataset updated
Oct 13, 2022
Dataset provided by
ISTI-CNR
Authors
Giuseppe Amato; Paolo Bolettieri; Fabio Carrara; Fabrizio Falchi; Claudio Gennaro; Nicola Messina; Lucia Vadicamo; Claudio Vairo
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This repository contains a mapping between the classes of COCO, LVIS, and Open Images V4 datasets into a unique set of 1460 classes.

COCO [Lin et al 2014] contains 80 classes, LVIS [gupta2019lvis] contains 1460 classes, Open Images V4 [Kuznetsova et al. 2020] contains 601 classes.

We built a mapping of these classes using a semi-automatic procedure in order to have a unique final list of 1460 classes. We also generated a hierarchy for each class, using wordnet

This repository contains the following files:

coco_classes_map.txt, contains the mapping for the 80 coco classes

lvis_classes_map.txt, contains the mapping for the 1460 coco classes

openimages_classes_map.txt, contains the mapping for the 601 coco classes

classname_hyperset_definition.csv, contains the final set of 1460 classes, their definition and hierarchy

all-classnames.xlsx, contains a side-by-side view of all classes considered

This mapping was used in VISIONE [Amato et al. 2021, Amato et al. 2022] that is a content-based retrieval system that supports various search functionalities (text search, object/color-based search, semantic and visual similarity search, temporal search). For the object detection VISIONE uses three pre-trained models: VfNet Zhang et al. 2021, Mask R-CNN He et al. 2017, and a Faster R-CNN+Inception ResNet (trained on the Open Images V4).

This is repository is released under a Creative Commons Attribution license, please cite the following paper if you use it in your work in any form:

@inproceedings{amato2021visione, title={The visione video search system: exploiting off-the-shelf text search engines for large-scale video retrieval}, author={Amato, Giuseppe and Bolettieri, Paolo and Carrara, Fabio and Debole, Franca and Falchi, Fabrizio and Gennaro, Claudio and Vadicamo, Lucia and Vairo, Claudio}, journal={Journal of Imaging}, volume={7}, number={5}, pages={76}, year={2021}, publisher={Multidisciplinary Digital Publishing Institute} }

References:

[Amato et al. 2022] Amato, G. et al. (2022). VISIONE at Video Browser Showdown 2022. In: , et al. MultiMedia Modeling. MMM 2022. Lecture Notes in Computer Science, vol 13142. Springer, Cham. https://doi.org/10.1007/978-3-030-98355-0_52

[Amato et al. 2021] Amato, G., Bolettieri, P., Carrara, F., Debole, F., Falchi, F., Gennaro, C., Vadicamo, L. and Vairo, C., 2021. The visione video search system: exploiting off-the-shelf text search engines for large-scale video retrieval. Journal of Imaging, 7(5), p.76.

[Gupta et al.2019] Gupta, A., Dollar, P. and Girshick, R., 2019. Lvis: A dataset for large vocabulary instance segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 5356-5364).

[He et al. 2017] He, K., Gkioxari, G., Dollár, P. and Girshick, R., 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision (pp. 2961-2969).

[Kuznetsova et al. 2020] Kuznetsova, A., Rom, H., Alldrin, N., Uijlings, J., Krasin, I., Pont-Tuset, J., Kamali, S., Popov, S., Malloci, M., Kolesnikov, A. and Duerig, T., 2020. The open images dataset v4. International Journal of Computer Vision, 128(7), pp.1956-1981.

[Lin et al. 2014] Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P. and Zitnick, C.L., 2014, September. Microsoft coco: Common objects in context. In European conference on computer vision (pp. 740-755). Springer, Cham.

[Zhang et al. 2021] Zhang, H., Wang, Y., Dayoub, F. and Sunderhauf, N., 2021. Varifocalnet: An iou-aware dense object detector. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 8514-8523).
O
COCO 2017
opendatalab.com
huggingface.co
zip
Updated Sep 30, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Microsoft (2017). COCO 2017 [Dataset]. https://opendatalab.com/OpenDataLab/COCO_2017
Explore at:
zip(49105147630 bytes)Available download formats
Dataset updated
Sep 30, 2017
Dataset provided by
Microsoft
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
COCO is a large-scale object detection, segmentation, and captioning dataset. COCO has several features: Object segmentation Recognition in context Superpixel stuff segmentation 330K images (>200K labeled) 1.5 million object instances 80 object categories 91 stuff categories 5 captions per image 250,000 people with keypoints
Microsoft Coco 2017 Dataset
universe.roboflow.com
zip
Updated Jan 4, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Roboflow Public (2022). Microsoft Coco 2017 Dataset [Dataset]. https://universe.roboflow.com/roboflow-public/microsoft-coco-2017-dataset
Explore at:
zipAvailable download formats
Dataset updated
Jan 4, 2022
Dataset provided by
Roboflowhttps://roboflow.com/
Authors
Roboflow Public
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Coco Objects Bounding Boxes
Description
Microsoft COCO 2017 Dataset

## Overview Microsoft COCO 2017 Dataset is a dataset for object detection tasks - it contains Coco Objects annotations for 2,245 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
COCO 2014 Val Subset
kaggle.com
zip
Updated Apr 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Masoud Ilkhani (2025). COCO 2014 Val Subset [Dataset]. https://www.kaggle.com/datasets/masoudilkhani/coco-2014-val-subset
Explore at:
zip(6858196257 bytes)Available download formats
Dataset updated
Apr 21, 2025
Authors
Masoud Ilkhani
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Source: This dataset is a subset of the MS COCO dataset, originally released by Microsoft under the CC BY 4.0 License. This subset was extracted for educational and research purposes.
COCO 2014 test
kaggle.com
zip
Updated Jul 23, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
akashdeepjassal (2019). COCO 2014 test [Dataset]. https://www.kaggle.com/akashdeepjassal/coco-2014-test
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Jul 23, 2019
Authors
akashdeepjassal
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
What is COCO?

COCO is a large-scale object detection, segmentation, and captioning dataset. COCO has several features:

MS-COCO website
Microsoft Coco Pose Detection Dataset
universe.roboflow.com
zip
Updated Jul 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Microsoft (2025). Microsoft Coco Pose Detection Dataset [Dataset]. https://universe.roboflow.com/microsoft/coco-pose-detection/model/3
Explore at:
zipAvailable download formats
Dataset updated
Jul 31, 2025
Dataset authored and provided by
Microsofthttp://microsoft.com/
Variables measured
Objects
Description
Microsoft COCO Pose Detection

## Overview Microsoft COCO Pose Detection is a dataset for computer vision tasks - it contains Objects annotations for 5,105 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.

Facebook

Twitter

Click to copy link

Link copied

Cite

Microsoft (2025). Microsoft Coco Dataset [Dataset]. https://universe.roboflow.com/microsoft/coco/model/3

Microsoft Coco Dataset

coco

microsoft-coco-dataset

Explore at:

zipAvailable download formats

Dataset updated

Jul 23, 2025

Dataset authored and provided by

Microsoft

Variables measured

Object Bounding Boxes

Description

Microsoft Common Objects in Context (COCO) Dataset

The Common Objects in Context (COCO) dataset is a widely recognized collection designed to spur object detection, segmentation, and captioning research. Created by Microsoft, COCO provides annotations, including object categories, keypoints, and more. The model it a valuable asset for machine learning practitioners and researchers. Today, many model architectures are benchmarked against COCO, which has enabled a standard system by which architectures can be compared.

While COCO is often touted to comprise over 300k images, it's pivotal to understand that this number includes diverse formats like keypoints, among others. Specifically, the labeled dataset for object detection stands at 123,272 images.

The full object detection labeled dataset is made available here, ensuring researchers have access to the most comprehensive data for their experiments. With that said, COCO has not released their test set annotations, meaning the test data doesn't come with labels. Thus, this data is not included in the dataset.

The Roboflow team has worked extensively with COCO. Here are a few links that may be helpful as you get started working with this dataset:

Clear search

Close search

Google apps

Main menu

Microsoft Coco Dataset

Microsoft Common Objects in Context (COCO) Dataset

Microsoft COCO 2017 Object Detection Dataset - raw

COCO Dataset 2017

Microsoft COCO

T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P....

Microsoft Coco 2017 Dataset

coco dataset

File Information

COCO

Microsoft COCO (Zhao et al 2017)

Context

Content

Acknowledgements

Coco Dataset

COCO Dataset

COCO 2014 Dataset (for YOLOv3)

Tsung-Yi Lin, Michael Maire, Serge J. Belongie, Lubomir D. Bourdev, Ross B....

COCO2017 Monochrome Greyscale YOLO annotations

coco

COCO, LVIS, Open Images V4 classes mapping

COCO 2017

Microsoft Coco 2017 Dataset

Microsoft COCO 2017 Dataset

COCO 2014 Val Subset

COCO 2014 test

What is COCO?

Microsoft Coco Pose Detection Dataset

Microsoft COCO Pose Detection

Microsoft Coco Dataset

coco

microsoft-coco-dataset

Microsoft Common Objects in Context (COCO) Dataset