7 datasets found
  1. P

    Objaverse Dataset

    • paperswithcode.com
    Updated Jan 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matt Deitke; Dustin Schwenk; Jordi Salvador; Luca Weihs; Oscar Michel; Eli VanderBilt; Ludwig Schmidt; Kiana Ehsani; Aniruddha Kembhavi; Ali Farhadi (2025). Objaverse Dataset [Dataset]. https://paperswithcode.com/dataset/objaverse
    Explore at:
    Dataset updated
    Jan 20, 2025
    Authors
    Matt Deitke; Dustin Schwenk; Jordi Salvador; Luca Weihs; Oscar Michel; Eli VanderBilt; Ludwig Schmidt; Kiana Ehsani; Aniruddha Kembhavi; Ali Farhadi
    Description

    Objaverse is a large dataset of objects with 800K+ (and growing) 3D models with descriptive captions, tags, and animations. Objaverse improves upon present day 3D repositories in terms of scale, number of categories, and in the visual diversity of instances within a category.

  2. P

    BigDetection Dataset

    • paperswithcode.com
    Updated May 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Likun Cai; Zhi Zhang; Yi Zhu; Li Zhang; Mu Li; xiangyang xue (2022). BigDetection Dataset [Dataset]. https://paperswithcode.com/dataset/bigdetection
    Explore at:
    Dataset updated
    May 8, 2022
    Authors
    Likun Cai; Zhi Zhang; Yi Zhu; Li Zhang; Mu Li; xiangyang xue
    Description

    BigDetection is a new large-scale benchmark to build more general and powerful object detection systems. It leverages the training data from existing datasets (LVIS, OpenImages and Object365) with carefully designed principles, and curate a larger dataset for improved detector pre-training. BigDetection dataset has 600 object categories and contains 3.4M training images with 36M object bounding boxes.

  3. P

    PACO Dataset

    • paperswithcode.com
    Updated Jan 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vignesh Ramanathan; Anmol Kalia; Vladan Petrovic; Yi Wen; Baixue Zheng; Baishan Guo; Rui Wang; Aaron Marquez; Rama Kovvuri; Abhishek Kadian; Amir Mousavi; Yiwen Song; Abhimanyu Dubey; Dhruv Mahajan (2023). PACO Dataset [Dataset]. https://paperswithcode.com/dataset/paco
    Explore at:
    Dataset updated
    Jan 3, 2023
    Authors
    Vignesh Ramanathan; Anmol Kalia; Vladan Petrovic; Yi Wen; Baixue Zheng; Baishan Guo; Rui Wang; Aaron Marquez; Rama Kovvuri; Abhishek Kadian; Amir Mousavi; Yiwen Song; Abhimanyu Dubey; Dhruv Mahajan
    Description

    Parts and Attributes of Common Objects (PACO) is a detection dataset that goes beyond traditional object boxes and masks and provides richer annotations such as part masks and attributes. It spans 75 object categories, 456 object-part categories and 55 attributes across image (LVIS) and video (Ego4D) datasets. The dataset contains 641K part masks annotated across 260K object boxes, with half of them exhaustively annotated with attributes as well.

  4. P

    BURST Dataset

    • paperswithcode.com
    Updated Dec 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ali Athar; Jonathon Luiten; Paul Voigtlaender; Tarasha Khurana; Achal Dave; Bastian Leibe; Deva Ramanan (2023). BURST Dataset [Dataset]. https://paperswithcode.com/dataset/burst
    Explore at:
    Dataset updated
    Dec 13, 2023
    Authors
    Ali Athar; Jonathon Luiten; Paul Voigtlaender; Tarasha Khurana; Achal Dave; Bastian Leibe; Deva Ramanan
    Description

    BURST is a benchmark suite built upon TAO that requires tracking and segmenting multiple objects from camera video. The benchmark contains 6 different sub-tasks divided into 2 groups that all share the same data for training/validation/testing.

    Class-guided

    Common: Track and segment all objects belonging to a set of 78 common classes (based on the COCO class set) Long-tail: Track and segment all objects belonging to an extended set of 482 object classes (based on the LVIS class set) Open-world: Methods are only allowed to use the annotations of the 78 common classes during training, but during inference they are expected to track and segment all 482 object classes (class label predictions are not required)

    Exemplar-guided

    Mask: Track and segment all objects in the video for which the first-frame object masks are given. This task is identical to Video Object Segmentation (VOS). Box: Track and segment all objects in the video for which the first-frame object bounding-boxes are given. Point: Track and segment all objects in the video for which we are only given the (x,y) point coordinates of the mask centroid in the first-frame in which the objects appear.

    An illustration of the task hierarchy is given here and a detailed explanation is given in Sec. 5 of the dataset paper

  5. P

    CARER Dataset

    • paperswithcode.com
    Updated May 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elvis Saravia; Hsien-Chi Toby Liu; Yen-Hao Huang; Junlin Wu; Yi-Shin Chen (2023). CARER Dataset [Dataset]. https://paperswithcode.com/dataset/emotion
    Explore at:
    Dataset updated
    May 16, 2023
    Authors
    Elvis Saravia; Hsien-Chi Toby Liu; Yen-Hao Huang; Junlin Wu; Yi-Shin Chen
    Description

    CARER is an emotion dataset collected through noisy labels, annotated via distant supervision as in (Go et al., 2009).

    The subset of data provided here corresponds to the six emotions variant described in the paper. The six emotions are anger, fear, joy, love, sadness, and surprise.

  6. P

    Argoverse-HD Dataset

    • paperswithcode.com
    Updated Mar 29, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mengtian Li; Yu-Xiong Wang; Deva Ramanan (2023). Argoverse-HD Dataset [Dataset]. https://paperswithcode.com/dataset/argoverse-hd
    Explore at:
    Dataset updated
    Mar 29, 2023
    Authors
    Mengtian Li; Yu-Xiong Wang; Deva Ramanan
    Description

    Argoverse-HD is a dataset built for streaming object detection, which encompasses real-time object detection, video object detection, tracking, and short-term forecasting. It contains the video data from Argoverse 1.1 with our own MS COCO-style bounding box annotations with track IDs. The annotations are backward-compatible with COCO as one can directly evaluate COCO pre-trained models on this dataset to estimate the efficiency or the cross-dataset generalization capability of the models. The dataset contains high-quality and temporally-dense annotations for high-resolution videos (1920 x 1200 @ 30 FPS). Overall, there are 70,000 image frames and 1.3 million bounding boxes.

    Argoverse-HD is the dataset used in the Streaming Perception Challenge, which includes two tracks:

    Detection-only (real-time object detection). In this track, the participants will develop single-frame object detectors as they would for COCO and LVIS challenges. The crucial distinction is that the evaluation will score latency through streaming accuracy. Full-stack. In this track, the method is unrestricted. However, most likely tracking and forecasting will be used to compensate for the latency of the detectors.

    By default, all submissions measure their latency on a V100 GPU with the official toolkit.

  7. P

    OmniObject3D Dataset

    • paperswithcode.com
    Updated Jan 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tong Wu; Jiarui Zhang; Xiao Fu; Yuxin Wang; Jiawei Ren; Liang Pan; Wayne Wu; Lei Yang; Jiaqi Wang; Chen Qian; Dahua Lin; Ziwei Liu (2025). OmniObject3D Dataset [Dataset]. https://paperswithcode.com/dataset/omniobject3d
    Explore at:
    Dataset updated
    Jan 12, 2025
    Authors
    Tong Wu; Jiarui Zhang; Xiao Fu; Yuxin Wang; Jiawei Ren; Liang Pan; Wayne Wu; Lei Yang; Jiaqi Wang; Chen Qian; Dahua Lin; Ziwei Liu
    Description

    OmniObject3D is a large vocabulary 3D object dataset with massive high-quality real-scanned 3D objects. OmniObject3D has several appealing properties:

    1) Large Vocabulary: It comprises 6,000 scanned objects in 190 daily categories, sharing common classes with popular 2D datasets (e.g., ImageNet and LVIS), benefiting the pursuit of generalizable 3D representations.

    2) Rich Annotations: Each 3D object is captured with both 2D and 3D sensors, providing textured meshes, point clouds, multiview rendered images, and multiple real-captured videos.

    3) Realistic Scans: The professional scanners support highquality object scans with precise shapes and realistic appearances.

  8. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Matt Deitke; Dustin Schwenk; Jordi Salvador; Luca Weihs; Oscar Michel; Eli VanderBilt; Ludwig Schmidt; Kiana Ehsani; Aniruddha Kembhavi; Ali Farhadi (2025). Objaverse Dataset [Dataset]. https://paperswithcode.com/dataset/objaverse

Objaverse Dataset

Explore at:
Dataset updated
Jan 20, 2025
Authors
Matt Deitke; Dustin Schwenk; Jordi Salvador; Luca Weihs; Oscar Michel; Eli VanderBilt; Ludwig Schmidt; Kiana Ehsani; Aniruddha Kembhavi; Ali Farhadi
Description

Objaverse is a large dataset of objects with 800K+ (and growing) 3D models with descriptive captions, tags, and animations. Objaverse improves upon present day 3D repositories in terms of scale, number of categories, and in the visual diversity of instances within a category.

Search
Clear search
Close search
Google apps
Main menu