100+ datasets found
  1. Real Time Anomaly Detection in CCTV Surveillance

    • kaggle.com
    Updated Apr 28, 2023
    Cite
    webadvisor (2023). Real Time Anomaly Detection in CCTV Surveillance [Dataset]. https://www.kaggle.com/datasets/webadvisor/real-time-anomaly-detection-in-cctv-surveillance
    Dataset updated
    Apr 28, 2023
    Dataset provided by
    Kaggle
    Authors
    webadvisor
    Description

    The UCF Crime dataset, arranged in a more convenient structure. Contains 1900 videos from 13 different categories. To ensure the quality of this dataset, ten annotators (with different levels of computer vision expertise) were trained to collect it. Videos were found on YouTube and LiveLeak using text search queries for each anomaly, with slight variations (e.g. “car crash”, “road accident”).

  2. tiny-video-dataset

    • huggingface.co
    Updated Sep 17, 2024
    Cite
    Hugging Face Internal Testing Organization (2024). tiny-video-dataset [Dataset]. https://huggingface.co/datasets/hf-internal-testing/tiny-video-dataset
    Dataset updated
    Sep 17, 2024
    Dataset provided by
    Hugging Face (https://huggingface.co/)
    Authors
    Hugging Face Internal Testing Organization
    Description

    hf-internal-testing/tiny-video-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. PE-Video

    • huggingface.co
    Updated Apr 17, 2025
    Cite
    AI at Meta (2025). PE-Video [Dataset]. https://huggingface.co/datasets/facebook/PE-Video
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    AI at Meta
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    PE Video Dataset (PVD)

    The PE Video Dataset (PVD) is a large-scale collection of 1 million diverse videos, featuring 120,000+ expertly annotated clips. The dataset was introduced in our paper "Perception Encoder".

      Overview
    

    PE Video Dataset (PVD) comprises 1M high-quality and diverse videos. Among them, 120K videos are accompanied by automated and human-verified annotations, and all videos are accompanied by video descriptions and keywords.… See the full description on the dataset page: https://huggingface.co/datasets/facebook/PE-Video.

  4. video-dataset

    • huggingface.co
    Updated May 11, 2025
    Cite
    ProgramerSalar (2025). video-dataset [Dataset]. https://huggingface.co/datasets/ProgramerSalar/video-dataset
    Dataset updated
    May 11, 2025
    Authors
    ProgramerSalar
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Video Dataset on Hugging Face

    This repository hosts the video dataset, a widely used benchmark dataset for human action recognition in videos. The dataset has been processed and uploaded to the Hugging Face Hub for easy access, sharing, and integration into machine learning workflows.

      Introduction
    

    The dataset is a large-scale video dataset designed for action recognition tasks. It contains 13,320 video clips across 101 action categories, making it one of the most… See the full description on the dataset page: https://huggingface.co/datasets/ProgramerSalar/video-dataset.

  5. Plenoptic Video Dataset Dataset

    • paperswithcode.com
    Updated Dec 29, 2024
    Cite
    Tianye Li; Mira Slavcheva; Michael Zollhoefer; Simon Green; Christoph Lassner; Changil Kim; Tanner Schmidt; Steven Lovegrove; Michael Goesele; Richard Newcombe; Zhaoyang Lv, Plenoptic Video Dataset Dataset [Dataset]. https://paperswithcode.com/dataset/plenoptic-video-dataset
    Dataset updated
    Dec 29, 2024
    Authors
    Tianye Li; Mira Slavcheva; Michael Zollhoefer; Simon Green; Christoph Lassner; Changil Kim; Tanner Schmidt; Steven Lovegrove; Michael Goesele; Richard Newcombe; Zhaoyang Lv
    Description

    3D video data asset of CVPR 2022 Paper "Neural 3D Video Synthesis"

  6. Long Video Dataset Dataset

    • paperswithcode.com
    Updated Nov 18, 2020
    Cite
    Yongqing Liang; Xin Li; Navid Jafari; Qin Chen (2020). Long Video Dataset Dataset [Dataset]. https://paperswithcode.com/dataset/long-video-dataset
    Dataset updated
    Nov 18, 2020
    Authors
    Yongqing Liang; Xin Li; Navid Jafari; Qin Chen
    Description

    We randomly selected three videos from the Internet that are longer than 1.5K frames and in which the main objects appear continuously. Each video has 20 uniformly sampled frames manually annotated for evaluation.

  7. YCB-Video Dataset

    • paperswithcode.com
    • library.toponeai.link
    Cite
    Yu Xiang; Tanner Schmidt; Venkatraman Narayanan; Dieter Fox, YCB-Video Dataset [Dataset]. https://paperswithcode.com/dataset/ycb-video
    Authors
    Yu Xiang; Tanner Schmidt; Venkatraman Narayanan; Dieter Fox
    Description

    The YCB-Video dataset is a large-scale video dataset for 6D object pose estimation. It provides accurate 6D poses of 21 objects from the YCB dataset observed in 92 videos with 133,827 frames.

  8. Dataset Video Dataset

    • universe.roboflow.com
    zip
    Updated Jul 25, 2024
    Cite
    Learn Yolo v8 (2024). Dataset Video Dataset [Dataset]. https://universe.roboflow.com/learn-yolo-v8/dataset-video
    Available download formats: zip
    Dataset updated
    Jul 25, 2024
    Dataset authored and provided by
    Learn Yolo v8
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Senang Murung Bingung Normal Bounding Boxes
    Description

    Dataset Video

    ## Overview
    
    Dataset Video is a dataset for object detection tasks - it contains Senang Murung Bingung Normal annotations for 226 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
    ## License
    
    This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
    
  9. Image & Video Datasets

    • sapien.io
    Updated Feb 11, 2025
    Cite
    Sapien (2025). Image & Video Datasets [Dataset]. https://www.sapien.io/dataset-marketplace/image-video-datasets-for-ai-applications
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Sapien
    License

    https://www.sapien.io/terms

    Description

    High-quality image and video datasets for AI training in computer vision applications, including object recognition, scene understanding, and more.

  10. i3-video Dataset

    • paperswithcode.com
    Updated Apr 28, 2020
    Cite
    Jack Hessel; Zhenhai Zhu; Bo Pang; Radu Soricut (2020). i3-video Dataset [Dataset]. https://paperswithcode.com/dataset/i3-video
    Dataset updated
    Apr 28, 2020
    Authors
    Jack Hessel; Zhenhai Zhu; Bo Pang; Radu Soricut
    Description

    The i3-video dataset contains "is-it-instructional" annotations for 6.4k videos from YouTube-8M. The videos are considered to be instructional if they focus on real-world human actions accompanied by procedural language that explains what’s happening on screen in reasonable detail.

  11. ProciGen-video dataset for "InterTrack: Tracking Human Object Interaction...

    • edmond.mpg.de
    tar, zip
    Updated Mar 22, 2025
    Cite
    Xianghui Xie (2025). ProciGen-video dataset for "InterTrack: Tracking Human Object Interaction without Object Templates" (3DV'25) [Dataset]. http://doi.org/10.17617/3.B6BM5R
    Available download formats: zip(23164925414), zip(90311075518), zip(18509263726), zip(42254982775), zip(7463933343), zip(14903265605), zip(29849772469), zip(7638586699), zip(69254618545), zip(3313569089), zip(642625962), zip(47439677402), zip(52010009771), zip(92916969277), tar(1190041600), zip(22367831094), zip(34158105311), zip(23334561347)
    Dataset updated
    Mar 22, 2025
    Dataset provided by
    Edmond
    Authors
    Xianghui Xie
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A large-scale synthetic dataset about dynamic human-object interactions. It features about 10 hours of video with 8337 sequences and 2M images. The generation of this dataset is described in the paper "InterTrack: Tracking Human Object Interaction without Object Templates" (3DV'25). Please check the GitHub repo for the detailed file structure of the dataset: https://github.com/xiexh20/ProciGen

    If you use our data, please cite:

    @inproceedings{xie2024InterTrack,
      title = {InterTrack: Tracking Human Object Interaction without Object Templates},
      author = {Xie, Xianghui and Lenssen, Jan Eric and Pons-Moll, Gerard},
      booktitle = {International Conference on 3D Vision (3DV)},
      month = {March},
      year = {2025},
    }
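    The download list for this entry gives a number in parentheses after each archive type. As a rough planning aid, the sketch below totals those numbers, on the assumption (not stated in the listing) that they are file sizes in bytes. The `LISTING` constant and the `parse_formats` helper are hypothetical names introduced only for this illustration.

```python
import re

# Format list copied from this entry. ASSUMPTION: each number is a size in bytes.
LISTING = (
    "zip(23164925414), zip(90311075518), zip(18509263726), zip(42254982775), "
    "zip(7463933343), zip(14903265605), zip(29849772469), zip(7638586699), "
    "zip(69254618545), zip(3313569089), zip(642625962), zip(47439677402), "
    "zip(52010009771), zip(92916969277), tar(1190041600), zip(22367831094), "
    "zip(34158105311), zip(23334561347)"
)


def parse_formats(listing):
    """Return (format, size) pairs from a 'fmt(size), fmt(size)' string."""
    return [(fmt, int(size)) for fmt, size in re.findall(r"(\w+)\((\d+)\)", listing)]


entries = parse_formats(LISTING)
total_bytes = sum(size for _, size in entries)

# Report the archive count and the total in decimal gigabytes (1 GB = 1e9 bytes).
print(f"{len(entries)} archives, about {total_bytes / 1e9:.1f} GB in total")
```

    If the byte assumption holds, the total is a lower bound on the disk space needed, since the archives still have to be extracted.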

  12. Sintel 4D Light Field Video Dataset

    • ieee-dataport.org
    Updated Mar 26, 2021
    Cite
    Takahiro Kinoshita (2021). Sintel 4D Light Field Video Dataset [Dataset]. https://ieee-dataport.org/open-access/sintel-4d-light-field-video-dataset
    Dataset updated
    Mar 26, 2021
    Authors
    Takahiro Kinoshita
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    9x9 views

  13. Human Activity Recognition (UCF50): Video Dataset

    • kaggle.com
    Updated Jan 26, 2024
    Cite
    VK (2024). Human Activity Recognition (UCF50): Video Dataset [Dataset]. https://www.kaggle.com/datasets/venkatkumar001/human-activity-recognition-ucf50-video-dataset
    Dataset updated
    Jan 26, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    VK
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    It is used for developing human activity recognition and classification.

  14. Kinetics Dataset

    • paperswithcode.com
    Updated Apr 21, 2021
    Cite
    Will Kay; Joao Carreira; Karen Simonyan; Brian Zhang; Chloe Hillier; Sudheendra Vijayanarasimhan; Fabio Viola; Tim Green; Trevor Back; Paul Natsev; Mustafa Suleyman; Andrew Zisserman (2021). Kinetics Dataset [Dataset]. https://paperswithcode.com/dataset/kinetics
    Dataset updated
    Apr 21, 2021
    Authors
    Will Kay; Joao Carreira; Karen Simonyan; Brian Zhang; Chloe Hillier; Sudheendra Vijayanarasimhan; Fabio Viola; Tim Green; Trevor Back; Paul Natsev; Mustafa Suleyman; Andrew Zisserman
    Description

    The Kinetics dataset is a large-scale, high-quality dataset for human action recognition in videos. The dataset consists of around 500,000 video clips covering 600 human action classes with at least 600 video clips for each action class. Each video clip lasts around 10 seconds and is labeled with a single action class. The videos are collected from YouTube.

  15. D²-City: A Large-Scale Dashcam Video Dataset of Diverse Traffic Scenarios

    • scidb.cn
    Updated Feb 4, 2021
    Cite
    Zhengping Che; Bo Jiang; Yiping Meng; Guangyu Li; Tracy Li; Ke Dong; Xinsheng Zhang; Xuefeng Shi; Ying Lyu; Guobin Wu; Yan Liu; Jian Tang; Jieping Ye (2021). D²-City: A Large-Scale Dashcam Video Dataset of Diverse Traffic Scenarios [Dataset]. http://doi.org/10.11922/sciencedb.00603
    Dataset updated
    Feb 4, 2021
    Dataset provided by
    Science Data Bank
    Authors
    Zhengping Che; Bo Jiang; Yiping Meng; Guangyu Li; Tracy Li; Ke Dong; Xinsheng Zhang; Xuefeng Shi; Ying Lyu; Guobin Wu; Yan Liu; Jian Tang; Jieping Ye
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    D²-City is a large-scale driving video dataset that provides more than 10,000 dashcam videos recorded in 720p HD or 1080p FHD. Around 1000 of the videos come with detection and tracking annotation in each frame of all road objects, including bounding boxes and the tracking IDs of cars, vans, buses, trucks, pedestrians, motorcycles, bicycles, open- and closed-tricycles, forklifts, and large- and small-blocks. Some of the remaining videos come with road objects annotated in keyframes. Compared with existing datasets, D²-City benefits from its great diversity, as the data is collected from several cities throughout China and features varying weather, road, and traffic conditions. D²-City pays special attention to challenges in complex and varied traffic scenarios. By bringing more challenging cases to the community, we hope that this dataset will encourage and help new advances in the perception area of intelligent driving. The D²-City dataset and the corresponding challenges were originally hosted on DiDi GAIA's platform (URL: https://outreach.didichuxing.com/d2city/d2city)

  16. Deep Video Understanding Annotations Dataset

    • data.nist.gov
    • catalog.data.gov
    Updated May 25, 2021
    Cite
    National Institute of Standards and Technology (2021). Deep Video Understanding Annotations Dataset [Dataset]. http://doi.org/10.18434/mds2-2535
    Dataset updated
    May 25, 2021
    Dataset provided by
    National Institute of Standards and Technology (http://www.nist.gov/)
    License

    https://www.nist.gov/open/license

    Description

    The BBC Land Girls TV series is a three-season series. Each season has 5 episodes of about 45 minutes each. The TRECVID group at NIST worked with the BBC Corp. to release the dataset to the research community for work on video understanding tasks. Unfortunately, the hosting arrangement for the dataset was not successful and the video dataset could not be released. We are releasing the annotations conducted by NIST, without any video data, so that researchers interested in working on knowledge graph understanding and natural language analysis can take advantage of them.

  17. Gen-Video Dataset

    • paperswithcode.com
    Updated Jun 19, 2025
    Cite
    Haoxing Chen; Yan Hong; Zizheng Huang; Zhuoer Xu; Zhangxuan Gu; Yaohui Li; Jun Lan; Huijia Zhu; Jianfu Zhang; Weiqiang Wang; Huaxiong Li (2025). Gen-Video Dataset [Dataset]. https://paperswithcode.com/dataset/gen-video
    Dataset updated
    Jun 19, 2025
    Authors
    Haoxing Chen; Yan Hong; Zizheng Huang; Zhuoer Xu; Zhangxuan Gu; Yaohui Li; Jun Lan; Huijia Zhu; Jianfu Zhang; Weiqiang Wang; Huaxiong Li
    Description

    The first AI-generated video detection datasets.

  18. 110K Sensitive Video Dataset

    • ieee-dataport.org
    Updated Feb 3, 2022
    Cite
    Pedro Almeida de Freitas (2022). 110K Sensitive Video Dataset [Dataset]. https://ieee-dataport.org/documents/110k-sensitive-video-dataset
    Dataset updated
    Feb 3, 2022
    Authors
    Pedro Almeida de Freitas
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ATTENTION: THIS DATASET DOES NOT HOST ANY SOURCE VIDEOS. WE PROVIDE ONLY HIDDEN FEATURES GENERATED BY PRE-TRAINED DEEP MODELS AS DATA

  19. Talking Head Video Dataset - 23k identities

    • lipsynthesis.com
    csv
    Updated Mar 31, 2025
    Cite
    LipSynthesis (2025). Talking Head Video Dataset - 23k identities [Dataset]. https://lipsynthesis.com/dataset
    Available download formats: csv
    Dataset updated
    Mar 31, 2025
    Dataset authored and provided by
    LipSynthesis
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    We provide a comprehensive talking-head video dataset with over 50,000 videos, totaling more than 600 hours of footage and featuring 20,841 unique identities from around the world.

  20. 20 Video Dataset

    • universe.roboflow.com
    zip
    Updated Jun 2, 2025
    Cite
    TENNIS4 (2025). 20 Video Dataset [Dataset]. https://universe.roboflow.com/tennis4/20-video
    Available download formats: zip
    Dataset updated
    Jun 2, 2025
    Dataset authored and provided by
    TENNIS4
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Bcdj Bounding Boxes
    Description

    20 Video

    ## Overview
    
    20 Video is a dataset for object detection tasks - it contains Bcdj annotations for 760 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
    ## License
    
    This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
    
Real Time Anomaly Detection in CCTV Surveillance

Contains videos for 13 different classes of anomalies, plus normal events.

2 scholarly articles cite this dataset.