UCF Crime Dataset in the most suitable structure. It contains 1,900 videos from 13 different categories. To ensure the quality of the dataset, ten annotators (with different levels of computer vision expertise) were trained to collect it. Videos were found by searching YouTube and LiveLeak with text queries for each anomaly (with slight variations, e.g. “car crash”, “road accident”).
hf-internal-testing/tiny-video-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
PE Video Dataset (PVD)
[📃 Tech Report] [📂 Github] The PE Video Dataset (PVD) is a large-scale collection of 1 million diverse videos, featuring 120,000+ expertly annotated clips. The dataset was introduced in our paper "Perception Encoder".
Overview
PE Video Dataset (PVD) comprises 1M high-quality and diverse videos. Among them, 120K videos are accompanied by automated and human-verified annotations, and all videos come with a video description and keywords.… See the full description on the dataset page: https://huggingface.co/datasets/facebook/PE-Video.
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Video Dataset on Hugging Face
This repository hosts the video dataset, a widely used benchmark dataset for human action recognition in videos. The dataset has been processed and uploaded to the Hugging Face Hub for easy access, sharing, and integration into machine learning workflows.
Introduction
The dataset is a large-scale video dataset designed for action recognition tasks. It contains 13,320 video clips across 101 action categories, making it one of the most… See the full description on the dataset page: https://huggingface.co/datasets/ProgramerSalar/video-dataset.
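Datasets hosted on the Hub like this one can typically be pulled with the `datasets` library. The snippet below is a minimal sketch: the `train` split name and `label` column are assumptions and may not match the repository's actual layout.

```python
def class_distribution(labels):
    """Count how many clips fall into each action category."""
    counts = {}
    for label in labels:
        counts[label] = counts.get(label, 0) + 1
    return counts

if __name__ == "__main__":
    # Requires `pip install datasets`; split and column names are assumptions.
    from datasets import load_dataset
    ds = load_dataset("ProgramerSalar/video-dataset", split="train")
    print(class_distribution(ds["label"]))
```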
3D video data asset of CVPR 2022 Paper "Neural 3D Video Synthesis"
We randomly selected three videos from the Internet that are longer than 1.5K frames and whose main objects appear continuously. Each video has 20 uniformly sampled frames manually annotated for evaluation.
The YCB-Video dataset is a large-scale video dataset for 6D object pose estimation. It provides accurate 6D poses of 21 objects from the YCB dataset observed in 92 videos with 133,827 frames.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
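A 6D pose, as annotated in datasets like this one, is a 3D rotation plus a 3D translation. As a quick illustrative sketch (not tied to any dataset's actual file format), applying such a pose to a model point looks like:

```python
def apply_pose(R, t, p):
    """Rotate point p by 3x3 matrix R, then translate by vector t."""
    return [sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3)]

# 90-degree rotation about the z-axis, followed by a shift along x.
R = [[0, -1, 0],
     [1,  0, 0],
     [0,  0, 1]]
t = [1.0, 0.0, 0.0]
print(apply_pose(R, t, [1.0, 0.0, 0.0]))  # -> [1.0, 1.0, 0.0]
```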
## Overview
Dataset Video is a dataset for object detection tasks - it contains Senang, Murung, Bingung, and Normal annotations for 226 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
https://www.sapien.io/terms
High-quality image and video datasets for AI training in computer vision applications, including object recognition, scene understanding, and more.
The i3-video dataset contains "is-it-instructional" annotations for 6.4k videos from YouTube-8M. Videos are considered instructional if they focus on real-world human actions accompanied by procedural language that explains, in reasonable detail, what is happening on screen.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A large-scale synthetic dataset of dynamic human-object interactions. It features about 10 hours of video with 8,337 sequences and 2M images. The generation of this dataset is described in the paper "InterTrack: Tracking Human Object Interaction without Object Templates" (3DV'25). Please check the GitHub repo for the detailed file structure of the dataset: https://github.com/xiexh20/ProciGen If you use our data, please cite:

@inproceedings{xie2024InterTrack,
  title     = {InterTrack: Tracking Human Object Interaction without Object Templates},
  author    = {Xie, Xianghui and Lenssen, Jan Eric and Pons-Moll, Gerard},
  booktitle = {International Conference on 3D Vision (3DV)},
  month     = {March},
  year      = {2025},
}
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
9x9 views
https://creativecommons.org/publicdomain/zero/1.0/
It is used to develop human activity recognition and classification.
The Kinetics dataset is a large-scale, high-quality dataset for human action recognition in videos. The dataset consists of around 500,000 video clips covering 600 human action classes with at least 600 video clips for each action class. Each video clip lasts around 10 seconds and is labeled with a single action class. The videos are collected from YouTube.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
D²-City is a large-scale driving video dataset that provides more than 10,000 dashcam videos recorded in 720p HD or 1080p FHD. Around 1,000 of the videos come with detection and tracking annotations in every frame for all road objects, including bounding boxes and tracking IDs of cars, vans, buses, trucks, pedestrians, motorcycles, bicycles, open and closed tricycles, forklifts, and large and small blocks. Some of the remaining videos come with road objects annotated in keyframes. Compared with existing datasets, D²-City benefits from great diversity, as data was collected from several cities throughout China and features varying weather, road, and traffic conditions. D²-City pays special attention to challenges in complex and varied traffic scenarios. By bringing more challenging cases to the community, we hope that this dataset will encourage and help new advances in perception for intelligent driving. The D²-City dataset and the corresponding challenges were originally hosted on DiDi GAIA's platform (URL: https://outreach.didichuxing.com/d2city/d2city)
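Per-frame detection-and-tracking annotations of this kind are commonly consumed by grouping boxes that share a tracking ID into per-object tracks. A minimal sketch follows; the `(frame, track_id, bbox)` tuple layout is an illustrative assumption, not D²-City's actual annotation format.

```python
def group_tracks(detections):
    """Group (frame_index, track_id, bbox) detections into per-object
    tracks, each sorted by frame index."""
    tracks = {}
    for frame, track_id, bbox in detections:
        tracks.setdefault(track_id, []).append((frame, bbox))
    for track in tracks.values():
        track.sort()  # order each object's observations by frame
    return tracks

# Two objects observed across two frames (bbox = x, y, w, h).
dets = [
    (1, "car-1", (12, 21, 50, 30)),
    (0, "car-1", (10, 20, 50, 30)),
    (0, "ped-7", (80, 40, 15, 40)),
]
tracks = group_tracks(dets)
print(sorted(tracks))  # -> ['car-1', 'ped-7']
```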
https://www.nist.gov/open/license
The BBC Land Girls TV series is a three-season series; each season has five episodes of about 45 minutes each. The TRECVID group at NIST worked with the BBC to release the dataset to the research community for work on video understanding tasks. Unfortunately, the hosting arrangement for the dataset fell through and the video data could not be released. We are releasing the annotations produced by NIST, without any video data, so that researchers interested in knowledge graph understanding and natural language analysis can take advantage of them.
One of the first AI-generated video detection datasets.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ATTENTION: THIS DATASET DOES NOT HOST ANY SOURCE VIDEOS. WE PROVIDE ONLY HIDDEN FEATURES GENERATED BY PRE-TRAINED DEEP MODELS AS DATA
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
We provide a comprehensive talking-head video dataset with over 50,000 videos, totaling more than 600 hours of footage and featuring 20,841 unique identities from around the world.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
20 Video is a dataset for object detection tasks - it contains Bcdj annotations for 760 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).