13 datasets found

a
Kinetics400 Dataset: The Kinetics Human Action Video Dataset
academictorrents.com
bittorrent
Updated Nov 7, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Will Kay, Joao Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Gree, Trevor Back, Paul Natsev, Mustafa Suleyman, Andrew Zisserman (2020). Kinetics400 Dataset: The Kinetics Human Action Video Dataset [Dataset]. https://academictorrents.com/details/184d11318372f70018cf9a72ef867e2fb9ce1d26
Explore at:
bittorrent(157312153084)Available download formats
Dataset updated
Nov 7, 2020
Dataset authored and provided by
Will Kay, Joao Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Gree, Trevor Back, Paul Natsev, Mustafa Suleyman, Andrew Zisserman
License
https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified
Description
MD5 checksum: kinetics400.zip: 33224b5b77c634aa6717da686efce2d4 kinetics400_validation.zip: 013358d458477d7ac10cebb9e84df354
Kinetics-400-[test-set]
kaggle.com
Updated Sep 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Innat (2023). Kinetics-400-[test-set] [Dataset]. https://www.kaggle.com/datasets/ipythonx/k4testset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 11, 2023
Dataset provided by
Kaggle
Authors
Innat
Description
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1984321%2Fee10abf5409ea4eaaad3dfaa9514a4bb%2FScreenshot_2021-08-06_at_16.15.03.png?generation=1694441423300452&alt=media" alt="">

Video Action Recognition : Kinetics 400

The dataset contains 400 human action classes, with at least 400 video clips for each action. Each clip lasts around 10s and is taken from a different YouTube video. The actions are human focussed and cover a broad range of classes including human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands. Homepage.

License

The kinetics dataset is licensed by Google Inc. under a Creative Commons Attribution 4.0 International License. Published. May 22, 2017.
h
kinetics400
huggingface.co
Updated Jun 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Huan Liu (2025). kinetics400 [Dataset]. https://huggingface.co/datasets/liuhuanjim013/kinetics400
Explore at:
Dataset updated
Jun 4, 2025
Authors
Huan Liu
Description
Kinetics-400 Video Dataset

This dataset is derived from the Kinetics-400 dataset, which is released under the Creative Commons Attribution 4.0 International License (CC BY 4.0).

Attribution

This dataset is derived from:

Original Dataset: Kinetics-400 Original Authors: Will Kay, Joao Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Green, Trevor Back, Paul Natsev, Mustafa Suleyman, Andrew Zisserman Original Paper: "The… See the full description on the dataset page: https://huggingface.co/datasets/liuhuanjim013/kinetics400.
t
Kinetics400 dataset for action recognition - Dataset - LDM
service.tib.eu
Updated Dec 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Kinetics400 dataset for action recognition - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/kinetics400-dataset-for-action-recognition
Explore at:
Dataset updated
Dec 16, 2024
Description
The dataset used in the paper is Kinetics400 for action recognition.
Kinetics 400
opendatalab.com
zip
Updated Sep 2, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DeepMind (2022). Kinetics 400 [Dataset]. https://opendatalab.com/OpenMMLab/Kinetics-400
Explore at:
zip(163566716003 bytes)Available download formats
Dataset updated
Sep 2, 2022
Dataset provided by
Google DeepMindhttp://deepmind.com/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Kinetics dataset is a large-scale, high-quality dataset for human action recognition in videos. The dataset consists of around 500,000 video clips covering 600 human action classes with at least 600 video clips for each action class. Each video clip lasts around 10 seconds and is labeled with a single action class. The videos are collected from YouTube.
h
kinetics400-bootstapir_checkpoint_v2-crop_still_edges
huggingface.co
Updated Dec 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Johnathan Xie (2024). kinetics400-bootstapir_checkpoint_v2-crop_still_edges [Dataset]. https://huggingface.co/datasets/jxie/kinetics400-bootstapir_checkpoint_v2-crop_still_edges
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 22, 2024
Authors
Johnathan Xie
Description
jxie/kinetics400-bootstapir_checkpoint_v2-crop_still_edges dataset hosted on Hugging Face and contributed by the HF Datasets community
t
Kinetics-400, UCF101, HMDB51, Something-Something V1, and...
service.tib.eu
Updated Dec 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Kinetics-400, UCF101, HMDB51, Something-Something V1, and Something-Something V2 [Dataset]. https://service.tib.eu/ldmservice/dataset/kinetics-400--ucf101--hmdb51--something-something-v1--and-something-something-v2
Explore at:
Dataset updated
Dec 2, 2024
Description
The Kinetics-400, UCF101, HMDB51, Something-Something V1, and Something-Something V2 datasets are used for evaluating the performance of the Bi-Calibration Networks.
h
kinetics-400-splits
huggingface.co
Updated Nov 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kiyoon (2024). kinetics-400-splits [Dataset]. https://huggingface.co/datasets/kiyoonkim/kinetics-400-splits
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 23, 2024
Authors
Kiyoon
Description
kiyoonkim/kinetics-400-splits dataset hosted on Hugging Face and contributed by the HF Datasets community
O
Kinetics 600
opendatalab.com
zip
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DeepMind, Kinetics 600 [Dataset]. https://opendatalab.com/OpenMMLab/Kinetics600
Explore at:
zipAvailable download formats
Dataset provided by
DeepMind
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Kinetics-600 is a large-scale action recognition dataset which consists of around 480K videos from 600 action categories. The 480K videos are divided into 390K, 30K, 60K for training, validation and test sets, respectively. Each video in the dataset is a 10-second clip of action moment annotated from raw YouTube video. It is an extensions of the Kinetics-400 dataset.
Kinetics400_RGB256_F10_val_data
kaggle.com
Updated Nov 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
BM Ji (2023). Kinetics400_RGB256_F10_val_data [Dataset]. https://www.kaggle.com/datasets/jizeyong/kinetics400-rgb256-f10-val-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 23, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
BM Ji
Description
Dataset

This dataset was created by BM Ji

Contents
t
Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing...
service.tib.eu
Updated Dec 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing Zuo, Guang Dai, Jingdong Wang, Yong Liu (2024). Dataset: Kinetics-400 and Something-Something-V2. https://doi.org/10.57702/upe1v9qi [Dataset]. https://service.tib.eu/ldmservice/dataset/kinetics-400-and-something-something-v2
Explore at:
Dataset updated
Dec 2, 2024
Description
The dataset used in the paper is Kinetics-400 and Something-Something-V2.
Z
I3D Video Features, Labels and Splits for Multicamera Overlapping Datasets...
data.niaid.nih.gov
Updated Jan 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SANTIAGO LOPES PEREIRA, SILAS (2025). I3D Video Features, Labels and Splits for Multicamera Overlapping Datasets Pets-2009, HQFS and Up-Fall [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_14655605
Explore at:
Dataset updated
Jan 16, 2025
Dataset provided by
SANTIAGO LOPES PEREIRA, SILAS
Maia, José
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
I3D Video Features, Labels and Splits for Multicamera Overlapping Datasets Pets-2009, HQFS and Up-Fall

The Inflated 3D (I3D) video features, ground truths, and train/test splits for the multicamera datasets Pets-2009, HQFS, and Up-Fall are available here. We relabeled two datasets (HQFS and Pets-2009) for the task of VAD-MIL under multiple cameras. Three feature dispositions of I3D data are available: I3D-RGB, I3D-OF, and the linear concatenation of these features. These datasets can be used as benchmarks for the video anomaly detection task under multiple instance learning and multiple overlapping cameras.

Preprocessed Datasets

PETS-2009 is a benchmark dataset (https://cs.binghamton.edu/~mrldata/pets2009) aggregating different scene sets with multiple overlapping camera views and distinct events involving crowds. We labeled the scenes at \textit{frame} level as anomaly or normal events. Scenes with background, people walking individually or in a crowd, and regular passing of cars are considered normal patterns. Frames with occurrences of people running (individually or in the crowd), crowding of people in the middle of the traffic intersection, and people in the counterflow were considered anomalous patterns. Videos of scenes with the occurrence of anomalous frames are labeled as anomalous, while videos without the occurrence of anomalies are marked as normal videos. The High-Quality Fall Simulation Data - HQFS dataset (https://iiw.kuleuven.be/onderzoek/advise/datasets/fall-and-adl-meta-data) is an indoor scenario with five overlapping cameras with the occurrence of fall incidents. We consider a person falling on the floor an uncommon event. We also relabeled the frame annotations to consider the intervals where the person remains lying on the ground after the fall. The multi-class Up-Fall (https://sites.google.com/up.edu.mx/har-up/) detection dataset contains two overlapping camera views and infrared sensors in a laboratory scenario.

Video Feature Extraction

We use Inflated 3D (I3D) features to represent video clips of 16 frames. We use the Video Features library (https://github.com/v-iashin/video_features) that uses a pre-trained model on the Kinetics 400 dataset. For this procedure, the frame sequence length from which to get the video clip feature representation (or window size) and the number of frames to step before extracting the next features were set to 16 frames. After the video extraction process, each video from each camera corresponds to a matrix with dimension n x 1024, where n is a variable number of existing segments and the number of attributes is 1024 (I3D attributes referring to RGB appearance information or I3D attributes referring to Optical Flow information). It is important to note that the videos (\textit{bags}) are divided into clips with a fixed number of \textit{frames}. Consequently, each video \textit{bag} contains a variable number of clips. A clip can be completely normal, completely anomalous, or mixed with normal and anomalous frames. There are three possible deep feature dispositions considered: I3D features generated with only RGB (1024 I3D features from RGB data), Optical Flow (1024 I3D features from optical flow data), and the combination of both (by simple linear concatenation). We also make available 10-crop features (https://pytorch.org/vision/main/generated/torchvision.transforms.TenCrop.html) by yielding 10 crops for a given video clip.

File Description

center-crop.zip: Folder with I3D features of Pets-2009, HQFS and Up-Fall datasets;

10-crop.zip: Folder with I3D features (10-crop) of Pets-2009, HQFS and Up-Fall datasets;

gts.zip: Folder with ground truths at frame-level and video-level of Pets-2009, HQFS and Up-Fall datasets;

splits.zip: Folder with Lists of training and test splits of Pets-2009, HQFS and Up-Fall datasets;

A portion of the preprocessed I3D feature sets was leveraged in the studies outlined in these publications:

Pereira, S. S., & Maia, J. E. B. (2024). MC-MIL: video surveillance anomaly detection with multi-instance learning and multiple overlapped cameras. Neural Computing and Applications, 36(18), 10527-10543. Available at https://link.springer.com/article/10.1007/s00521-024-09611-3.

Pereira, S. S. L., Maia, J. E. B., & Proença, H. (2024, September). Video Anomaly Detection in Overlapping Data: The More Cameras, the Better?. In 2024 IEEE International Joint Conference on Biometrics (IJCB) (pp. 1-10). IEEE. Available at https://ieeexplore.ieee.org/document/10744502.
spaghetti-video
huggingface.co
Updated Aug 12, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hugging Face Internal Testing Organization (2022). spaghetti-video [Dataset]. https://huggingface.co/datasets/hf-internal-testing/spaghetti-video
Explore at:
Dataset updated
Aug 12, 2022
Dataset provided by
Hugging Facehttps://huggingface.co/
Authors
Hugging Face Internal Testing Organization
Description
This dataset contains both 8 and 16 sampled frames of the "eating-spaghetti" video of the Kinetics-400 dataset, with the following frame indices being used:

8 frames (eating_spaghetti_8_frames.npy): 97, 98, 99, 100, 101, 102, 103, 104 16 frames (eating_spaghetti.npy): [164, 168, 172, 176, 181, 185, 189, 193, 198, 202, 206, 210, 215, 219, 223, 227]. 32 frames (eating_spaghetti_32_frames.npy): array([ 47, 51, 55… See the full description on the dataset page: https://huggingface.co/datasets/hf-internal-testing/spaghetti-video.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Will Kay, Joao Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Gree, Trevor Back, Paul Natsev, Mustafa Suleyman, Andrew Zisserman (2020). Kinetics400 Dataset: The Kinetics Human Action Video Dataset [Dataset]. https://academictorrents.com/details/184d11318372f70018cf9a72ef867e2fb9ce1d26

Kinetics400 Dataset: The Kinetics Human Action Video Dataset

Explore at:

6 scholarly articles cite this dataset (View in Google Scholar)

bittorrent(157312153084)Available download formats

Dataset updated

Nov 7, 2020

Dataset authored and provided by

Will Kay, Joao Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Gree, Trevor Back, Paul Natsev, Mustafa Suleyman, Andrew Zisserman

License

https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified

Description

MD5 checksum: kinetics400.zip: 33224b5b77c634aa6717da686efce2d4 kinetics400_validation.zip: 013358d458477d7ac10cebb9e84df354

Clear search

Close search

Google apps

Main menu

Kinetics400 Dataset: The Kinetics Human Action Video Dataset

Kinetics-400-[test-set]

Video Action Recognition : Kinetics 400

License

kinetics400

Kinetics400 dataset for action recognition - Dataset - LDM

Kinetics 400

kinetics400-bootstapir_checkpoint_v2-crop_still_edges

Kinetics-400, UCF101, HMDB51, Something-Something V1, and...

kinetics-400-splits

Kinetics 600

Kinetics400_RGB256_F10_val_data

Dataset

Contents

Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing...

I3D Video Features, Labels and Splits for Multicamera Overlapping Datasets...

spaghetti-video

Kinetics400 Dataset: The Kinetics Human Action Video Dataset