Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This repository contains the data presented in Video-R1: Reinforcing Video Reasoning in MLLMs.
Code: https://github.com/tulerfeng/Video-R1
Video data folders: CLEVRER, LLaVA-Video-178K, NeXT-QA, PerceptionTest, STAR
Image data folders: Chart, General, Knowledge, Math, OCR, Spatial
Video-R1-COT-165k.json is for the SFT cold start; Video-R1-260k.json is for RL training.
Data format in Video-R1-COT-165k: { "problem_id": 2, "problem": "What appears on the screen in Russian during the… See the full description on the dataset page: https://huggingface.co/datasets/Video-R1/Video-R1-data.
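A minimal sketch of inspecting this format in Python, assuming the file is a flat JSON array of records; only the "problem_id" and "problem" fields are confirmed by the snippet above, so the code prints the keys rather than assuming the rest of the schema:

import json

# Load the SFT cold-start annotations (downloaded from the dataset page).
with open("Video-R1-COT-165k.json", encoding="utf-8") as f:
    records = json.load(f)  # assumed: a JSON array of objects

sample = records[0]
print(sample["problem_id"], sample["problem"][:80])
print(sorted(sample.keys()))  # list the fields actually present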
ahmedheakl/video-r1-RL dataset hosted on Hugging Face and contributed by the HF Datasets community
conctsai/video-r1-image dataset hosted on Hugging Face and contributed by the HF Datasets community
DLNorb/video-r1-processed-mini dataset hosted on Hugging Face and contributed by the HF Datasets community
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Video Dataset on Hugging Face
This repository hosts a video dataset that is widely used as a benchmark for human action recognition. The dataset has been processed and uploaded to the Hugging Face Hub for easy access, sharing, and integration into machine learning workflows.
Introduction
It is a large-scale video dataset designed for action recognition tasks, containing 13,320 video clips across 101 action categories, making it one of the most… See the full description on the dataset page: https://huggingface.co/datasets/ProgramerSalar/video-dataset.
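A minimal sketch of pulling the files, assuming this is a standard Hub dataset repository; the internal layout is not documented above, so inspect the snapshot after downloading:

from huggingface_hub import snapshot_download

# Download the whole dataset repository to the local HF cache.
local_dir = snapshot_download(
    repo_id="ProgramerSalar/video-dataset",
    repo_type="dataset",
)
print("Downloaded to:", local_dir)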
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This repository contains the data presented in Video-R1: Reinforcing Video Reasoning in MLLMs. Code: https://github.com/tulerfeng/Video-R1
ahmedheakl/videos-ours-r1 dataset hosted on Hugging Face and contributed by the HF Datasets community
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
TinyLLaVA-Video-R1
We select multiple-choice questions from the NextQA subset of LLaVA-Video-178K as training data. To keep training time manageable with limited computational resources, we choose only the subset of data with durations of 0 to 30 seconds, which contains 5,496 samples. In addition, we manually annotate 16 samples for cold starting and provide the annotations.
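A hypothetical sketch of the 0-to-30-second filter described above; the annotation file name and the "duration" field are assumptions for illustration, not the released schema:

import json

# Load the NextQA-style annotations (hypothetical file name).
with open("nextqa_annotations.json", encoding="utf-8") as f:
    items = json.load(f)

# Keep only clips no longer than 30 seconds, as in the setup above.
short_clips = [x for x in items if x.get("duration", float("inf")) <= 30]
print(len(short_clips))  # the description above reports 5,496 such samples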
Organize Data
Organize the files and annotation files as follows in path/to/your/dataset:
dataset
├── … See the full description on the dataset page: https://huggingface.co/datasets/Zhang199/TinyLLaVA-Video-R1-training-data.
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This repository contains the datasets presented in Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1.
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
🧠 Ego-R1 Data
Welcome to Ego-R1 Data, a comprehensive collection designed to facilitate the training of large language models for tool-augmented reasoning and reinforcement learning. This dataset is used by the Ego-R1 codebase and was presented in the paper Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning.
📊 Dataset Overview
The Ego-R1 Dataset consists of two main components:
Ego-CoTT-25K: 25,000 Chain-of-Tool-Thought examples for Supervised… See the full description on the dataset page: https://huggingface.co/datasets/Ego-R1/Ego-R1-Data.
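A minimal loading sketch with the datasets library, assuming the repository exposes a default configuration; the actual config and split names should be checked on the dataset page:

from datasets import load_dataset

# Load Ego-R1 data from the Hub; a config name may be required.
ds = load_dataset("Ego-R1/Ego-R1-Data")
print(ds)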
VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks
For more information, please visit the official VersaVid-R1 GitHub Repository.
License
VersaVid-R1 and its training data are intended solely for academic research purposes, and any form of commercial use is strictly prohibited. The copyright of all videos belongs to the video owners. If there is any infringement in VersaVid-R1 training data, please email… See the full description on the dataset page: https://huggingface.co/datasets/VersaVid-R1/VersaVid-R1_training_data.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Multi Task Video Reasoning Dataset
This is the official training dataset for Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning. [Project] [arXiv] [Code]
Data Structure
└── MultiTaskVideoReasoning
    ├── MTVR_CoT
    │   ├── actnet.json
    │   ├── charades.json
    │   ├── longvideo-reason.json
    │   ├── nextgqa.json
    │   ├── rextime.json
    │   ├── vidchapters.json
    │   ├── Video-R1-data-image.json
    │   └── …
See the full description on the dataset page: https://huggingface.co/datasets/zhang9302002/MultiTaskVideoReasoning.
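A minimal sketch of reading one of the MTVR_CoT annotation files listed above; the per-record schema is not shown on this page, so the code prints the keys instead of assuming them:

import json

with open("MultiTaskVideoReasoning/MTVR_CoT/nextgqa.json", encoding="utf-8") as f:
    data = json.load(f)

print(type(data).__name__, len(data))
if isinstance(data, list) and data:
    print(sorted(data[0].keys()))  # reveal the actual annotation fields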
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning
VAU-R1 is a data-efficient framework for video anomaly reasoning that combines Multimodal Large Language Models (MLLMs) with Reinforcement Fine-Tuning (RFT). This repository contains VAU-Bench, the first Chain-of-Thought (CoT) benchmark specifically designed for video anomaly understanding. It enables multimodal tasks such as multiple-choice question answering, temporal anomaly grounding, rationale-based… See the full description on the dataset page: https://huggingface.co/datasets/7xiang/VAU-Bench.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository contains two datasets for instructional video analysis tasks:
1. DenseStep200K.json
Description
A large-scale dataset of 222,000 detailed, temporally grounded instructional steps annotated across 10,000 high-quality instructional videos (732 hours in total). It was constructed through a training-free automated pipeline that leverages multimodal foundation models (Qwen2.5-VL-72B and DeepSeek-R1-671B) to process noisy HowTo100M videos, achieving precise… See the full description on the dataset page: https://huggingface.co/datasets/gmj03/DenseStep200K.
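A hypothetical sketch of iterating the temporally grounded steps; every field name below ("video_id", "steps", "start", "end", "text") is an assumption for illustration and must be verified against the actual file:

import json

with open("DenseStep200K.json", encoding="utf-8") as f:
    videos = json.load(f)

# Print the grounded steps of the first few videos (assumed schema).
for video in videos[:3]:
    for step in video.get("steps", []):
        print(video.get("video_id"), step.get("start"), step.get("end"), step.get("text"))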
🧠 Ego-R1 Benchmark
We establish the Ego-R1 Benchmark for ultra-long egocentric video understanding. It was proposed in the paper Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning.
📁 Dataset Structure
The Ego-R1 Benchmark contains 300 carefully curated question-answer pairs in total:
🏷️ 150 Human-Labeled: questions manually crafted by 6 annotators, with 25 QA pairs from each perspective. 🤖 150 Gemini-Generated + Human-Verified: AI-generated… See the full description on the dataset page: https://huggingface.co/datasets/Ego-R1/Ego-R1-Bench.