15 datasets found
  1. h

    Video-R1-data

    • huggingface.co
    Updated Apr 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Video-R1 (2025). Video-R1-data [Dataset]. https://huggingface.co/datasets/Video-R1/Video-R1-data
    Explore at:
    Dataset updated
    Apr 13, 2025
    Dataset authored and provided by
    Video-R1
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This repository contains the data presented in Video-R1: Reinforcing Video Reasoning in MLLMs. Code: https://github.com/tulerfeng/Video-R1 Video data folder: CLEVRER, LLaVA-Video-178K, NeXT-QA, PerceptionTest, STAR Image data folder: Chart, General, Knowledge, Math, OCR, Spatial Video-R1-COT-165k.json is for SFT cold start, and Video-R1-260k.json is for RL training. Data Format in Video-R1-COT-165k: { "problem_id": 2, "problem": "What appears on the screen in Russian during the… See the full description on the dataset page: https://huggingface.co/datasets/Video-R1/Video-R1-data.

  2. h

    video-r1-RL

    • huggingface.co
    Updated Apr 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmed Heakl (2025). video-r1-RL [Dataset]. https://huggingface.co/datasets/ahmedheakl/video-r1-RL
    Explore at:
    Dataset updated
    Apr 14, 2025
    Authors
    Ahmed Heakl
    Description

    ahmedheakl/video-r1-RL dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. h

    video-r1-image

    • huggingface.co
    Updated Apr 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    蔡正舟 (2025). video-r1-image [Dataset]. https://huggingface.co/datasets/conctsai/video-r1-image
    Explore at:
    Dataset updated
    Apr 13, 2025
    Authors
    蔡正舟
    Description

    conctsai/video-r1-image dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    video-r1-processed-mini

    • huggingface.co
    Updated Aug 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kenta (2025). video-r1-processed-mini [Dataset]. https://huggingface.co/datasets/DLNorb/video-r1-processed-mini
    Explore at:
    Dataset updated
    Aug 31, 2025
    Authors
    Kenta
    Description

    DLNorb/video-r1-processed-mini dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    video-dataset

    • huggingface.co
    Updated May 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ProgramerSalar (2025). video-dataset [Dataset]. https://huggingface.co/datasets/ProgramerSalar/video-dataset
    Explore at:
    Dataset updated
    May 11, 2025
    Authors
    ProgramerSalar
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Video Dataset on Hugging Face

    This repository hosts the video dataset, a widely used benchmark dataset for human action recognition in videos. The dataset has been processed and uploaded to the Hugging Face Hub for easy access, sharing, and integration into machine learning workflows.

      Introduction
    

    The dataset is a large-scale video dataset designed for action recognition tasks. It contains 13,320 video clips across 101 action categories, making it one of the most… See the full description on the dataset page: https://huggingface.co/datasets/ProgramerSalar/video-dataset.

  6. h

    Video-R1-eval

    • huggingface.co
    Updated Mar 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Video-R1 (2025). Video-R1-eval [Dataset]. https://huggingface.co/datasets/Video-R1/Video-R1-eval
    Explore at:
    Dataset updated
    Mar 29, 2025
    Dataset authored and provided by
    Video-R1
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This repository contains the data presented in Video-R1: Reinforcing Video Reasoning in MLLMs. Code: https://github.com/tulerfeng/Video-R1

  7. h

    videos-ours-r1

    • huggingface.co
    Updated Apr 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmed Heakl (2025). videos-ours-r1 [Dataset]. https://huggingface.co/datasets/ahmedheakl/videos-ours-r1
    Explore at:
    Dataset updated
    Apr 15, 2025
    Authors
    Ahmed Heakl
    Description

    ahmedheakl/videos-ours-r1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    TinyLLaVA-Video-R1-training-data

    • huggingface.co
    Updated Apr 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhang Xingjian (2025). TinyLLaVA-Video-R1-training-data [Dataset]. https://huggingface.co/datasets/Zhang199/TinyLLaVA-Video-R1-training-data
    Explore at:
    Dataset updated
    Apr 15, 2025
    Authors
    Zhang Xingjian
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    TinyLLaVA-Video-R1

    We select multiple choice questions from the NextQA subset of LLaVA-Video-178K as training data. To maintain manageable training time with limited computational resources, we only choose the subset of data with a duration of 0 to 30 seconds, which contains 5,496 samples. In addition, we manually annotate 16 samples for cold-starting and provide the annotations.

      Organize Data
    

    Organize the files and annotation files as follows in path/to/your/dataset: dataset ├──… See the full description on the dataset page: https://huggingface.co/datasets/Zhang199/TinyLLaVA-Video-R1-training-data.

  9. h

    SEED-Bench-R1

    • huggingface.co
    Updated Apr 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ARC Lab, Tencent PCG (2025). SEED-Bench-R1 [Dataset]. https://huggingface.co/datasets/TencentARC/SEED-Bench-R1
    Explore at:
    Dataset updated
    Apr 1, 2025
    Dataset authored and provided by
    ARC Lab, Tencent PCG
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This repository contains the datasets presented in Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1.

  10. h

    Ego-R1-Data

    • huggingface.co
    Updated Jun 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ego-R1 (2025). Ego-R1-Data [Dataset]. https://huggingface.co/datasets/Ego-R1/Ego-R1-Data
    Explore at:
    Dataset updated
    Jun 15, 2025
    Dataset authored and provided by
    Ego-R1
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    🧠 Ego-R1 Data

    Welcome to the Ego-R1 Data, a comprehensive collection designed to facilitate the training of large language models for tool-augmented reasoning and reinforcement learning. This dataset will be used for Ego-R1 Codebase, presented in the paper Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning.

      📊 Dataset Overview
    

    The Ego-R1 Dataset consists of two main components:

    Ego-CoTT-25K: 25,000 Chain-of-Tool-Thought examples for Supervised… See the full description on the dataset page: https://huggingface.co/datasets/Ego-R1/Ego-R1-Data.

  11. h

    VersaVid-R1_training_data

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VersaVid-R1, VersaVid-R1_training_data [Dataset]. https://huggingface.co/datasets/VersaVid-R1/VersaVid-R1_training_data
    Explore at:
    Authors
    VersaVid-R1
    Description

    VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks

    For more information, please visit the official VersaVid-R1 GitHub Repository.

      License
    

    VersaVid-R1 and its training data are intended solely for academic research purposes, and any form of commercial use is strictly prohibited. The copyright of all videos belongs to the video owners. If there is any infringement in VersaVid-R1 training data, please email… See the full description on the dataset page: https://huggingface.co/datasets/VersaVid-R1/VersaVid-R1_training_data.

  12. h

    MultiTaskVideoReasoning

    • huggingface.co
    Updated Aug 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haoji Zhang (2025). MultiTaskVideoReasoning [Dataset]. https://huggingface.co/datasets/zhang9302002/MultiTaskVideoReasoning
    Explore at:
    Dataset updated
    Aug 25, 2025
    Authors
    Haoji Zhang
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Multi Task Video Reasoning Dataset

    This is the official training dataset for Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning. [Project] [arXiv] [Code]

      Data Structure
    

    └── MultiTaskVideoReasoning ├── MTVR_CoT │ ├── actnet.json │ ├── charades.json │ ├── longvideo-reason.json │ ├── nextgqa.json │ ├── rextime.json │ ├── vidchapters.json │ ├── Video-R1-data-image.json │ └──… See the full description on the dataset page: https://huggingface.co/datasets/zhang9302002/MultiTaskVideoReasoning.

  13. h

    VAU-Bench

    • huggingface.co
    Updated Jun 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qixiang Chen (2025). VAU-Bench [Dataset]. https://huggingface.co/datasets/7xiang/VAU-Bench
    Explore at:
    Dataset updated
    Jun 7, 2025
    Authors
    Qixiang Chen
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning

    VAU-R1 is a data-efficient framework for video anomaly reasoning that combines Multimodal Large Language Models (MLLMs) with Reinforcement Fine-Tuning (RFT). This repository contains VAU-Bench, the first Chain-of-Thought (CoT) benchmark specifically designed for video anomaly understanding. It enables multimodal tasks such as multiple-choice question answering, temporal anomaly grounding, rationale-based… See the full description on the dataset page: https://huggingface.co/datasets/7xiang/VAU-Bench.

  14. h

    DenseStep200K

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymous, DenseStep200K [Dataset]. https://huggingface.co/datasets/gmj03/DenseStep200K
    Explore at:
    Authors
    Anonymous
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains two datasets for instructional video analysis tasks:

      1. DenseStep200K.json
    
    
    
    
    
      Description
    

    A large-scale dataset containing 222,000 detailed, temporally grounded instructional steps annotated across 10,000 high-quality instructional videos (totaling 732 hours). Constructed through a training-free automated pipeline leveraging multimodal foundation models (Qwen2.5-VL-72B and DeepSeek-R1-671B) to process noisy HowTo100M videos, achieving precise… See the full description on the dataset page: https://huggingface.co/datasets/gmj03/DenseStep200K.

  15. h

    Ego-R1-Bench

    • huggingface.co
    Updated May 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ego-R1 (2025). Ego-R1-Bench [Dataset]. https://huggingface.co/datasets/Ego-R1/Ego-R1-Bench
    Explore at:
    Dataset updated
    May 13, 2025
    Dataset authored and provided by
    Ego-R1
    Description

    🧠 Ego-R1 Benchmark

    We establish Ego-R1 Benchmark for ultra-long egocentric video understanding. It was proposed in the paper Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning.

      📁 Dataset Structure
    

    The Ego-R1 Benchmark contains 300 carefully curated question-answer pairs in total:

    🏷️ 150 Human-Labeled: Manually crafted questions by 6 annotators, with 25 QA pairs from each perspective. 🤖 150 Gemini-Generated + Human-Verfied: AI-generated… See the full description on the dataset page: https://huggingface.co/datasets/Ego-R1/Ego-R1-Bench.

  16. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Video-R1 (2025). Video-R1-data [Dataset]. https://huggingface.co/datasets/Video-R1/Video-R1-data

Video-R1-data

Video-R1/Video-R1-data

Explore at:
Dataset updated
Apr 13, 2025
Dataset authored and provided by
Video-R1
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

This repository contains the data presented in Video-R1: Reinforcing Video Reasoning in MLLMs. Code: https://github.com/tulerfeng/Video-R1 Video data folder: CLEVRER, LLaVA-Video-178K, NeXT-QA, PerceptionTest, STAR Image data folder: Chart, General, Knowledge, Math, OCR, Spatial Video-R1-COT-165k.json is for SFT cold start, and Video-R1-260k.json is for RL training. Data Format in Video-R1-COT-165k: { "problem_id": 2, "problem": "What appears on the screen in Russian during the… See the full description on the dataset page: https://huggingface.co/datasets/Video-R1/Video-R1-data.

Search
Clear search
Close search
Google apps
Main menu