MIT License https://opensource.org/licenses/MIT
License information was derived automatically
MVBench
Important Update
[18/10/2024] Due to the NTU RGB+D license, 320 videos from NTU RGB+D must be downloaded manually. Please visit ROSE Lab to access the data. We also provide a list of the 320 videos used in MVBench for your reference.
We introduce a novel static-to-dynamic method for defining temporal-related tasks. By converting static tasks into dynamic ones, we facilitate systematic generation of video tasks necessitating a wide range of temporal abilities, from… See the full description on the dataset page: https://huggingface.co/datasets/OpenGVLab/MVBench.
MVBench is a comprehensive Multi-modal Video understanding Benchmark. It was introduced to evaluate the comprehension capabilities of Multi-modal Large Language Models (MLLMs), particularly their temporal understanding in dynamic video tasks. MVBench covers 20 challenging video tasks that cannot be effectively solved with a single frame. It introduces a novel static-to-dynamic method to define these temporal-related tasks. By transforming various static tasks into dynamic ones, it enables the systematic generation of video tasks that require a broad spectrum of temporal skills, ranging from perception to cognition.
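Since MVBench is distributed as multiple-choice QA over short videos, a typical evaluation loop just renders each annotation as a lettered prompt. Below is a minimal loading sketch using the Hugging Face `datasets` library; the config name `action_sequence` and the field names `question`, `candidates`, and `answer` are assumptions based on MVBench's published annotation layout, so verify the exact schema on the dataset page.

```python
# Minimal sketch: load one MVBench task and render its multiple-choice
# QA items as lettered prompts.
# Assumptions: the config name "action_sequence", the split name, and
# the fields "question" / "candidates" / "answer" follow MVBench's
# published annotation layout; check the dataset page to confirm.
from datasets import load_dataset

ds = load_dataset("OpenGVLab/MVBench", "action_sequence", split="train")

LETTERS = "ABCDEFGH"

def format_mcqa(item):
    """Render one annotation as a lettered multiple-choice prompt."""
    options = "\n".join(
        f"({LETTERS[i]}) {c}" for i, c in enumerate(item["candidates"])
    )
    return f"{item['question']}\n{options}"

print(format_mcqa(ds[0]))
```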
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains optimized video files based on the MVBench dataset. All non-video data remains the same; users are encouraged to refer to the original dataset for the remaining data and annotations. Original MVBench dataset: https://huggingface.co/datasets/OpenGVLab/MVBench
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The evaluation data for MVBench. The data structure follows the evaluation code of PixelReasoner.
MVBench
We introduce a novel static-to-dynamic method for defining temporal-related tasks. By converting static tasks into dynamic ones, we facilitate systematic generation of video tasks necessitating a wide range of temporal abilities, from perception to cognition. Guided by task definitions, we then automatically transform public video annotations into multiple-choice QA for task evaluation. This unique paradigm enables efficient creation of MVBench with minimal manual… See the full description on the dataset page: https://huggingface.co/datasets/Mitzi4132/VideoLLava.
The firstep-ai/MV-Bench-mini dataset is hosted on Hugging Face and was contributed by the HF Datasets community.
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
MVTamperBench Dataset
Overview
MVTamperBenchEnd is a robust benchmark designed to evaluate Vision-Language Models (VLMs) against adversarial video tampering effects. It leverages the diverse and well-structured MVBench dataset, systematically augmented with four distinct tampering techniques:
Masking: Overlays a black rectangle on a 1-second segment, simulating visual data loss.
Repetition: Repeats a 1-second segment, introducing temporal redundancy.
Rotation: Rotates a… See the full description on the dataset page: https://huggingface.co/datasets/Srikant86/MVTamperBenchEnd.
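To make these effects concrete, here is a hedged sketch of the two fully described techniques above, operating on a decoded (T, H, W, C) frame array. This is not the benchmark's actual implementation; segment placement, rectangle size, and fps handling are illustrative assumptions.

```python
import numpy as np

def mask_segment(frames: np.ndarray, fps: float, start_s: float = 0.0) -> np.ndarray:
    """'Masking': overlay a black rectangle on a 1-second segment.
    Rectangle placement and size are illustrative assumptions."""
    out = frames.copy()
    t0 = int(start_s * fps)
    t1 = min(t0 + int(round(fps)), len(out))  # one second of frames
    h, w = out.shape[1:3]
    out[t0:t1, h // 4 : 3 * h // 4, w // 4 : 3 * w // 4] = 0
    return out

def repeat_segment(frames: np.ndarray, fps: float, start_s: float = 0.0) -> np.ndarray:
    """'Repetition': duplicate a 1-second segment in place,
    lengthening the clip by one second."""
    t0 = int(start_s * fps)
    t1 = min(t0 + int(round(fps)), len(frames))
    return np.concatenate([frames[:t1], frames[t0:t1], frames[t1:]])
```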
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
MVTamperBench is a novel benchmark that systematically evaluates the adversarial robustness of VLMs against video-specific tampering techniques, with a focus on temporal reasoning and multimodal coherence.
Dataset Description
MVTamperBench applies five distinct tampering techniques to the original MVBench videos: Dropping, Masking, Substitution, Repetition, and Rotation. Each tampering effect introduces unique adversarial challenges to test VLM robustness under… See the full description on the dataset page: https://huggingface.co/datasets/Srikant86/MVTamperBenchSample.
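For completeness, the remaining effects named above (Dropping, Substitution, and Rotation) can be sketched in the same frame-array style as the masking and repetition examples earlier. Again, these are assumptions about what each named technique does, not the benchmark's own code.

```python
import numpy as np

def drop_segment(frames: np.ndarray, fps: float, start_s: float = 0.0) -> np.ndarray:
    """'Dropping': remove a 1-second segment entirely."""
    t0 = int(start_s * fps)
    t1 = min(t0 + int(round(fps)), len(frames))
    return np.concatenate([frames[:t0], frames[t1:]])

def substitute_segment(frames: np.ndarray, donor: np.ndarray,
                       fps: float, start_s: float = 0.0) -> np.ndarray:
    """'Substitution': replace a 1-second segment with frames from an
    unrelated donor clip of the same resolution (assumed available)."""
    out = frames.copy()
    t0 = int(start_s * fps)
    t1 = min(t0 + int(round(fps)), len(out))
    out[t0:t1] = donor[: t1 - t0]
    return out

def rotate_segment(frames: np.ndarray, fps: float, start_s: float = 0.0) -> np.ndarray:
    """'Rotation': rotate a 1-second segment by 90 degrees in the
    image plane (assumes square frames so the shape is preserved)."""
    out = frames.copy()
    t0 = int(start_s * fps)
    t1 = min(t0 + int(round(fps)), len(out))
    out[t0:t1] = np.rot90(out[t0:t1], axes=(1, 2))
    return out
```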