8 datasets found
  1. h

    MVBench

    • huggingface.co
    Updated Aug 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenGVLab (2024). MVBench [Dataset]. https://huggingface.co/datasets/OpenGVLab/MVBench
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 8, 2024
    Dataset authored and provided by
    OpenGVLab
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    MVBench

      Important Update
    

    [18/10/2024] Due to NTU RGB+D License, 320 videos from NTU RGB+D need to be downloaded manually. Please visit ROSE Lab to access the data. We also provide a list of the 320 videos used in MVBench for your reference.

    We introduce a novel static-to-dynamic method for defining temporal-related tasks. By converting static tasks into dynamic ones, we facilitate systematic generation of video tasks necessitating a wide range of temporal abilities, from… See the full description on the dataset page: https://huggingface.co/datasets/OpenGVLab/MVBench.

  2. P

    MVBench Dataset

    • paperswithcode.com
    Updated Dec 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kunchang Li; Yali Wang; Yinan He; Yizhuo Li; Yi Wang; Yi Liu; Zun Wang; Jilan Xu; Guo Chen; Ping Luo; LiMin Wang; Yu Qiao (2025). MVBench Dataset [Dataset]. https://paperswithcode.com/dataset/mvbench
    Explore at:
    Dataset updated
    Dec 29, 2023
    Authors
    Kunchang Li; Yali Wang; Yinan He; Yizhuo Li; Yi Wang; Yi Liu; Zun Wang; Jilan Xu; Guo Chen; Ping Luo; LiMin Wang; Yu Qiao
    Description

    MVBench is a comprehensive Multi-modal Video understanding Benchmark. It was introduced to evaluate the comprehension capabilities of Multi-modal Large Language Models (MLLMs), particularly their temporal understanding in dynamic video tasks. MVBench covers 20 challenging video tasks that cannot be effectively solved with a single frame. It introduces a novel static-to-dynamic method to define these temporal-related tasks. By transforming various static tasks into dynamic ones, it enables the systematic generation of video tasks that require a broad spectrum of temporal skills, ranging from perception to cognition.

  3. h

    MVBench

    • huggingface.co
    Updated Aug 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PKU-Alignment (2024). MVBench [Dataset]. https://huggingface.co/datasets/PKU-Alignment/MVBench
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 8, 2024
    Dataset authored and provided by
    PKU-Alignment
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset contains optimized video files based on the MVBench dataset. All non-video data remains the same, and users are encouraged to refer to the original dataset for the rest of the data and annotations. Original MVBench Dataset: MVBench on Hugging Face

  4. h

    MVBench-EvalData-PixelReasoner

    • huggingface.co
    Updated Jun 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haozhe Wang (2025). MVBench-EvalData-PixelReasoner [Dataset]. https://huggingface.co/datasets/JasperHaozhe/MVBench-EvalData-PixelReasoner
    Explore at:
    Dataset updated
    Jun 9, 2025
    Authors
    Haozhe Wang
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The evaluation data for the MVBench. The data structure follows the evaluation code of PixelReasoner

  5. h

    VideoLLava

    • huggingface.co
    Updated Jul 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CHANG YUNYEN (2024). VideoLLava [Dataset]. https://huggingface.co/datasets/Mitzi4132/VideoLLava
    Explore at:
    Dataset updated
    Jul 11, 2024
    Authors
    CHANG YUNYEN
    Description

    MVBench

    We introduce a novel static-to-dynamic method for defining temporal-related tasks. By converting static tasks into dynamic ones, we facilitate systematic generation of video tasks necessitating a wide range of temporal abilities, from perception to cognition. Guided by task definitions, we then automatically transform public video annotations into multiple-choice QA for task evaluation. This unique paradigm enables efficient creation of MVBench with minimal manual… See the full description on the dataset page: https://huggingface.co/datasets/Mitzi4132/VideoLLava.

  6. h

    MV-Bench-mini

    • huggingface.co
    Updated Jan 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    firstep.ai (2025). MV-Bench-mini [Dataset]. https://huggingface.co/datasets/firstep-ai/MV-Bench-mini
    Explore at:
    Dataset updated
    Jan 9, 2025
    Dataset provided by
    firstep.ai
    Description

    firstep-ai/MV-Bench-mini dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    MVTamperBenchEnd

    • huggingface.co
    Updated Dec 28, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Panda (2024). MVTamperBenchEnd [Dataset]. https://huggingface.co/datasets/Srikant86/MVTamperBenchEnd
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 28, 2024
    Authors
    Panda
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    MVTamperBench Dataset

      Overview
    

    MVTamperBenchEnd is a robust benchmark designed to evaluate Vision-Language Models (VLMs) against adversarial video tampering effects. It leverages the diverse and well-structured MVBench dataset, systematically augmented with four distinct tampering techniques:

    Masking: Overlays a black rectangle on a 1-second segment, simulating visual data loss. Repetition: Repeats a 1-second segment, introducing temporal redundancy. Rotation: Rotates a… See the full description on the dataset page: https://huggingface.co/datasets/Srikant86/MVTamperBenchEnd.

  8. h

    MVTamperBenchSample

    • huggingface.co
    Updated Dec 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Panda (2024). MVTamperBenchSample [Dataset]. https://huggingface.co/datasets/Srikant86/MVTamperBenchSample
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 28, 2024
    Authors
    Panda
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Proposed MVTamperBench, a novel benchmark that systematically evaluates the adversarial robustness of VLMs against video specific tampering techniques, with a focus on temporal reasoning and multimodal coherence.

      Dataset Description
    

    MVTamperBench applies five distinct tampering techniques to the original MVBench videos: Dropping, Masking, Substitution, Repetition, and Rotation. Each tampering effect introduces unique adversarial challenges to test VLM robustness under… See the full description on the dataset page: https://huggingface.co/datasets/Srikant86/MVTamperBenchSample.

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
OpenGVLab (2024). MVBench [Dataset]. https://huggingface.co/datasets/OpenGVLab/MVBench

MVBench

OpenGVLab/MVBench

Explore at:
369 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 8, 2024
Dataset authored and provided by
OpenGVLab
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

MVBench

  Important Update

[18/10/2024] Due to NTU RGB+D License, 320 videos from NTU RGB+D need to be downloaded manually. Please visit ROSE Lab to access the data. We also provide a list of the 320 videos used in MVBench for your reference.

We introduce a novel static-to-dynamic method for defining temporal-related tasks. By converting static tasks into dynamic ones, we facilitate systematic generation of video tasks necessitating a wide range of temporal abilities, from… See the full description on the dataset page: https://huggingface.co/datasets/OpenGVLab/MVBench.

Search
Clear search
Close search
Google apps
Main menu