LongVideoBench
huggingface.co
Updated Jun 21, 2024
Cite
LongVideoBench (2024). LongVideoBench [Dataset]. https://huggingface.co/datasets/longvideobench/LongVideoBench
Dataset authored and provided by
LongVideoBench
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Dataset Card for LongVideoBench

Large multimodal models (LMMs) are handling increasingly long and complex inputs. However, few public benchmarks are available to assess these advancements. To address this, we introduce LongVideoBench, a question-answering benchmark with video-language interleaved inputs up to an hour long. It comprises 3,763 web-collected videos with subtitles across diverse themes, designed to evaluate LMMs on long-term multimodal understanding. The… See the full description on the dataset page: https://huggingface.co/datasets/longvideobench/LongVideoBench
131 scholarly articles cite this dataset.
