Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
MMMU (A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI)
Homepage | Leaderboard | Dataset | Paper | arXiv | GitHub
News
[2024-05-30]: Fixed duplicate option issues in Materials dataset items (validation_Materials_25; test_Materials_17, 242) and content error in validation_Materials_25.
[2024-04-30]: Fixed missing "-" or "^" signs in Math dataset items (dev_Math_2, validation_Math_11, 12, 16; test_Math_8… See the full description on the dataset page: https://huggingface.co/datasets/MMMU/MMMU.
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
zli12321/mmmu-pro-vision dataset hosted on Hugging Face and contributed by the HF Datasets community
This dataset contains the data for the paper Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos. Video-MMMU is a multi-modal, multi-disciplinary benchmark designed to assess LMMs' ability to acquire and utilize knowledge from videos. Project page: https://videommmu.github.io/
Leaderboard (last updated: 07 Feb, 2025)
Model | Overall | Perception | Comprehension | Adaptation | Δknowledge
Human Expert | 74.44 | 84.33 | 78.67 | 60.33 | +33.1…
See the full description on the dataset page: https://huggingface.co/datasets/lmms-lab/VideoMMMU.