Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
MMMU (A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI)
π Homepage | π Leaderboard | π€ Dataset | π€ Paper | π arXiv | GitHub
πNews
π οΈ[2024-05-30]: Fixed duplicate option issues in Materials dataset items (validation_Materials_25; test_Materials_17, 242) and content error in validation_Materials_25. π οΈ[2024-04-30]: Fixed missing "-" or "^" signs in Math dataset items (dev_Math_2, validation_Math_11, 12, 16; test_Math_8β¦ See the full description on the dataset page: https://huggingface.co/datasets/MMMU/MMMU.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
MMMU with difficulty level tags
This dataset extends the π€ MMMU val benchmark by introducing two additional tags: passrate_for_qwen2.5_vl_7b and difficulty_level_for_qwen2.5_vl_7b. Further details are available in our paper The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs.
π Data Usage
from datasets import load_dataset
dataset = load_dataset("JierunChen/MMMU_with_difficulty_level") print(dataset)
πβ¦ See the full description on the dataset page: https://huggingface.co/datasets/JierunChen/MMMU_with_difficulty_level.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
MMMU (A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI)
π Homepage | π Leaderboard | π€ Dataset | π€ Paper | π arXiv | GitHub
πNews
π οΈ[2024-05-30]: Fixed duplicate option issues in Materials dataset items (validation_Materials_25; test_Materials_17, 242) and content error in validation_Materials_25. π οΈ[2024-04-30]: Fixed missing "-" or "^" signs in Math dataset items (dev_Math_2, validation_Math_11, 12, 16; test_Math_8β¦ See the full description on the dataset page: https://huggingface.co/datasets/MMMU/MMMU.