2 datasets found
  1. h

    mmlu-redux-2.0

    • huggingface.co
    Updated Jun 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Edinburgh Dataset Analytics Working Group (2025). mmlu-redux-2.0 [Dataset]. http://doi.org/10.57967/hf/3469
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 5, 2025
    Dataset authored and provided by
    Edinburgh Dataset Analytics Working Group
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for MMLU-Redux-2.0

    MMLU-Redux is a subset of 5,700 manually re-annotated questions across 57 MMLU subjects.

      News
    

    [2025.02.25] We corrected one annotation in Abstract Algebra subset, as noted in the Issue #2. [2025.02.08] We corrected one annotation in High School Mathematics subset, as noted in the PlatinumBench paper. [2025.01.23] MMLU-Redux is accepted to NAACL 2025!

      Dataset Details
    
    
    
    
    
    
    
      Dataset Description
    

    Each data point in… See the full description on the dataset page: https://huggingface.co/datasets/edinburgh-dawg/mmlu-redux-2.0.

  2. h

    mmlu-redux

    • huggingface.co
    Updated Feb 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Edinburgh Dataset Analytics Working Group (2025). mmlu-redux [Dataset]. http://doi.org/10.57967/hf/2507
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 8, 2025
    Dataset authored and provided by
    Edinburgh Dataset Analytics Working Group
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for MMLU-Redux

    [!TIP] Please consider using MMLU-Redux-2.0 which contains all 57 MMLU subjects.

    MMLU-Redux is a subset of 3,000 manually re-annotated questions across 30 MMLU subjects.

      News
    

    [2025.02.08] We corrected one annotation in High School Mathematics subset, as noted in the PlatinumBench paper. [2025.01.23] MMLU-Redux is accepted to NAACL 2025!

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    Each data point in MMLU-Redux contains… See the full description on the dataset page: https://huggingface.co/datasets/edinburgh-dawg/mmlu-redux.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Edinburgh Dataset Analytics Working Group (2025). mmlu-redux-2.0 [Dataset]. http://doi.org/10.57967/hf/3469

mmlu-redux-2.0

MMLU-Redux-2.0

edinburgh-dawg/mmlu-redux-2.0

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 5, 2025
Dataset authored and provided by
Edinburgh Dataset Analytics Working Group
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Dataset Card for MMLU-Redux-2.0

MMLU-Redux is a subset of 5,700 manually re-annotated questions across 57 MMLU subjects.

  News

[2025.02.25] We corrected one annotation in Abstract Algebra subset, as noted in the Issue #2. [2025.02.08] We corrected one annotation in High School Mathematics subset, as noted in the PlatinumBench paper. [2025.01.23] MMLU-Redux is accepted to NAACL 2025!

  Dataset Details







  Dataset Description

Each data point in… See the full description on the dataset page: https://huggingface.co/datasets/edinburgh-dawg/mmlu-redux-2.0.

Search
Clear search
Close search
Google apps
Main menu