100+ datasets found
  1. h

    MATH-500

    • huggingface.co
    Updated Nov 18, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ricardo (2022). MATH-500 [Dataset]. https://huggingface.co/datasets/ricdomolm/MATH-500
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 18, 2022
    Authors
    Ricardo
    Description

    MATH-500 test set with the remaining 12000 examples in train. import datasets

    https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py

    from math_utils import last_boxed_only_string, remove_boxed

    math = datasets.load_dataset('DigitalLearningGmbH/MATH-lighteval', 'default') math500 = datasets.load_dataset('HuggingFaceH4/MATH-500')

    convert math to math500 format

    def map_to_500(example): return { 'problem':โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/ricdomolm/MATH-500.

  2. h

    math500

    • huggingface.co
    Updated Apr 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zikang Shan (2025). math500 [Dataset]. https://huggingface.co/datasets/zkshan2002/math500
    Explore at:
    Dataset updated
    Apr 10, 2025
    Authors
    Zikang Shan
    Description

    Source: HuggingFaceH4/MATH-500 Modification:

    Remove redundent columns

    Keys:

    problem answer

    Size

    test: 500

  3. a

    Math 500 by Model

    • artificialanalysis.ai
    Updated Jun 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artificial Analysis (2025). Math 500 by Model [Dataset]. https://artificialanalysis.ai/evaluations/math-500
    Explore at:
    Dataset updated
    Jun 28, 2025
    Dataset authored and provided by
    Artificial Analysis
    Description

    Comparison of Independently conducted by Artificial Analysis by Model

  4. h

    math500

    • huggingface.co
    Updated May 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VLM-Reasoner (2025). math500 [Dataset]. https://huggingface.co/datasets/VLM-Reasoner/math500
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    VLM-Reasoner
    Description

    VLM-Reasoner/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    MATH500

    • huggingface.co
    Updated Jun 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yinjie Wang (2025). MATH500 [Dataset]. https://huggingface.co/datasets/yinjiewang/MATH500
    Explore at:
    Dataset updated
    Jun 26, 2025
    Authors
    Yinjie Wang
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    yinjiewang/MATH500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    math500

    • huggingface.co
    Updated May 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiarui Yao (2025). math500 [Dataset]. https://huggingface.co/datasets/FlippyDora/math500
    Explore at:
    Dataset updated
    May 1, 2025
    Authors
    Jiarui Yao
    Description

    FlippyDora/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    math500-cot-experiment

    • huggingface.co
    Updated Apr 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Asankhaya Sharma (2025). math500-cot-experiment [Dataset]. https://huggingface.co/datasets/codelion/math500-cot-experiment
    Explore at:
    Dataset updated
    Apr 26, 2025
    Authors
    Asankhaya Sharma
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    MATH-500 Chain of Thought Experiment Dataset

    This dataset contains the results of an experiment testing different prompting strategies (standard, chain of thought, and gibberish chain of thought) on the MATH-500 benchmark using the Llama-3.2-1B-Instruct model.

      Dataset Structure
    

    The dataset is split into three parts:

    standard: Direct prompting with no reasoning steps (500 examples) cot: Chain of thought prompting with structured reasoning (500 examples) gibberish:โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/codelion/math500-cot-experiment.

  8. h

    math500_best_of_n

    • huggingface.co
    Updated Jan 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jakhongir Saydaliev (2025). math500_best_of_n [Dataset]. https://huggingface.co/datasets/Jakh0103/math500_best_of_n
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 12, 2025
    Authors
    Jakhongir Saydaliev
    Description

    Dataset Summary

    This dataset contains mathematical problem-solving responses generated using two decoding methods: Greedy and Best-of-N for 20 problems spanning three difficulty levels (1, 2, 3) from the [MATH500 dataset](https://huggingface.co/datasets/HuggingFaceH4/MATH-500.

      Languages
    

    The dataset content is entirely in English.

      Dataset Structure
    

    Each instance contains the following fields:

    problem: math problem given as an input answer: a ground truth answerโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/Jakh0103/math500_best_of_n.

  9. h

    MATH500

    • huggingface.co
    Updated Mar 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Wang (2025). MATH500 [Dataset]. https://huggingface.co/datasets/Wloner0809/MATH500
    Explore at:
    Dataset updated
    Mar 21, 2025
    Authors
    Yu Wang
    Description

    Wloner0809/MATH500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    MATH500

    • huggingface.co
    Updated Aug 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Princeton-AI (2025). MATH500 [Dataset]. https://huggingface.co/datasets/Gen-Verse/MATH500
    Explore at:
    Dataset updated
    Aug 27, 2025
    Dataset authored and provided by
    Princeton-AI
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Gen-Verse/MATH500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. h

    math500-enhanced

    • huggingface.co
    Updated Jun 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rachit Bansal (2025). math500-enhanced [Dataset]. https://huggingface.co/datasets/rachitbansal-harvard/math500-enhanced
    Explore at:
    Dataset updated
    Jun 17, 2025
    Authors
    Rachit Bansal
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Math500 Enhanced Dataset

    This dataset contains LLM-enhanced versions of mathematical problems with step-by-step reasoning solutions.

      Dataset Statistics
    

    Examples: 500 (500 enhanced with LLM) Enhancement Rate: 100.0%

      Data Fields
    

    question: The mathematical problem statement solution: LLM-enhanced step-by-step solution original_solution: Original solution text (for reference) answer: Final numerical answer level: Problem difficulty level type: Problemโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/rachitbansal-harvard/math500-enhanced.

  12. h

    MATH-500

    • huggingface.co
    Updated Nov 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dongfu Jiang (2024). MATH-500 [Dataset]. https://huggingface.co/datasets/DongfuJiang/MATH-500
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 7, 2024
    Authors
    Dongfu Jiang
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    DongfuJiang/MATH-500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    math500-deepseek-r1-distill-qwen-1.5b

    • huggingface.co
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sungmin Jo (2025). math500-deepseek-r1-distill-qwen-1.5b [Dataset]. https://huggingface.co/datasets/jsm0424/math500-deepseek-r1-distill-qwen-1.5b
    Explore at:
    Dataset updated
    May 28, 2025
    Authors
    Sungmin Jo
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for "math500-deepseek-r1-distill-qwen-1.5b"

      Dataset Summary
    

    This dataset is a distilled version of the MATH500 dataset, augmented with reasoning-based responses generated by the deepseek-r1-distill-qwen-1.5b language model. The dataset is designed to evaluate and improve the mathematical reasoning capabilities of LLMs through step-by-step solutions and final answers. Each example consists of:

    The original problem statement from MATH500 The reference solutionโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/jsm0424/math500-deepseek-r1-distill-qwen-1.5b.

  14. h

    math500

    • huggingface.co
    Updated Jun 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Moor-Smith (2025). math500 [Dataset]. https://huggingface.co/datasets/smoorsmith/math500
    Explore at:
    Dataset updated
    Jun 21, 2025
    Authors
    Samuel Moor-Smith
    Description

    Dataset Card for Dataset Name

    Forked from HuggingFaceH4/MATH-500

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    Curated by: [More Information Needed] Funded by [optional]: [More Information Needed] Shared by [optional]: [More Information Needed] Language(s) (NLP): [More Information Needed] License: [More Information Needed]

      Dataset Sources [optional]
    

    Repository: [More Information Needed] Paper [optional]: [More Information Needed] Demo [optional]:โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/smoorsmith/math500.

  15. h

    MATH-train-MATH500-test

    • huggingface.co
    Updated Nov 18, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shuozhe Li (2022). MATH-train-MATH500-test [Dataset]. https://huggingface.co/datasets/ShuoZheLi/MATH-train-MATH500-test
    Explore at:
    Dataset updated
    Nov 18, 2022
    Authors
    Shuozhe Li
    Description

    ShuoZheLi/MATH-train-MATH500-test dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    math500

    • huggingface.co
    Updated Jul 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sparklereasoning (2025). math500 [Dataset]. https://huggingface.co/datasets/sparkle-reasoning/math500
    Explore at:
    Dataset updated
    Jul 5, 2025
    Dataset authored and provided by
    sparklereasoning
    Description

    sparkle-reasoning/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. h

    math500-math-rubric

    • huggingface.co
    Updated Aug 16, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mika Senghaas (2025). math500-math-rubric [Dataset]. https://huggingface.co/datasets/mikasenghaas/math500-math-rubric
    Explore at:
    Dataset updated
    Aug 16, 2025
    Authors
    Mika Senghaas
    Description

    mikasenghaas/math500-math-rubric dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. h

    math500

    • huggingface.co
    Updated Mar 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wei Xiong (2025). math500 [Dataset]. https://huggingface.co/datasets/weqweasdas/math500
    Explore at:
    Dataset updated
    Mar 19, 2025
    Authors
    Wei Xiong
    Description

    weqweasdas/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    math500

    • huggingface.co
    Updated Feb 5, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    dsrtrain (2025). math500 [Dataset]. https://huggingface.co/datasets/dsrtrain/math500
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 5, 2025
    Dataset authored and provided by
    dsrtrain
    Description

    dsrtrain/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    math500

    • huggingface.co
    Updated Jan 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Scale Frontier Data (2025). math500 [Dataset]. https://huggingface.co/datasets/ScaleFrontierData/math500
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 29, 2025
    Dataset authored and provided by
    Scale Frontier Data
    Description

    ScaleFrontierData/math500 dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ricardo (2022). MATH-500 [Dataset]. https://huggingface.co/datasets/ricdomolm/MATH-500

MATH-500

ricdomolm/MATH-500

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 18, 2022
Authors
Ricardo
Description

MATH-500 test set with the remaining 12000 examples in train. import datasets

https://github.com/volcengine/verl/blob/30911f133aa300ae9d8e341dba8e63192335705e/verl/utils/reward_score/math.py

from math_utils import last_boxed_only_string, remove_boxed

math = datasets.load_dataset('DigitalLearningGmbH/MATH-lighteval', 'default') math500 = datasets.load_dataset('HuggingFaceH4/MATH-500')

convert math to math500 format

def map_to_500(example): return { 'problem':โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/ricdomolm/MATH-500.

Search
Clear search
Close search
Google apps
Main menu