13 datasets found
  1. h

    dart-math-hard

    • huggingface.co
    Updated Jun 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HKUST NLP Group (2024). dart-math-hard [Dataset]. https://huggingface.co/datasets/hkust-nlp/dart-math-hard
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 14, 2024
    Dataset authored and provided by
    HKUST NLP Group
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    ๐ŸŽฏ DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

    ๐Ÿ“ Paper@arXiv | ๐Ÿค— Datasets&Models@HF | ๐Ÿฑ Code@GitHub ๐Ÿฆ Thread@X(Twitter) | ๐Ÿถ ไธญๆ–‡ๅšๅฎข@็ŸฅไนŽ | ๐Ÿ“Š Leaderboard@PapersWithCode | ๐Ÿ“‘ BibTeX

    [!IMPORTANT] ๐Ÿ”ฅ Excited to find our DART-Math-DSMath-7B (Prop2Diff) trained on DART-Math-Hard comparable to the AIMO winner NuminaMath-7B on CoT, but based solely on MATH & GSM8K prompt set, leaving much room to improve! Besides, our DART method is also fully compatibleโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/dart-math-hard.

  2. h

    dart-math-uniform

    • huggingface.co
    Updated Jul 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HKUST NLP Group (2024). dart-math-uniform [Dataset]. https://huggingface.co/datasets/hkust-nlp/dart-math-uniform
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 15, 2024
    Dataset authored and provided by
    HKUST NLP Group
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    ๐ŸŽฏ DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

    ๐Ÿ“ Paper@arXiv | ๐Ÿค— Datasets&Models@HF | ๐Ÿฑ Code@GitHub ๐Ÿฆ Thread@X(Twitter) | ๐Ÿถ ไธญๆ–‡ๅšๅฎข@็ŸฅไนŽ | ๐Ÿ“Š Leaderboard@PapersWithCode | ๐Ÿ“‘ BibTeX

      Datasets: DART-Math
    

    DART-Math datasets are the state-of-the-art and data-efficientopen-source instruction tuning datasets for mathematical reasoning.

    Figure 1: Left: Average accuracy on 6 mathematical benchmarks. We compare with modelsโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/dart-math-uniform.

  3. h

    dart-math-pool-math

    • huggingface.co
    Updated Feb 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HKUST NLP Group (2025). dart-math-pool-math [Dataset]. https://huggingface.co/datasets/hkust-nlp/dart-math-pool-math
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 19, 2025
    Dataset authored and provided by
    HKUST NLP Group
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    [!NOTE] This dataset is the data pool synthesized from the query set of the MATH training set, containing all answer-correct samples and other metadata produced during the work. DART-Math-* datasets are extracted from dart-math-pool-* data pools.

      ๐ŸŽฏ DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving
    

    ๐Ÿ“ Paper@arXiv | ๐Ÿค— Datasets&Models@HF | ๐Ÿฑ Code@GitHub ๐Ÿฆ Thread@X(Twitter) | ๐Ÿถ ไธญๆ–‡ๅšๅฎข@็ŸฅไนŽ | ๐Ÿ“Š Leaderboard@PapersWithCode | ๐Ÿ“‘ BibTeX

      Datasets:โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/dart-math-pool-math.
    
  4. h

    dart-math-diff

    • huggingface.co
    Updated Apr 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ziyue Li (2025). dart-math-diff [Dataset]. https://huggingface.co/datasets/Litzy619/dart-math-diff
    Explore at:
    Dataset updated
    Apr 26, 2025
    Authors
    Ziyue Li
    Description

    Litzy619/dart-math-diff dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    dart-math-pool-gsm8k-query-info

    • huggingface.co
    Updated Feb 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HKUST NLP Group (2025). dart-math-pool-gsm8k-query-info [Dataset]. https://huggingface.co/datasets/hkust-nlp/dart-math-pool-gsm8k-query-info
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 19, 2025
    Dataset authored and provided by
    HKUST NLP Group
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    [!NOTE] This dataset is the synthesis information of queries from the GSM8K training set, such as the numbers of raw/correct samples of each synthesis job. Usually used with dart-math-pool-gsm8k.

      ๐ŸŽฏ DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving
    

    ๐Ÿ“ Paper@arXiv | ๐Ÿค— Datasets&Models@HF | ๐Ÿฑ Code@GitHub ๐Ÿฆ Thread@X(Twitter) | ๐Ÿถ ไธญๆ–‡ๅšๅฎข@็ŸฅไนŽ | ๐Ÿ“Š Leaderboard@PapersWithCode | ๐Ÿ“‘ BibTeX

      Datasets: DART-Math
    

    DART-Math datasets are theโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/dart-math-pool-gsm8k-query-info.

  6. h

    dart-math-uniform

    • huggingface.co
    Updated Oct 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    pxy (2024). dart-math-uniform [Dataset]. https://huggingface.co/datasets/pxyyy/dart-math-uniform
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 17, 2024
    Authors
    pxy
    Description

    Dataset Card for "dart-math-uniform"

    More Information needed

  7. h

    dart-math-pool-math-query-info

    • huggingface.co
    Updated Feb 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HKUST NLP Group (2025). dart-math-pool-math-query-info [Dataset]. https://huggingface.co/datasets/hkust-nlp/dart-math-pool-math-query-info
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 19, 2025
    Dataset authored and provided by
    HKUST NLP Group
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    [!NOTE] This dataset is the synthesis information of queries from the MATH training set, such as the numbers of raw/correct samples of each synthesis job. Usually used with dart-math-pool-math.

      ๐ŸŽฏ DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving
    

    ๐Ÿ“ Paper@arXiv | ๐Ÿค— Datasets&Models@HF | ๐Ÿฑ Code@GitHub ๐Ÿฆ Thread@X(Twitter) | ๐Ÿถ ไธญๆ–‡ๅšๅฎข@็ŸฅไนŽ | ๐Ÿ“Š Leaderboard@PapersWithCode | ๐Ÿ“‘ BibTeX

      Datasets: DART-Math
    

    DART-Math datasets are the state-of-the-artโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/dart-math-pool-math-query-info.

  8. h

    hkust-nlp_dart-math-hard_scored

    • huggingface.co
    Updated Jun 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenDataArena (2024). hkust-nlp_dart-math-hard_scored [Dataset]. https://huggingface.co/datasets/OpenDataArena/hkust-nlp_dart-math-hard_scored
    Explore at:
    Dataset updated
    Jun 5, 2024
    Authors
    OpenDataArena
    Description

    Dart-math-hard_scored - with OpenDataArena Scores

    This dataset is a scored version of the original hkust-nlp/dart-math-hard dataset. The scoring was performed using the OpenDataArena-Tool, a comprehensive suite of automated evaluation methods for assessing instruction-following datasets. This version of the dataset includes rich, multi-dimensional scores for both the instructions (questions) and the instruction-response pairs, allowing for highly granular data analysis andโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/OpenDataArena/hkust-nlp_dart-math-hard_scored.

  9. h

    rlhflow_mixture_mod_scalebiosampled-20k

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    pxy, rlhflow_mixture_mod_scalebiosampled-20k [Dataset]. https://huggingface.co/datasets/pxyyy/rlhflow_mixture_mod_scalebiosampled-20k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    pxy
    Description

    Dataset Card for "rlhflow_mixture_mod_scalebiosampled-20k"

    weight = { 'MathInstruct': 0.17918314039707184, 'SlimOrca': 0.1572466790676117, 'Magicoder-Evol-Instruct-110K': 0.1262860894203186, 'dart-math-uniform': 0.10912656784057617, 'GPTeacher-General-Instruct': 0.10593341290950775, 'GPT4-LLM-Cleaned': 0.09206369519233704, 'WizardLM_evol_instruct_V2_196k': 0.07409033179283142, 'UltraInteract_sft': 0.056401610374450684, 'orca-math-word-problems-200k': 0.054032713174819946โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/pxyyy/rlhflow_mixture_mod_scalebiosampled-20k.

  10. h

    rlhflow_mixture_intuitive_sampled-20k

    • huggingface.co
    Updated Nov 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    pxy (2024). rlhflow_mixture_intuitive_sampled-20k [Dataset]. https://huggingface.co/datasets/pxyyy/rlhflow_mixture_intuitive_sampled-20k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 20, 2024
    Authors
    pxy
    Description

    Dataset Card for "rlhflow_mixture_intuitive_sampled-20k"

    weight = { 'MathInstruct': 0.1, 'SlimOrca': 0.2, 'Magicoder-Evol-Instruct-110K': 0.1, 'dart-math-uniform': 0.3, 'GPTeacher-General-Instruct': 0.05, 'GPT4-LLM-Cleaned': 0.03, 'WizardLM_evol_instruct_V2_196k': 0.05, 'UltraInteract_sft': 0.1, 'orca-math-word-problems-200k': 0.05, 'ShareGPT_V3_unfiltered_cleaned_split_no_imsorry': 0.02, }

    More Information needed

  11. h

    gsm8k-fix

    • huggingface.co
    Updated Feb 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HKUST NLP Group (2025). gsm8k-fix [Dataset]. https://huggingface.co/datasets/hkust-nlp/gsm8k-fix
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 19, 2025
    Dataset authored and provided by
    HKUST NLP Group
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    GSM8K (Fixed)

    Some erroneous labels exist in the GSM8K dataset. This dataset is fixed from https://github.com/openai/grade-school-math/blob/master/grade_school_math/data/train.jsonl with the code appended at the end. The errors are located by delving into unreasonably low pass rates by the strong DeepSeekMath-7B-RL and hopefully should be exhaustive. This dataset is used by the ๐ŸŽฏDART-Math project to synthesize data.

    [!WARNING] โš ๏ธ Only the training set has been fixed so far.

    forโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/gsm8k-fix.

  12. h

    rlhflow_mixture_scalebio_sampled-nolisa-250k

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    pxy, rlhflow_mixture_scalebio_sampled-nolisa-250k [Dataset]. https://huggingface.co/datasets/pxyyy/rlhflow_mixture_scalebio_sampled-nolisa-250k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    pxy
    Description

    Dataset Card for "rlhflow_mixture_scalebio_sampled-nolisa-600k"

    weight = { 'SlimOrca': 0.34525978565216064, 'dart-math-uniform': 0.23386941850185394, 'GPT4-LLM-Cleaned': 0.19111572206020355, 'MathInstruct': 0.16642746329307556, 'GPTeacher-General-Instruct': 0.042891550809144974, 'ShareGPT_V3_unfiltered_cleaned_split_no_imsorry': 0.006720397621393204, 'UltraInteract_sft': 0.0042861211113631725, 'WizardLM_evol_instruct_V2_196k': 0.004021201748400927, 'Magicoder-Evol-Instruct-110K':โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/pxyyy/rlhflow_mixture_scalebio_sampled-nolisa-250k.

  13. h

    vrt-baseline

    • huggingface.co
    Updated Feb 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HKUST NLP Group (2025). vrt-baseline [Dataset]. https://huggingface.co/datasets/hkust-nlp/vrt-baseline
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 19, 2025
    Dataset authored and provided by
    HKUST NLP Group
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    [!NOTE] This dataset is the VRT baseline dataset used to train baseline models *-VRT in Table 2 of the paper.

    Another ablation baseline to DART is vanilla rejection tuning (VRT), where we synthesize a dataset of the same size of 0.59M examples with DeepSeekMath-7B-RL, using vanilla rejection sampling as described in ยง2.1.

      ๐ŸŽฏ DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving
    

    ๐Ÿ“ Paper@arXiv | ๐Ÿค— Datasets&Models@HF | ๐Ÿฑ Code@GitHub ๐Ÿฆโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/vrt-baseline.

  14. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
HKUST NLP Group (2024). dart-math-hard [Dataset]. https://huggingface.co/datasets/hkust-nlp/dart-math-hard

dart-math-hard

DART-Math-Hard

hkust-nlp/dart-math-hard

Explore at:
9 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 14, 2024
Dataset authored and provided by
HKUST NLP Group
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

๐ŸŽฏ DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

๐Ÿ“ Paper@arXiv | ๐Ÿค— Datasets&Models@HF | ๐Ÿฑ Code@GitHub ๐Ÿฆ Thread@X(Twitter) | ๐Ÿถ ไธญๆ–‡ๅšๅฎข@็ŸฅไนŽ | ๐Ÿ“Š Leaderboard@PapersWithCode | ๐Ÿ“‘ BibTeX

[!IMPORTANT] ๐Ÿ”ฅ Excited to find our DART-Math-DSMath-7B (Prop2Diff) trained on DART-Math-Hard comparable to the AIMO winner NuminaMath-7B on CoT, but based solely on MATH & GSM8K prompt set, leaving much room to improve! Besides, our DART method is also fully compatibleโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/dart-math-hard.

Search
Clear search
Close search
Google apps
Main menu