24 datasets found
  1. HelpSteer2

    • huggingface.co
    Updated Oct 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NVIDIA (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer2
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 1, 2024
    Dataset provided by
    Nvidiahttp://nvidia.com/
    Authors
    NVIDIA
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    HelpSteer2: Open-source dataset for training top-performing reward models

    HelpSteer2 is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. This dataset has been created in partnership with Scale AI. When used to tune a Llama 3.1 70B Instruct Model, we achieve 94.1% on RewardBench, which makes it the best Reward Model asโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer2.

  2. h

    HelpSteer2-DPO

    • huggingface.co
    Updated Jul 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Atsunori Fujita (2024). HelpSteer2-DPO [Dataset]. https://huggingface.co/datasets/Atsunori/HelpSteer2-DPO
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 11, 2024
    Authors
    Atsunori Fujita
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is a conversion of nvidia/HelpSteer2 into preference pairs based on the helpfulness score for training DPO. HelpSteer2-DPO is also licensed under CC-BY-4.0.

      Dataset Description
    

    In accordance with the following paper, HelpSteer2: Open-source dataset for training top-performing reward models we converted nvidia/HelpSteer2 dataset into a preference dataset by taking the response with the higher helpfulness score as the chosen response, with the remaining response being theโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/Atsunori/HelpSteer2-DPO.

  3. h

    HelpSteer2

    • huggingface.co
    Updated Jul 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Juyoung Suk (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/juyoungml/HelpSteer2
    Explore at:
    Dataset updated
    Jul 29, 2024
    Authors
    Juyoung Suk
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    juyoungml/HelpSteer2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    HelpSteer2

    • huggingface.co
    Updated Jul 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matt (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/stallone/HelpSteer2
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 4, 2024
    Authors
    Matt
    Description

    A reformatted version of nvidia/HelpSteer2 into both a multiturn config conversation and completion config config. A v4 UUID doc_id is shared across the same document in each config, source, conversation, and completion.

  5. h

    nvidia-HelpSteer2-ShareGPT

    • huggingface.co
    Updated Feb 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nitral (2025). nvidia-HelpSteer2-ShareGPT [Dataset]. https://huggingface.co/datasets/Nitral-AI/nvidia-HelpSteer2-ShareGPT
    Explore at:
    Dataset updated
    Feb 12, 2025
    Authors
    Nitral
    Description

    Nitral-AI/nvidia-HelpSteer2-ShareGPT dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    helpsteer2-preference-v2

    • huggingface.co
    Updated Oct 21, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gagan Bhatia (2024). helpsteer2-preference-v2 [Dataset]. https://huggingface.co/datasets/gagan3012/helpsteer2-preference-v2
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 21, 2024
    Authors
    Gagan Bhatia
    Description

    gagan3012/helpsteer2-preference-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    helpsteer2-binarized-granular-tiny

    • huggingface.co
    Updated Mar 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Root Signals AI (2025). helpsteer2-binarized-granular-tiny [Dataset]. https://huggingface.co/datasets/root-signals/helpsteer2-binarized-granular-tiny
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 11, 2025
    Dataset authored and provided by
    Root Signals AI
    Description

    This is the nvidia/Helpsteer2 training split binarized and sorted by length using the Llama3 tokenizer and categorized into multi- vs. single-turn subparts. The 500 splits contain chosen responses between 500-1000 tokens, the 1000 split 1000+ tokens. A multi-turn example requires at least one pair of User and Assistant besides the main resposne to be categorized as such. If you don't care, there is a combined split, which includes everything just binarized, but note that ids are not the sameโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/root-signals/helpsteer2-binarized-granular-tiny.

  8. h

    helpsteer2-helpfulness-preference

    • huggingface.co
    Updated Jun 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jenny Shen (2025). helpsteer2-helpfulness-preference [Dataset]. https://huggingface.co/datasets/Jennny/helpsteer2-helpfulness-preference
    Explore at:
    Dataset updated
    Jun 1, 2025
    Authors
    Jenny Shen
    Description

    Citation

    @misc{wang2024helpsteer2preferencecomplementingratingspreferences, title={HelpSteer2-Preference: Complementing Ratings with Preferences}, author={Zhilin Wang and Alexander Bukharin and Olivier Delalleau and Daniel Egert and Gerald Shen and Jiaqi Zeng and Oleksii Kuchaiev and Yi Dong}, year={2024}, eprint={2410.01257}, archivePrefix={arXiv}, primaryClass={cs.LG}, url={https://arxiv.org/abs/2410.01257}, }

    @misc{wang2024helpsteer2โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/Jennny/helpsteer2-helpfulness-preference.

  9. nvidia-HelpSteer2-group-label_normalized

    • huggingface.co
    Updated Feb 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pi Labs Inc. (2025). nvidia-HelpSteer2-group-label_normalized [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer2-group-label_normalized
    Explore at:
    Dataset updated
    Feb 6, 2025
    Dataset provided by
    Pi Labs, Inc.
    Authors
    Pi Labs Inc.
    Description

    withpi/nvidia-HelpSteer2-group-label_normalized dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro

    • huggingface.co
    Updated Mar 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pi Labs Inc. (2025). nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro
    Explore at:
    Dataset updated
    Mar 30, 2025
    Dataset provided by
    Pi Labs, Inc.
    Authors
    Pi Labs Inc.
    Description

    withpi/nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1

    • huggingface.co
    Updated Apr 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pi Labs Inc. (2025). nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1 [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1
    Explore at:
    Dataset updated
    Apr 28, 2025
    Dataset provided by
    Pi Labs, Inc.
    Authors
    Pi Labs Inc.
    Description

    withpi/nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. h

    Hydrus-HelpSteer2

    • huggingface.co
    Updated Feb 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DV (2025). Hydrus-HelpSteer2 [Dataset]. https://huggingface.co/datasets/Delta-Vector/Hydrus-HelpSteer2
    Explore at:
    Dataset updated
    Feb 27, 2025
    Authors
    DV
    Description

    Delta-Vector/Hydrus-HelpSteer2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    llama3.2-3b-instruct-helpsteer2

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    li, llama3.2-3b-instruct-helpsteer2 [Dataset]. https://huggingface.co/datasets/mimasss/llama3.2-3b-instruct-helpsteer2
    Explore at:
    Authors
    li
    Description

    mimasss/llama3.2-3b-instruct-helpsteer2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. h

    helpsteer2-standard

    • huggingface.co
    Updated Sep 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chris (Yuhao) Liu (2024). helpsteer2-standard [Dataset]. https://huggingface.co/datasets/chrisliu298/helpsteer2-standard
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2024
    Authors
    Chris (Yuhao) Liu
    Description

    chrisliu298/helpsteer2-standard dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. h

    HelpSteer2-Preference-WarmStart

    • huggingface.co
    Updated Jun 8, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hanchi Sun (2025). HelpSteer2-Preference-WarmStart [Dataset]. https://huggingface.co/datasets/MasterGodzilla/HelpSteer2-Preference-WarmStart
    Explore at:
    Dataset updated
    Jun 8, 2025
    Authors
    Hanchi Sun
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MasterGodzilla/HelpSteer2-Preference-WarmStart dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    HelpSteer2-incoherent

    • huggingface.co
    Updated Oct 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wave (2024). HelpSteer2-incoherent [Dataset]. https://huggingface.co/datasets/wave-on-discord/HelpSteer2-incoherent
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 22, 2024
    Authors
    wave
    Description

    Dataset Card for "HelpSteer2-incoherent"

    More Information needed

  17. h

    preprocessed-helpsteer2-train-10k

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sangeon Park, preprocessed-helpsteer2-train-10k [Dataset]. https://huggingface.co/datasets/saepark/preprocessed-helpsteer2-train-10k
    Explore at:
    Authors
    Sangeon Park
    Description

    saepark/preprocessed-helpsteer2-train-10k dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. h

    helpsteer2-rewardbench-contamination

    • huggingface.co
    Updated Sep 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saumya Malik (2021). helpsteer2-rewardbench-contamination [Dataset]. https://huggingface.co/datasets/saumyamalik/helpsteer2-rewardbench-contamination
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 30, 2021
    Authors
    Saumya Malik
    Description

    saumyamalik/helpsteer2-rewardbench-contamination dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    preprocessed-helpsteer2-test-500

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sangeon Park, preprocessed-helpsteer2-test-500 [Dataset]. https://huggingface.co/datasets/saepark/preprocessed-helpsteer2-test-500
    Explore at:
    Authors
    Sangeon Park
    Description

    saepark/preprocessed-helpsteer2-test-500 dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    helpsteer2_preference

    • huggingface.co
    Updated Jun 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hsu Shihyueh (2025). helpsteer2_preference [Dataset]. https://huggingface.co/datasets/AIR-hl/helpsteer2_preference
    Explore at:
    Dataset updated
    Jun 8, 2025
    Authors
    Hsu Shihyueh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Introduction

    This is a binarized preference datasets from nvidia/HelpSteer2. HelpSteer2 is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. This dataset has been created in partnership with Scale AI. I processed the raw data by prioritizing helpfulness, correctness, and coherence to determine which responses were chosenโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/AIR-hl/helpsteer2_preference.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
NVIDIA (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer2
Organization logo

HelpSteer2

HelpSteer2

nvidia/HelpSteer2

Explore at:
242 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 1, 2024
Dataset provided by
Nvidiahttp://nvidia.com/
Authors
NVIDIA
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

HelpSteer2: Open-source dataset for training top-performing reward models

HelpSteer2 is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. This dataset has been created in partnership with Scale AI. When used to tune a Llama 3.1 70B Instruct Model, we achieve 94.1% on RewardBench, which makes it the best Reward Model asโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer2.

Search
Clear search
Close search
Google apps
Main menu