41 datasets found
  1. HelpSteer

    • huggingface.co
    Updated Nov 16, 2023
    Cite
    NVIDIA (2023). HelpSteer [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer
    Dataset updated
    Nov 16, 2023
    Dataset provided by
    Nvidia (http://nvidia.com/)
    Authors
    NVIDIA
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    HelpSteer: Helpfulness SteerLM Dataset

    HelpSteer is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. Leveraging this dataset and SteerLM, we train a Llama 2 70B to reach 7.54 on MT Bench, the highest among models trained on open-source datasets based on MT Bench Leaderboard as of 15 Nov 2023. This model is available on… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer.

  2. HelpSteer-filtered

    • huggingface.co
    Updated Jan 15, 2024
    Cite
    Yağız Çalık (2024). HelpSteer-filtered [Dataset]. https://huggingface.co/datasets/Weyaxi/HelpSteer-filtered
    Dataset updated
    Jan 15, 2024
    Authors
    Yağız Çalık
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    HelpSteer-filtered

    This dataset is a highly filtered version of the nvidia/HelpSteer dataset.

      ❓ How this dataset was filtered:

    I calculated the sum of the columns ["helpfulness", "correctness", "coherence", "complexity", "verbosity"] and stored it in a new column named sum.

    I renamed some columns and added an empty column to match the Alpaca format.

    The dataset was then filtered to keep only those entries with a sum greater than or equal to 16.

      🧐 More… See the full description on the dataset page: https://huggingface.co/datasets/Weyaxi/HelpSteer-filtered.
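
    The filtering steps described above could be sketched roughly as follows. This is only an illustration in plain Python, not the author's actual script: the Alpaca-style target names (instruction, input, output), the choice of input as the empty column, and the sample rows are all assumptions.

```python
# Attribute columns named in the description above.
ATTRIBUTES = ["helpfulness", "correctness", "coherence", "complexity", "verbosity"]


def to_filtered_alpaca(rows, threshold=16):
    """Sum the attribute scores, keep rows with sum >= threshold,
    and rename the columns to an Alpaca-style layout."""
    kept = []
    for row in rows:
        total = sum(row[name] for name in ATTRIBUTES)
        if total < threshold:
            continue  # drop rows below the threshold
        kept.append({
            "instruction": row["prompt"],  # assumed rename
            "input": "",                   # assumed to be the added empty column
            "output": row["response"],     # assumed rename
            "sum": total,
        })
    return kept


# Hypothetical rows, for illustration only.
rows = [
    {"prompt": "p1", "response": "r1", "helpfulness": 4, "correctness": 4,
     "coherence": 4, "complexity": 2, "verbosity": 2},  # sum is 16, kept
    {"prompt": "p2", "response": "r2", "helpfulness": 3, "correctness": 3,
     "coherence": 4, "complexity": 2, "verbosity": 2},  # sum is 14, dropped
]

filtered = to_filtered_alpaca(rows)
print(filtered)
```

    Because the threshold is inclusive, a row whose scores sum to exactly 16 is kept.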
    
  3. HelpSteer2

    • huggingface.co
    Updated Oct 1, 2024
    + more versions
    Cite
    NVIDIA (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer2
    Dataset updated
    Oct 1, 2024
    Dataset provided by
    Nvidia (http://nvidia.com/)
    Authors
    NVIDIA
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    HelpSteer2: Open-source dataset for training top-performing reward models

    HelpSteer2 is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. This dataset has been created in partnership with Scale AI. When used to tune a Llama 3.1 70B Instruct Model, we achieve 94.1% on RewardBench, which makes it the best Reward Model as… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer2.

  4. Helpsteer-preference-standard

    • huggingface.co
    Updated May 8, 2024
    + more versions
    Cite
    RLHFlow (2024). Helpsteer-preference-standard [Dataset]. https://huggingface.co/datasets/RLHFlow/Helpsteer-preference-standard
    Dataset updated
    May 8, 2024
    Dataset authored and provided by
    RLHFlow
    Description

    RLHFlow/Helpsteer-preference-standard dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. HelpSteer-AIF

    • huggingface.co
    Updated Sep 2, 2024
    + more versions
    Cite
    Alvaro Bartolome (2024). HelpSteer-AIF [Dataset]. https://huggingface.co/datasets/alvarobartt/HelpSteer-AIF
    Dataset updated
    Sep 2, 2024
    Authors
    Alvaro Bartolome
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    HelpSteer: Helpfulness SteerLM Dataset

    HelpSteer is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

      Disclaimer
    

    This is only a subset created with distilabel to evaluate the first 1000 rows using AI Feedback (AIF) coming from GPT-4, only created for… See the full description on the dataset page: https://huggingface.co/datasets/alvarobartt/HelpSteer-AIF.

  6. Helpsteer-3-edit-kto-v7

    • huggingface.co
    Updated Jun 6, 2025
    Cite
    New Eden (2025). Helpsteer-3-edit-kto-v7 [Dataset]. https://huggingface.co/datasets/NewEden/Helpsteer-3-edit-kto-v7
    Dataset updated
    Jun 6, 2025
    Dataset authored and provided by
    New Eden
    Description

    NewEden/Helpsteer-3-edit-kto-v7 dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. helpsteer-9k

    • huggingface.co
    Updated May 1, 2025
    Cite
    Caden Juang (2025). helpsteer-9k [Dataset]. https://huggingface.co/datasets/kh4dien/helpsteer-9k
    Dataset updated
    May 1, 2025
    Authors
    Caden Juang
    Description

    kh4dien/helpsteer-9k dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. Helpsteer-3-edit

    • huggingface.co
    Updated Jun 8, 2025
    + more versions
    Cite
    New Eden (2025). Helpsteer-3-edit [Dataset]. https://huggingface.co/datasets/NewEden/Helpsteer-3-edit
    Dataset updated
    Jun 8, 2025
    Dataset authored and provided by
    New Eden
    Description

    NewEden/Helpsteer-3-edit dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. helpsteer

    • huggingface.co
    Updated Aug 15, 2007
    Cite
    Yongyuan Liang (2007). helpsteer [Dataset]. https://huggingface.co/datasets/cheryyunl/helpsteer
    Dataset updated
    Aug 15, 2007
    Authors
    Yongyuan Liang
    Description

    cheryyunl/helpsteer dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. nvidia-HelpSteer-group-label_normalized

    • huggingface.co
    Updated Feb 6, 2025
    + more versions
    Cite
    Pi Labs Inc. (2025). nvidia-HelpSteer-group-label_normalized [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer-group-label_normalized
    Dataset updated
    Feb 6, 2025
    Dataset provided by
    Pi Labs, Inc.
    Authors
    Pi Labs Inc.
    Description

    withpi/nvidia-HelpSteer-group-label_normalized dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1

    • huggingface.co
    Updated Apr 28, 2025
    + more versions
    Cite
    Pi Labs Inc. (2025). nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1 [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1
    Dataset updated
    Apr 28, 2025
    Dataset provided by
    Pi Labs, Inc.
    Authors
    Pi Labs Inc.
    Description

    withpi/nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. Helpsteer-3-Edit-ShareGPT

    • huggingface.co
    Updated Jun 1, 2025
    Cite
    Mango (2025). Helpsteer-3-Edit-ShareGPT [Dataset]. https://huggingface.co/datasets/Delta-Vector/Helpsteer-3-Edit-ShareGPT
    Dataset updated
    Jun 1, 2025
    Authors
    Mango
    Description

    Delta-Vector/Helpsteer-3-Edit-ShareGPT dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1

    • huggingface.co
    Updated Mar 31, 2025
    + more versions
    Cite
    Pi Labs Inc. (2025). nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1 [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1
    Dataset updated
    Mar 31, 2025
    Dataset provided by
    Pi Labs, Inc.
    Authors
    Pi Labs Inc.
    Description

    withpi/nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. helpsteer-llama2-1k

    • huggingface.co
    Updated Dec 18, 2024
    + more versions
    Cite
    NIKHIL (2024). helpsteer-llama2-1k [Dataset]. https://huggingface.co/datasets/nikhilkumarreddy28/helpsteer-llama2-1k
    Dataset updated
    Dec 18, 2024
    Authors
    NIKHIL
    Description

    nikhilkumarreddy28/helpsteer-llama2-1k dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1

    • huggingface.co
    Updated Apr 3, 2025
    + more versions
    Cite
    Pi Labs Inc. (2025). nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1 [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1
    Dataset updated
    Apr 3, 2025
    Dataset provided by
    Pi Labs, Inc.
    Authors
    Pi Labs Inc.
    Description

    withpi/nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. helpsteer-correctness

    • huggingface.co
    Updated Aug 18, 2009
    Cite
    Yongyuan Liang (2009). helpsteer-correctness [Dataset]. https://huggingface.co/datasets/cheryyunl/helpsteer-correctness
    Dataset updated
    Aug 18, 2009
    Authors
    Yongyuan Liang
    Description

    Helpsteer-correctness

    This dataset is derived from NVIDIA's HelpSteer dataset, processed specifically for preference learning on the correctness dimension.
    
    - Train split: 27417 examples
    - Test split: 1416 examples
    
    ## Format
    
    Each example contains the following fields:
    - `prompt`: Question with "Human:" prefix and "Assistant:" suffix
    - `chosen`: The response with higher correctness score
    - `rejected`: The response with lower correctness score
    -… See the full description on the dataset page: https://huggingface.co/datasets/cheryyunl/helpsteer-correctness.
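
    One way such preference pairs might be derived from per-response scores is sketched below. The pairing rule (all response pairs per prompt, ties skipped), the exact "Human: ... Assistant:" template, and the sample rows are assumptions for illustration; the dataset page does not state how the pairs were actually built.

```python
from collections import defaultdict
from itertools import combinations


def build_pairs(rows, attribute="correctness"):
    """Pair up responses to the same prompt; the higher-scored one is 'chosen'."""
    by_prompt = defaultdict(list)
    for row in rows:
        by_prompt[row["prompt"]].append(row)

    pairs = []
    for prompt, group in by_prompt.items():
        for a, b in combinations(group, 2):
            if a[attribute] == b[attribute]:
                continue  # equal scores carry no preference signal
            better, worse = (a, b) if a[attribute] > b[attribute] else (b, a)
            pairs.append({
                # Assumed template for the "Human:" prefix and "Assistant:" suffix.
                "prompt": f"Human: {prompt}\n\nAssistant:",
                "chosen": better["response"],
                "rejected": worse["response"],
            })
    return pairs


# Hypothetical rows, for illustration only.
rows = [
    {"prompt": "Capital of France?", "response": "Paris.", "correctness": 4},
    {"prompt": "Capital of France?", "response": "Lyon.", "correctness": 0},
    {"prompt": "2 + 2?", "response": "4", "correctness": 4},
    {"prompt": "2 + 2?", "response": "Four.", "correctness": 4},  # tie, skipped
]

pairs = build_pairs(rows)
print(pairs)
```

    Passing a different attribute name, such as "coherence" or "helpfulness", would apply the same rule to another score column.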
    
  17. helpsteer-coherence

    • huggingface.co
    Updated Aug 18, 2009
    Cite
    Yongyuan Liang (2009). helpsteer-coherence [Dataset]. https://huggingface.co/datasets/cheryyunl/helpsteer-coherence
    Dataset updated
    Aug 18, 2009
    Authors
    Yongyuan Liang
    Description

    Helpsteer-coherence

    This dataset is derived from NVIDIA's HelpSteer dataset, processed specifically for preference learning on the coherence dimension.
    
    - Train split: 22876 examples
    - Test split: 1131 examples
    
    ## Format
    
    Each example contains the following fields:
    - `prompt`: Question with "Human:" prefix and "Assistant:" suffix
    - `chosen`: The response with higher coherence score
    - `rejected`: The response with lower coherence score
    -… See the full description on the dataset page: https://huggingface.co/datasets/cheryyunl/helpsteer-coherence.
    
  18. helpsteer-helpfulness

    • huggingface.co
    Updated May 29, 2025
    Cite
    Yongyuan Liang (2025). helpsteer-helpfulness [Dataset]. https://huggingface.co/datasets/cheryyunl/helpsteer-helpfulness
    Dataset updated
    May 29, 2025
    Authors
    Yongyuan Liang
    Description

    Helpsteer-helpfulness

    This dataset is derived from NVIDIA's HelpSteer dataset, processed specifically for preference learning on the helpfulness dimension.
    
    - Train split: 28115 examples
    - Test split: 1410 examples
    
    ## Format
    
    Each example contains the following fields:
    - `prompt`: Question with "Human:" prefix and "Assistant:" suffix
    - `chosen`: The response with higher helpfulness score
    - `rejected`: The response with lower helpfulness score
    -… See the full description on the dataset page: https://huggingface.co/datasets/cheryyunl/helpsteer-helpfulness.
    
  19. Tauri-Helpsteer-3-Preference-KTO

    • huggingface.co
    Updated May 27, 2025
    Cite
    Mango (2025). Tauri-Helpsteer-3-Preference-KTO [Dataset]. https://huggingface.co/datasets/Delta-Vector/Tauri-Helpsteer-3-Preference-KTO
    Dataset updated
    May 27, 2025
    Authors
    Mango
    Description

    Delta-Vector/Tauri-Helpsteer-3-Preference-KTO dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. Helpsteer-3-Pref

    • huggingface.co
    Updated Jun 8, 2025
    Cite
    New Eden (2025). Helpsteer-3-Pref [Dataset]. https://huggingface.co/datasets/NewEden/Helpsteer-3-Pref
    Dataset updated
    Jun 8, 2025
    Dataset authored and provided by
    New Eden
    Description

    NewEden/Helpsteer-3-Pref dataset hosted on Hugging Face and contributed by the HF Datasets community

HelpSteer

nvidia/HelpSteer

Helpfulness SteerLM Dataset

254 scholarly articles cite this dataset (View in Google Scholar)