24 datasets found

HelpSteer2
huggingface.co
Updated Oct 1, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NVIDIA (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 1, 2024
Dataset provided by
Nvidiahttp://nvidia.com/
Authors
NVIDIA
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
HelpSteer2: Open-source dataset for training top-performing reward models

HelpSteer2 is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. This dataset has been created in partnership with Scale AI. When used to tune a Llama 3.1 70B Instruct Model, we achieve 94.1% on RewardBench, which makes it the best Reward Model as… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer2.
h
HelpSteer2-DPO
huggingface.co
Updated Jul 11, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Atsunori Fujita (2024). HelpSteer2-DPO [Dataset]. https://huggingface.co/datasets/Atsunori/HelpSteer2-DPO
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 11, 2024
Authors
Atsunori Fujita
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is a conversion of nvidia/HelpSteer2 into preference pairs based on the helpfulness score for training DPO. HelpSteer2-DPO is also licensed under CC-BY-4.0.

Dataset Description

In accordance with the following paper, HelpSteer2: Open-source dataset for training top-performing reward models we converted nvidia/HelpSteer2 dataset into a preference dataset by taking the response with the higher helpfulness score as the chosen response, with the remaining response being the… See the full description on the dataset page: https://huggingface.co/datasets/Atsunori/HelpSteer2-DPO.
h
HelpSteer2
huggingface.co
Updated Jul 29, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Juyoung Suk (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/juyoungml/HelpSteer2
Explore at:
Dataset updated
Jul 29, 2024
Authors
Juyoung Suk
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
juyoungml/HelpSteer2 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
HelpSteer2
huggingface.co
Updated Jul 4, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Matt (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/stallone/HelpSteer2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 4, 2024
Authors
Matt
Description
A reformatted version of nvidia/HelpSteer2 into both a multiturn config conversation and completion config config. A v4 UUID doc_id is shared across the same document in each config, source, conversation, and completion.
h
nvidia-HelpSteer2-ShareGPT
huggingface.co
Updated Feb 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nitral (2025). nvidia-HelpSteer2-ShareGPT [Dataset]. https://huggingface.co/datasets/Nitral-AI/nvidia-HelpSteer2-ShareGPT
Explore at:
Dataset updated
Feb 12, 2025
Authors
Nitral
Description
Nitral-AI/nvidia-HelpSteer2-ShareGPT dataset hosted on Hugging Face and contributed by the HF Datasets community
h
helpsteer2-preference-v2
huggingface.co
Updated Oct 21, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gagan Bhatia (2024). helpsteer2-preference-v2 [Dataset]. https://huggingface.co/datasets/gagan3012/helpsteer2-preference-v2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 21, 2024
Authors
Gagan Bhatia
Description
gagan3012/helpsteer2-preference-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
helpsteer2-binarized-granular-tiny
huggingface.co
Updated Mar 11, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Root Signals AI (2025). helpsteer2-binarized-granular-tiny [Dataset]. https://huggingface.co/datasets/root-signals/helpsteer2-binarized-granular-tiny
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 11, 2025
Dataset authored and provided by
Root Signals AI
Description
This is the nvidia/Helpsteer2 training split binarized and sorted by length using the Llama3 tokenizer and categorized into multi- vs. single-turn subparts. The 500 splits contain chosen responses between 500-1000 tokens, the 1000 split 1000+ tokens. A multi-turn example requires at least one pair of User and Assistant besides the main resposne to be categorized as such. If you don't care, there is a combined split, which includes everything just binarized, but note that ids are not the same… See the full description on the dataset page: https://huggingface.co/datasets/root-signals/helpsteer2-binarized-granular-tiny.
h
helpsteer2-helpfulness-preference
huggingface.co
Updated Jun 1, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jenny Shen (2025). helpsteer2-helpfulness-preference [Dataset]. https://huggingface.co/datasets/Jennny/helpsteer2-helpfulness-preference
Explore at:
Dataset updated
Jun 1, 2025
Authors
Jenny Shen
Description
Citation

@misc{wang2024helpsteer2preferencecomplementingratingspreferences, title={HelpSteer2-Preference: Complementing Ratings with Preferences}, author={Zhilin Wang and Alexander Bukharin and Olivier Delalleau and Daniel Egert and Gerald Shen and Jiaqi Zeng and Oleksii Kuchaiev and Yi Dong}, year={2024}, eprint={2410.01257}, archivePrefix={arXiv}, primaryClass={cs.LG}, url={https://arxiv.org/abs/2410.01257}, }

@misc{wang2024helpsteer2… See the full description on the dataset page: https://huggingface.co/datasets/Jennny/helpsteer2-helpfulness-preference.
nvidia-HelpSteer2-group-label_normalized
huggingface.co
Updated Feb 6, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pi Labs Inc. (2025). nvidia-HelpSteer2-group-label_normalized [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer2-group-label_normalized
Explore at:
Dataset updated
Feb 6, 2025
Dataset provided by
Pi Labs, Inc.
Authors
Pi Labs Inc.
Description
withpi/nvidia-HelpSteer2-group-label_normalized dataset hosted on Hugging Face and contributed by the HF Datasets community
nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro
huggingface.co
Updated Mar 30, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pi Labs Inc. (2025). nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro
Explore at:
Dataset updated
Mar 30, 2025
Dataset provided by
Pi Labs, Inc.
Authors
Pi Labs Inc.
Description
withpi/nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro dataset hosted on Hugging Face and contributed by the HF Datasets community
nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1
huggingface.co
Updated Apr 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pi Labs Inc. (2025). nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1 [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1
Explore at:
Dataset updated
Apr 28, 2025
Dataset provided by
Pi Labs, Inc.
Authors
Pi Labs Inc.
Description
withpi/nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
Hydrus-HelpSteer2
huggingface.co
Updated Feb 27, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DV (2025). Hydrus-HelpSteer2 [Dataset]. https://huggingface.co/datasets/Delta-Vector/Hydrus-HelpSteer2
Explore at:
Dataset updated
Feb 27, 2025
Authors
DV
Description
Delta-Vector/Hydrus-HelpSteer2 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
llama3.2-3b-instruct-helpsteer2
huggingface.co
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
li, llama3.2-3b-instruct-helpsteer2 [Dataset]. https://huggingface.co/datasets/mimasss/llama3.2-3b-instruct-helpsteer2
Explore at:
Authors
li
Description
mimasss/llama3.2-3b-instruct-helpsteer2 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
helpsteer2-standard
huggingface.co
Updated Sep 12, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chris (Yuhao) Liu (2024). helpsteer2-standard [Dataset]. https://huggingface.co/datasets/chrisliu298/helpsteer2-standard
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 12, 2024
Authors
Chris (Yuhao) Liu
Description
chrisliu298/helpsteer2-standard dataset hosted on Hugging Face and contributed by the HF Datasets community
h
HelpSteer2-Preference-WarmStart
huggingface.co
Updated Jun 8, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hanchi Sun (2025). HelpSteer2-Preference-WarmStart [Dataset]. https://huggingface.co/datasets/MasterGodzilla/HelpSteer2-Preference-WarmStart
Explore at:
Dataset updated
Jun 8, 2025
Authors
Hanchi Sun
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
MasterGodzilla/HelpSteer2-Preference-WarmStart dataset hosted on Hugging Face and contributed by the HF Datasets community
h
HelpSteer2-incoherent
huggingface.co
Updated Oct 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
wave (2024). HelpSteer2-incoherent [Dataset]. https://huggingface.co/datasets/wave-on-discord/HelpSteer2-incoherent
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 22, 2024
Authors
wave
Description
Dataset Card for "HelpSteer2-incoherent"

More Information needed
h
preprocessed-helpsteer2-train-10k
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sangeon Park, preprocessed-helpsteer2-train-10k [Dataset]. https://huggingface.co/datasets/saepark/preprocessed-helpsteer2-train-10k
Explore at:
Authors
Sangeon Park
Description
saepark/preprocessed-helpsteer2-train-10k dataset hosted on Hugging Face and contributed by the HF Datasets community
h
helpsteer2-rewardbench-contamination
huggingface.co
Updated Sep 30, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Saumya Malik (2021). helpsteer2-rewardbench-contamination [Dataset]. https://huggingface.co/datasets/saumyamalik/helpsteer2-rewardbench-contamination
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 30, 2021
Authors
Saumya Malik
Description
saumyamalik/helpsteer2-rewardbench-contamination dataset hosted on Hugging Face and contributed by the HF Datasets community
h
preprocessed-helpsteer2-test-500
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sangeon Park, preprocessed-helpsteer2-test-500 [Dataset]. https://huggingface.co/datasets/saepark/preprocessed-helpsteer2-test-500
Explore at:
Authors
Sangeon Park
Description
saepark/preprocessed-helpsteer2-test-500 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
helpsteer2_preference
huggingface.co
Updated Jun 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hsu Shihyueh (2025). helpsteer2_preference [Dataset]. https://huggingface.co/datasets/AIR-hl/helpsteer2_preference
Explore at:
Dataset updated
Jun 8, 2025
Authors
Hsu Shihyueh
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Introduction

This is a binarized preference datasets from nvidia/HelpSteer2. HelpSteer2 is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. This dataset has been created in partnership with Scale AI. I processed the raw data by prioritizing helpfulness, correctness, and coherence to determine which responses were chosen… See the full description on the dataset page: https://huggingface.co/datasets/AIR-hl/helpsteer2_preference.

Facebook

Twitter

Click to copy link

Link copied

Cite

NVIDIA (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer2

HelpSteer2

nvidia/HelpSteer2

Explore at:

242 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Oct 1, 2024

Dataset provided by

Nvidiahttp://nvidia.com/

Authors

NVIDIA

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

HelpSteer2: Open-source dataset for training top-performing reward models

HelpSteer2 is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. This dataset has been created in partnership with Scale AI. When used to tune a Llama 3.1 70B Instruct Model, we achieve 94.1% on RewardBench, which makes it the best Reward Model as… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer2.

Clear search

Close search

Google apps

Main menu

HelpSteer2

HelpSteer2-DPO

HelpSteer2

HelpSteer2

nvidia-HelpSteer2-ShareGPT

helpsteer2-preference-v2

helpsteer2-binarized-granular-tiny

helpsteer2-helpfulness-preference

nvidia-HelpSteer2-group-label_normalized

nvidia-HelpSteer2-group-label-v2_tokenized_16k_euro

nvidia-HelpSteer2-group-label-v2_euro_st_tokenized_32k_1

Hydrus-HelpSteer2

llama3.2-3b-instruct-helpsteer2

helpsteer2-standard

HelpSteer2-Preference-WarmStart

HelpSteer2-incoherent

preprocessed-helpsteer2-train-10k

helpsteer2-rewardbench-contamination

preprocessed-helpsteer2-test-500

helpsteer2_preference

HelpSteer2See More Versions

HelpSteer2

nvidia/HelpSteer2

HelpSteer2