41 datasets found

HelpSteer
huggingface.co
Updated Nov 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NVIDIA (2023). HelpSteer [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 16, 2023
Dataset provided by
Nvidiahttp://nvidia.com/
Authors
NVIDIA
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
HelpSteer: Helpfulness SteerLM Dataset

HelpSteer is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. Leveraging this dataset and SteerLM, we train a Llama 2 70B to reach 7.54 on MT Bench, the highest among models trained on open-source datasets based on MT Bench Leaderboard as of 15 Nov 2023. This model is available on… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer.
h
HelpSteer-filtered
huggingface.co
Updated Jan 15, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yağız Çalık (2024). HelpSteer-filtered [Dataset]. https://huggingface.co/datasets/Weyaxi/HelpSteer-filtered
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 15, 2024
Authors
Yağız Çalık
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
HelpSteer-filtered

This dataset is a highly filtered version of the nvidia/HelpSteer dataset.

❓ How this dataset was filtered:

I calculated the sum of the columns ["helpfulness," "correctness," "coherence," "complexity," "verbosity"] and created a new column named sum.

I changed some column names and added a empty column to match the Alpaca format.

The dataset was then filtered to include only those entries with a sum greater than or equal to 16.

🧐 More… See the full description on the dataset page: https://huggingface.co/datasets/Weyaxi/HelpSteer-filtered.
HelpSteer2
huggingface.co
Updated Oct 1, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NVIDIA (2024). HelpSteer2 [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer2
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 1, 2024
Dataset provided by
Nvidiahttp://nvidia.com/
Authors
NVIDIA
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
HelpSteer2: Open-source dataset for training top-performing reward models

HelpSteer2 is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. This dataset has been created in partnership with Scale AI. When used to tune a Llama 3.1 70B Instruct Model, we achieve 94.1% on RewardBench, which makes it the best Reward Model as… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer2.
h
Helpsteer-preference-standard
huggingface.co
Updated May 8, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
RLHFlow (2024). Helpsteer-preference-standard [Dataset]. https://huggingface.co/datasets/RLHFlow/Helpsteer-preference-standard
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 8, 2024
Dataset authored and provided by
RLHFlow
Description
RLHFlow/Helpsteer-preference-standard dataset hosted on Hugging Face and contributed by the HF Datasets community
h
HelpSteer-AIF
huggingface.co
Updated Sep 2, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alvaro Bartolome (2024). HelpSteer-AIF [Dataset]. https://huggingface.co/datasets/alvarobartt/HelpSteer-AIF
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 2, 2024
Authors
Alvaro Bartolome
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
HelpSteer: Helpfulness SteerLM Dataset

HelpSteer is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Disclaimer

This is only a subset created with distilabel to evaluate the first 1000 rows using AI Feedback (AIF) coming from GPT-4, only created for… See the full description on the dataset page: https://huggingface.co/datasets/alvarobartt/HelpSteer-AIF.
h
Helpsteer-3-edit-kto-v7
huggingface.co
Updated Jun 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
New Eden (2025). Helpsteer-3-edit-kto-v7 [Dataset]. https://huggingface.co/datasets/NewEden/Helpsteer-3-edit-kto-v7
Explore at:
Dataset updated
Jun 6, 2025
Dataset authored and provided by
New Eden
Description
NewEden/Helpsteer-3-edit-kto-v7 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
helpsteer-9k
huggingface.co
Updated May 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Caden Juang (2025). helpsteer-9k [Dataset]. https://huggingface.co/datasets/kh4dien/helpsteer-9k
Explore at:
Dataset updated
May 1, 2025
Authors
Caden Juang
Description
kh4dien/helpsteer-9k dataset hosted on Hugging Face and contributed by the HF Datasets community
h
Helpsteer-3-edit
huggingface.co
Updated Jun 8, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
New Eden (2025). Helpsteer-3-edit [Dataset]. https://huggingface.co/datasets/NewEden/Helpsteer-3-edit
Explore at:
Dataset updated
Jun 8, 2025
Dataset authored and provided by
New Eden
Description
NewEden/Helpsteer-3-edit dataset hosted on Hugging Face and contributed by the HF Datasets community
h
helpsteer
huggingface.co
Updated Aug 15, 2007
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yongyuan Liang (2007). helpsteer [Dataset]. https://huggingface.co/datasets/cheryyunl/helpsteer
Explore at:
Dataset updated
Aug 15, 2007
Authors
Yongyuan Liang
Description
cheryyunl/helpsteer dataset hosted on Hugging Face and contributed by the HF Datasets community
h
nvidia-HelpSteer-group-label_normalized
huggingface.co
Updated Feb 6, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pi Labs Inc. (2025). nvidia-HelpSteer-group-label_normalized [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer-group-label_normalized
Explore at:
Dataset updated
Feb 6, 2025
Dataset provided by
Pi Labs, Inc.
Authors
Pi Labs Inc.
Description
withpi/nvidia-HelpSteer-group-label_normalized dataset hosted on Hugging Face and contributed by the HF Datasets community
h
nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1
huggingface.co
Updated Apr 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pi Labs Inc. (2025). nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1 [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1
Explore at:
Dataset updated
Apr 28, 2025
Dataset provided by
Pi Labs, Inc.
Authors
Pi Labs Inc.
Description
withpi/nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
Helpsteer-3-Edit-ShareGPT
huggingface.co
Updated Jun 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mango (2025). Helpsteer-3-Edit-ShareGPT [Dataset]. https://huggingface.co/datasets/Delta-Vector/Helpsteer-3-Edit-ShareGPT
Explore at:
Dataset updated
Jun 1, 2025
Authors
Mango
Description
Delta-Vector/Helpsteer-3-Edit-ShareGPT dataset hosted on Hugging Face and contributed by the HF Datasets community
h
nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1
huggingface.co
Updated Mar 31, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pi Labs Inc. (2025). nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1 [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1
Explore at:
Dataset updated
Mar 31, 2025
Dataset provided by
Pi Labs, Inc.
Authors
Pi Labs Inc.
Description
withpi/nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1 dataset hosted on Hugging Face and contributed by the HF Datasets community
h
helpsteer-llama2-1k
huggingface.co
Updated Dec 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NIKHIL (2024). helpsteer-llama2-1k [Dataset]. https://huggingface.co/datasets/nikhilkumarreddy28/helpsteer-llama2-1k
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 18, 2024
Authors
NIKHIL
Description
nikhilkumarreddy28/helpsteer-llama2-1k dataset hosted on Hugging Face and contributed by the HF Datasets community
h
nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1
huggingface.co
Updated Apr 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pi Labs Inc. (2025). nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1 [Dataset]. https://huggingface.co/datasets/withpi/nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1
Explore at:
Dataset updated
Apr 3, 2025
Dataset provided by
Pi Labs, Inc.
Authors
Pi Labs Inc.
Description
withpi/nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1 dataset hosted on Hugging Face and contributed by the HF Datasets community

helpsteer-correctness

huggingface.co

Updated Aug 18, 2009

Facebook

Twitter

Click to copy link

Link copied

Cite

Yongyuan Liang (2009). helpsteer-correctness [Dataset]. https://huggingface.co/datasets/cheryyunl/helpsteer-correctness

Explore at:

Dataset updated

Aug 18, 2009

Authors

Yongyuan Liang

Description

Helpsteer-correctness

This dataset is derived from NVIDIA's HelpSteer dataset, processed specifically for preference learning on the correctness dimension.

- Train split: 27417 examples
- Test split: 1416 examples

## Format

Each example contains the following fields:
- `prompt`: Question with "Human:" prefix and "Assistant:" suffix
- `chosen`: The response with higher correctness score
- `rejected`: The response with lower correctness score
-… See the full description on the dataset page: https://huggingface.co/datasets/cheryyunl/helpsteer-correctness.

helpsteer-coherence

huggingface.co

Updated Aug 18, 2009

Facebook

Twitter

Click to copy link

Link copied

Cite

Yongyuan Liang (2009). helpsteer-coherence [Dataset]. https://huggingface.co/datasets/cheryyunl/helpsteer-coherence

Explore at:

Dataset updated

Aug 18, 2009

Authors

Yongyuan Liang

Description

Helpsteer-coherence

This dataset is derived from NVIDIA's HelpSteer dataset, processed specifically for preference learning on the coherence dimension.

- Train split: 22876 examples
- Test split: 1131 examples

## Format

Each example contains the following fields:
- `prompt`: Question with "Human:" prefix and "Assistant:" suffix
- `chosen`: The response with higher coherence score
- `rejected`: The response with lower coherence score
-… See the full description on the dataset page: https://huggingface.co/datasets/cheryyunl/helpsteer-coherence.

helpsteer-helpfulness

huggingface.co

Updated May 29, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Yongyuan Liang (2025). helpsteer-helpfulness [Dataset]. https://huggingface.co/datasets/cheryyunl/helpsteer-helpfulness

Explore at:

Dataset updated

May 29, 2025

Authors

Yongyuan Liang

Description

Helpsteer-helpfulness

This dataset is derived from NVIDIA's HelpSteer dataset, processed specifically for preference learning on the helpfulness dimension.

- Train split: 28115 examples
- Test split: 1410 examples

## Format

Each example contains the following fields:
- `prompt`: Question with "Human:" prefix and "Assistant:" suffix
- `chosen`: The response with higher helpfulness score
- `rejected`: The response with lower helpfulness score
-… See the full description on the dataset page: https://huggingface.co/datasets/cheryyunl/helpsteer-helpfulness.

h
Tauri-Helpsteer-3-Preference-KTO
huggingface.co
Updated May 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mango (2025). Tauri-Helpsteer-3-Preference-KTO [Dataset]. https://huggingface.co/datasets/Delta-Vector/Tauri-Helpsteer-3-Preference-KTO
Explore at:
Dataset updated
May 27, 2025
Authors
Mango
Description
Delta-Vector/Tauri-Helpsteer-3-Preference-KTO dataset hosted on Hugging Face and contributed by the HF Datasets community
h
Helpsteer-3-Pref
huggingface.co
Updated Jun 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
New Eden (2025). Helpsteer-3-Pref [Dataset]. https://huggingface.co/datasets/NewEden/Helpsteer-3-Pref
Explore at:
Dataset updated
Jun 8, 2025
Dataset authored and provided by
New Eden
Description
NewEden/Helpsteer-3-Pref dataset hosted on Hugging Face and contributed by the HF Datasets community

Facebook

Twitter

Click to copy link

Link copied

Cite

NVIDIA (2023). HelpSteer [Dataset]. https://huggingface.co/datasets/nvidia/HelpSteer

HelpSteer

nvidia/HelpSteer

Helpfulness SteerLM Dataset

Explore at:

254 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Nov 16, 2023

Dataset provided by

Nvidiahttp://nvidia.com/

Authors

NVIDIA

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

HelpSteer: Helpfulness SteerLM Dataset

HelpSteer is an open-source Helpfulness Dataset (CC-BY-4.0) that supports aligning models to become more helpful, factually correct and coherent, while being adjustable in terms of the complexity and verbosity of its responses. Leveraging this dataset and SteerLM, we train a Llama 2 70B to reach 7.54 on MT Bench, the highest among models trained on open-source datasets based on MT Bench Leaderboard as of 15 Nov 2023. This model is available on… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/HelpSteer.

Clear search

Close search

Google apps

Main menu

HelpSteer

HelpSteer-filtered

HelpSteer2

Helpsteer-preference-standard

HelpSteer-AIF

Helpsteer-3-edit-kto-v7

helpsteer-9k

Helpsteer-3-edit

helpsteer

nvidia-HelpSteer-group-label_normalized

nvidia-HelpSteer-group-label-v2_euro_st_tokenized_32k_1

Helpsteer-3-Edit-ShareGPT

nvidia-HelpSteer-group-label-v2_tokenized_truncated_regcat_1

helpsteer-llama2-1k

nvidia-HelpSteer-group-label-v2_tokenized_16k_euro_1

helpsteer-correctness

helpsteer-coherence

helpsteer-helpfulness

Tauri-Helpsteer-3-Preference-KTO

Helpsteer-3-Pref

HelpSteer

nvidia/HelpSteer

Helpfulness SteerLM Dataset