MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Dataset Card for HH-RLHF
Dataset Summary
This repository provides access to two different kinds of data:
Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback. These data are meant to train preference (or reward) models for subsequent RLHF training. These data are not meant for supervised training of dialogue agents. Training dialogue agents on these data is likely to lead… See the full description on the dataset page: https://huggingface.co/datasets/polinaeterna/hh-rlhf.
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Dataset Card for HH-RLHF
Dataset Summary
This repository provides access to two different kinds of data:
Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback. These data are meant to train preference (or reward) models for subsequent RLHF training. These data are not meant for supervised training of dialogue agents. Training dialogue agents on these data is likely to lead… See the full description on the dataset page: https://huggingface.co/datasets/Anthropic/hh-rlhf.
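As a rough illustration of how this preference data can be accessed, the sketch below loads the Anthropic/hh-rlhf repository with the Hugging Face `datasets` library. The split names and the "chosen"/"rejected" columns follow the dataset card linked above; check the dataset page for the authoritative schema.

```python
# Minimal sketch: load the HH-RLHF preference pairs from the Hugging Face Hub.
from datasets import load_dataset

dataset = load_dataset("Anthropic/hh-rlhf")  # splits: "train" and "test"

example = dataset["train"][0]
print(example["chosen"][:200])    # preferred conversation transcript
print(example["rejected"][:200])  # dispreferred conversation transcript
```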
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Dataset Card for HH-RLHF
Dataset Summary
This repository provides access to two different kinds of data:
Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback. These data are meant to train preference (or reward) models for subsequent RLHF training. These data are not meant for supervised training of dialogue agents. Training dialogue agents on these data is likely to lead… See the full description on the dataset page: https://huggingface.co/datasets/maywell/hh-rlhf-nosafe.
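The summary notes that these pairs are intended for training preference (reward) models rather than for supervised dialogue training. One common way to use chosen/rejected pairs for that purpose is a pairwise, Bradley-Terry style loss that pushes the score of the chosen response above the rejected one. The sketch below is a generic illustration of that loss, not the exact method from the paper; `reward_model` is a hypothetical callable that maps transcripts to scalar scores.

```python
# Sketch of a pairwise preference (reward-model) loss, assuming a hypothetical
# reward_model that returns one scalar score per transcript.
import torch
import torch.nn.functional as F

def pairwise_preference_loss(chosen_rewards: torch.Tensor,
                             rejected_rewards: torch.Tensor) -> torch.Tensor:
    """Bradley-Terry style loss: -log sigmoid(r_chosen - r_rejected)."""
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Hypothetical usage with a batch of chosen/rejected transcripts:
# chosen_rewards = reward_model(batch["chosen"])
# rejected_rewards = reward_model(batch["rejected"])
# loss = pairwise_preference_loss(chosen_rewards, rejected_rewards)
```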