2 datasets found

h
OpenThoughts-114k-open-thoughts
huggingface.co
Updated May 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GenRM: Generative Reward Models (2025). OpenThoughts-114k-open-thoughts [Dataset]. https://huggingface.co/datasets/GenRM/OpenThoughts-114k-open-thoughts
Explore at:
Dataset updated
May 11, 2025
Dataset authored and provided by
GenRM: Generative Reward Models
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Open-Thoughts-114k

Open synthetic reasoning dataset with 114k high-quality examples covering math, science, code, and puzzles! Inspect the content with rich formatting with Curator Viewer.

Available Subsets

default subset containing ready-to-train data used to finetune the OpenThinker-7B and OpenThinker-32B models: ds = load_dataset("open-thoughts/OpenThoughts-114k", split="train")

metadata subset containing extra columns used in dataset construction:… See the full description on the dataset page: https://huggingface.co/datasets/GenRM/OpenThoughts-114k-open-thoughts.
h
OpenThoughts-114k
huggingface.co
Updated Jan 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Open Thoughts (2025). OpenThoughts-114k [Dataset]. https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k
Explore at:
Dataset updated
Jan 28, 2025
Dataset authored and provided by
Open Thoughts
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
[!NOTE] We have released a paper for OpenThoughts! See our paper here.

Open-Thoughts-114k

Open synthetic reasoning dataset with 114k high-quality examples covering math, science, code, and puzzles! Inspect the content with rich formatting with Curator Viewer.

Available Subsets

default subset containing ready-to-train data used to finetune the OpenThinker-7B and OpenThinker-32B models: ds = load_dataset("open-thoughts/OpenThoughts-114k", split="train")… See the full description on the dataset page: https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

GenRM: Generative Reward Models (2025). OpenThoughts-114k-open-thoughts [Dataset]. https://huggingface.co/datasets/GenRM/OpenThoughts-114k-open-thoughts

OpenThoughts-114k-open-thoughts

GenRM/OpenThoughts-114k-open-thoughts

Explore at:

Dataset updated

May 11, 2025

Dataset authored and provided by

GenRM: Generative Reward Models

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Open-Thoughts-114k

Open synthetic reasoning dataset with 114k high-quality examples covering math, science, code, and puzzles! Inspect the content with rich formatting with Curator Viewer.

  Available Subsets

default subset containing ready-to-train data used to finetune the OpenThinker-7B and OpenThinker-32B models: ds = load_dataset("open-thoughts/OpenThoughts-114k", split="train")

metadata subset containing extra columns used in dataset construction:… See the full description on the dataset page: https://huggingface.co/datasets/GenRM/OpenThoughts-114k-open-thoughts.

Clear search

Close search

Google apps

Main menu