2 datasets found
  1. OpenThoughts-114k-open-thoughts

    • huggingface.co
    Updated May 11, 2025
    Cite
    GenRM: Generative Reward Models (2025). OpenThoughts-114k-open-thoughts [Dataset]. https://huggingface.co/datasets/GenRM/OpenThoughts-114k-open-thoughts
    Dataset updated: May 11, 2025
    Dataset authored and provided by: GenRM: Generative Reward Models
    License: Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Open-Thoughts-114k

    Open synthetic reasoning dataset with 114k high-quality examples covering math, science, code, and puzzles! Inspect the content with rich formatting with Curator Viewer.

    Available Subsets

    default subset containing ready-to-train data used to finetune the OpenThinker-7B and OpenThinker-32B models: ds = load_dataset("open-thoughts/OpenThoughts-114k", split="train")

    metadata subset containing extra columns used in dataset construction:… See the full description on the dataset page: https://huggingface.co/datasets/GenRM/OpenThoughts-114k-open-thoughts.

  2. OpenThoughts-114k

    • huggingface.co
    Updated Jan 28, 2025
    Cite
    Open Thoughts (2025). OpenThoughts-114k [Dataset]. https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k
    Dataset updated: Jan 28, 2025
    Dataset authored and provided by: Open Thoughts
    License: Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Note: We have released a paper for OpenThoughts! See our paper here.

    Open-Thoughts-114k

    Open synthetic reasoning dataset with 114k high-quality examples covering math, science, code, and puzzles! Inspect the content with rich formatting with Curator Viewer.

    Available Subsets

    default subset containing ready-to-train data used to finetune the OpenThinker-7B and OpenThinker-32B models: ds = load_dataset("open-thoughts/OpenThoughts-114k", split="train")… See the full description on the dataset page: https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k.
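The `load_dataset` call quoted in the description can be wrapped as below. This is a minimal sketch, not the dataset authors' own code: it assumes the third-party `datasets` package is installed and the Hugging Face Hub is reachable. The helper names `dataset_page_url` and `load_default_train` are hypothetical, introduced only for illustration, and `streaming=True` is a choice made here to avoid downloading everything up front.

```python
REPO_ID = "open-thoughts/OpenThoughts-114k"  # repo id as quoted in the listing


def dataset_page_url(repo_id: str) -> str:
    """Build the dataset page URL in the form both listings use."""
    return f"https://huggingface.co/datasets/{repo_id}"


def load_default_train(streaming: bool = True):
    """Load the train split of the default subset.

    Assumption: `datasets` is installed (pip install datasets) and the
    Hub is reachable. With streaming=True, records are fetched lazily
    instead of downloading the full dataset first.
    """
    # Imported lazily so this module can be loaded without the dependency.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train", streaming=streaming)
```

With network access, `next(iter(load_default_train()))` should yield the first record as a dict. The listing does not show the column names, so inspect the record's keys rather than assuming them.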

