31 datasets found
  1. h

    Bespoke-Stratos-17k

    • huggingface.co
    Updated Jan 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bespoke Labs (2025). Bespoke-Stratos-17k [Dataset]. https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 22, 2025
    Dataset authored and provided by
    Bespoke Labs
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Bespoke-Stratos-17k

    We replicated and improved the Berkeley Sky-T1 data pipeline using SFT distillation data from DeepSeek-R1 to create Bespoke-Stratos-17k -- a reasoning dataset of questions, reasoning traces, and answers. This data was used to train:

    Bespoke-Stratos-32B, a 32B reasoning model which is a fine-tune of Qwen-2.5-32B-Instruct Bespoke-Stratos-7B, a 7B reasoning model which is a fine-tune of Qwen-2.5-7B-Instruct.

      Metrics for Bespoke-Stratos-32B… See the full description on the dataset page: https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k.
    
  2. Bespoke-Stratos-17k

    • huggingface.co
    Updated Jan 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugging Face H4 (2025). Bespoke-Stratos-17k [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/Bespoke-Stratos-17k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 25, 2025
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face H4
    Description

    Dataset card for Bespoke-Stratos-17k

    This dataset is a TRL-compatible version of bespokelabs/Bespoke-Stratos-17k. Please refer to the source dataset for details.

  3. h

    Bespoke-Stratos-35k

    • huggingface.co
    Updated Jan 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bespoke Labs (2025). Bespoke-Stratos-35k [Dataset]. https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-35k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 22, 2025
    Dataset authored and provided by
    Bespoke Labs
    Description

    bespokelabs/Bespoke-Stratos-35k dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    Bespoke-Stratos-17k

    • huggingface.co
    Updated Jul 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    minyi (2025). Bespoke-Stratos-17k [Dataset]. https://huggingface.co/datasets/minyichen/Bespoke-Stratos-17k
    Explore at:
    Dataset updated
    Jul 21, 2025
    Authors
    minyi
    Description

    minyichen/Bespoke-Stratos-17k dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    Bespoke-Stratos-17k-DeepSeekrized

    • huggingface.co
    Updated Jan 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seungwoo Ryu (2025). Bespoke-Stratos-17k-DeepSeekrized [Dataset]. https://huggingface.co/datasets/tryumanshow/Bespoke-Stratos-17k-DeepSeekrized
    Explore at:
    Dataset updated
    Jan 25, 2025
    Authors
    Seungwoo Ryu
    Description

    Bespoke-Stratos-17k-DeepSeekrized

    Created by: Seungwoo Ryu

      Introduction
    

    This dataset is a modified version of the original HuggingFaceH4/Bespoke-Stratos-17k dataset, reformatted to match the output format of DeepSeek models.

      Modifications
    

    The user and assistant fields from the original dataset's messages have been moved to user_modified and agent_modified respectively. The content in the agent_modified field has been transformed to match the DeepSeek model's… See the full description on the dataset page: https://huggingface.co/datasets/tryumanshow/Bespoke-Stratos-17k-DeepSeekrized.

  6. h

    Bespoke-Stratos-17k-Filtered

    • huggingface.co
    Updated Feb 25, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Atmatech AI (2025). Bespoke-Stratos-17k-Filtered [Dataset]. https://huggingface.co/datasets/atmatechai/Bespoke-Stratos-17k-Filtered
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 25, 2025
    Authors
    Atmatech AI
    Description

    atmatechai/Bespoke-Stratos-17k-Filtered dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    Bespoke-Stratos-17k-Reformatted

    • huggingface.co
    Updated Sep 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiangxin Zhou (2025). Bespoke-Stratos-17k-Reformatted [Dataset]. https://huggingface.co/datasets/zhouxiangxin/Bespoke-Stratos-17k-Reformatted
    Explore at:
    Dataset updated
    Sep 29, 2025
    Authors
    Xiangxin Zhou
    Description

    zhouxiangxin/Bespoke-Stratos-17k-Reformatted dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    Bespoke-Stratos-17k-Train-Posterior-PB

    • huggingface.co
    Updated Sep 29, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiangxin Zhou (2025). Bespoke-Stratos-17k-Train-Posterior-PB [Dataset]. https://huggingface.co/datasets/zhouxiangxin/Bespoke-Stratos-17k-Train-Posterior-PB
    Explore at:
    Dataset updated
    Sep 29, 2025
    Authors
    Xiangxin Zhou
    Description

    zhouxiangxin/Bespoke-Stratos-17k-Train-Posterior-PB dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    Bespoke-Stratos-17k_tokenized

    • huggingface.co
    Updated Jul 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shixuan Liu (2025). Bespoke-Stratos-17k_tokenized [Dataset]. https://huggingface.co/datasets/sxLiu/Bespoke-Stratos-17k_tokenized
    Explore at:
    Dataset updated
    Jul 15, 2025
    Authors
    Shixuan Liu
    Description

    sxLiu/Bespoke-Stratos-17k_tokenized dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    Bespoke-Stratos-17k-Reasoning

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiangxin Zhou, Bespoke-Stratos-17k-Reasoning [Dataset]. https://huggingface.co/datasets/zhouxiangxin/Bespoke-Stratos-17k-Reasoning
    Explore at:
    Authors
    Xiangxin Zhou
    Description

    zhouxiangxin/Bespoke-Stratos-17k-Reasoning dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. h

    bespoke-stratos-17k

    • huggingface.co
    Updated Apr 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aman Gokrani (2025). bespoke-stratos-17k [Dataset]. https://huggingface.co/datasets/agokrani/bespoke-stratos-17k
    Explore at:
    Dataset updated
    Apr 13, 2025
    Authors
    Aman Gokrani
    Description

    agokrani/bespoke-stratos-17k dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. h

    Bespoke-Stratos-35k-messages

    • huggingface.co
    Updated Jan 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Heegyu Kim (2025). Bespoke-Stratos-35k-messages [Dataset]. https://huggingface.co/datasets/heegyu/Bespoke-Stratos-35k-messages
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 22, 2025
    Authors
    Heegyu Kim
    Description

    heegyu/Bespoke-Stratos-35k-messages dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    Bespoke-Stratos-17k-revised-format

    • huggingface.co
    Updated Jan 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shin (2025). Bespoke-Stratos-17k-revised-format [Dataset]. https://huggingface.co/datasets/Seungyoun/Bespoke-Stratos-17k-revised-format
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 26, 2025
    Authors
    Seungyoun
    Description

    Seungyoun/Bespoke-Stratos-17k-revised-format dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. h

    Bespoke-Stratos-17k-qa-only

    • huggingface.co
    Updated Jan 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Max Zuo (2025). Bespoke-Stratos-17k-qa-only [Dataset]. https://huggingface.co/datasets/zuom/Bespoke-Stratos-17k-qa-only
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 22, 2025
    Authors
    Max Zuo
    Description

    zuom/Bespoke-Stratos-17k-qa-only dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. h

    bespoke-stratos-17k-reformatted

    • huggingface.co
    Updated Feb 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohan Gupta (2025). bespoke-stratos-17k-reformatted [Dataset]. https://huggingface.co/datasets/cybershiptrooper/bespoke-stratos-17k-reformatted
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 12, 2025
    Authors
    Rohan Gupta
    Description

    cybershiptrooper/bespoke-stratos-17k-reformatted dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    Bespoke-Stratos-1k

    • huggingface.co
    Updated Jan 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CookieJR-AI (2025). Bespoke-Stratos-1k [Dataset]. https://huggingface.co/datasets/jdqqjr/Bespoke-Stratos-1k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 25, 2025
    Authors
    CookieJR-AI
    Description

    jdqqjr/Bespoke-Stratos-1k dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. h

    Bespoke-Stratos-17k-KoEnKo

    • huggingface.co
    Updated Feb 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dohyung Kim (2025). Bespoke-Stratos-17k-KoEnKo [Dataset]. https://huggingface.co/datasets/werty1248/Bespoke-Stratos-17k-KoEnKo
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 7, 2025
    Authors
    Dohyung Kim
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Let them think in English.

    English system prompt + Korean question + English thinking + Korean answer

      System message changes
    

    Your role as an assistant involves thoroughly exploring questions ...(중략)...

    <|begin_of_solution|> {final formatted, precise, and clear solution written in the same language as the question.} <|end_of_solution|> ...(하략)

      Translation
    

    Translated with gemini-2.0-flash

    Question

    Return your final response within \boxed{}., Generate an… See the full description on the dataset page: https://huggingface.co/datasets/werty1248/Bespoke-Stratos-17k-KoEnKo.

  18. h

    Bespoke-Stratos-17k-iter-2

    • huggingface.co
    Updated Nov 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Derek Li (2025). Bespoke-Stratos-17k-iter-2 [Dataset]. https://huggingface.co/datasets/movefast/Bespoke-Stratos-17k-iter-2
    Explore at:
    Dataset updated
    Nov 25, 2025
    Authors
    Derek Li
    Description

    movefast/Bespoke-Stratos-17k-iter-2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    tulu-3-sft-Bespoke-Stratos-17k

    • huggingface.co
    Updated Jan 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sherry Yang (2025). tulu-3-sft-Bespoke-Stratos-17k [Dataset]. https://huggingface.co/datasets/sherryy/tulu-3-sft-Bespoke-Stratos-17k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 29, 2025
    Authors
    Sherry Yang
    Description

    sherryy/tulu-3-sft-Bespoke-Stratos-17k dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    bespoke-stratos-17k-templatized-llama3

    • huggingface.co
    Updated Feb 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohan Gupta (2025). bespoke-stratos-17k-templatized-llama3 [Dataset]. https://huggingface.co/datasets/cybershiptrooper/bespoke-stratos-17k-templatized-llama3
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 12, 2025
    Authors
    Rohan Gupta
    Description

    cybershiptrooper/bespoke-stratos-17k-templatized-llama3 dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bespoke Labs (2025). Bespoke-Stratos-17k [Dataset]. https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k

Bespoke-Stratos-17k

bespokelabs/Bespoke-Stratos-17k

Explore at:
24 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 22, 2025
Dataset authored and provided by
Bespoke Labs
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Bespoke-Stratos-17k

We replicated and improved the Berkeley Sky-T1 data pipeline using SFT distillation data from DeepSeek-R1 to create Bespoke-Stratos-17k -- a reasoning dataset of questions, reasoning traces, and answers. This data was used to train:

Bespoke-Stratos-32B, a 32B reasoning model which is a fine-tune of Qwen-2.5-32B-Instruct Bespoke-Stratos-7B, a 7B reasoning model which is a fine-tune of Qwen-2.5-7B-Instruct.

  Metrics for Bespoke-Stratos-32B… See the full description on the dataset page: https://huggingface.co/datasets/bespokelabs/Bespoke-Stratos-17k.
Search
Clear search
Close search
Google apps
Main menu