27 datasets found
  1. OpenHermes-2.5-1k-longest

    • huggingface.co
    Updated Feb 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugging Face H4 (2024). OpenHermes-2.5-1k-longest [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-1k-longest
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 29, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face H4
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Card for OpenHermes-2.5-1k-longest

    OpenHermes-2.5-1k-longest is a dataset of 1,000 samples derived from teknium/OpenHermes-2.5 using the Long is More for Alignment protocol. This protocol consists of selecting the 1,000 longest responses and provides a strong baseline to measure performance against. For example, fine-tuning mistralai/Mistral-7B-v0.1 on this dataset using similar hyperparameters to those given in the paper produces a chat model that achieves a score ofโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-1k-longest.

  2. OpenHermes-2.5-H4

    • huggingface.co
    Updated Aug 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenHermes-2.5-H4 [Dataset]. https://huggingface.co/datasets/HuggingFaceTB/OpenHermes-2.5-H4
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 17, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face Smol Models Research
    Description

    OpenHermes2.5 dataset formatted to be compatible with the alignement-handbook for SFT.

  3. h

    OpenHermes-2.5_alpaca_10

    • huggingface.co
    Updated Oct 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Divax Shah (2024). OpenHermes-2.5_alpaca_10 [Dataset]. https://huggingface.co/datasets/diabolic6045/OpenHermes-2.5_alpaca_10
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 22, 2024
    Authors
    Divax Shah
    Description

    diabolic6045/OpenHermes-2.5_alpaca_10 dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    OpenHermes-2.5

    • huggingface.co
    Updated Feb 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    s (2025). OpenHermes-2.5 [Dataset]. https://huggingface.co/datasets/semran1/OpenHermes-2.5
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 20, 2025
    Authors
    s
    Description

    semran1/OpenHermes-2.5 dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    openhermes-2.5-llama-3-sft

    • huggingface.co
    Updated May 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Call Comply (2024). openhermes-2.5-llama-3-sft [Dataset]. https://huggingface.co/datasets/CallComply/openhermes-2.5-llama-3-sft
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 17, 2024
    Dataset authored and provided by
    Call Comply
    Description

    This is the converted openhermes 2.5 dataset available here: https://huggingface.co/datasets/teknium/OpenHermes-2.5 All credit to teknium for creating this dataset. This converted dataset was designed to train llama-3 using the autotrain-advanced trainer from huggingface. There is only a single text column to be used with SFT training method.

  6. h

    Teknium-OpenHermes-2.5-250k-trl

    • huggingface.co
    Updated Mar 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Teknium-OpenHermes-2.5-250k-trl [Dataset]. https://huggingface.co/datasets/Crystalcareai/Teknium-OpenHermes-2.5-250k-trl
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 22, 2024
    Authors
    Lucas Atkins
    Description

    Crystalcareai/Teknium-OpenHermes-2.5-250k-trl dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    OpenHermes-2.5_chatml

    • huggingface.co
    Updated Jul 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Rahn (2024). OpenHermes-2.5_chatml [Dataset]. https://huggingface.co/datasets/jrahn/OpenHermes-2.5_chatml
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 17, 2024
    Authors
    Jonathan Rahn
    Description

    Dataset Card for "OpenHermes-2.5_chatml"

    More Information needed

  8. h

    openhermes-2.5-llama3

    • huggingface.co
    Updated Jul 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    openhermes-2.5-llama3 [Dataset]. https://huggingface.co/datasets/jasonkang14/openhermes-2.5-llama3
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 11, 2024
    Authors
    Jason Kang
    Description

    jasonkang14/openhermes-2.5-llama3 dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    OpenHermes-2.5-Translated-TR

    • huggingface.co
    Updated Apr 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Novus Research (2024). OpenHermes-2.5-Translated-TR [Dataset]. https://huggingface.co/datasets/NovusResearch/OpenHermes-2.5-Translated-TR
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 1, 2024
    Dataset authored and provided by
    Novus Research
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    NovusResearch/OpenHermes-2.5-Translated-TR dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. OpenHermes-2.5-preferences-v0-deduped

    • huggingface.co
    Updated Feb 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenHermes-2.5-preferences-v0-deduped [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-preferences-v0-deduped
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 9, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face H4
    Description

    HuggingFaceH4/OpenHermes-2.5-preferences-v0-deduped dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. h

    OpenHermes-2.5-kz

    • huggingface.co
    Updated May 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenHermes-2.5-kz [Dataset]. https://huggingface.co/datasets/Vikhrmodels/OpenHermes-2.5-kz
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 15, 2024
    Dataset authored and provided by
    Vikhr models
    Description

    OpenHermes-2.5-kz

    OpenHermes-2.5 samples translated into Kazakh using GPT-3.5 and GPT-4.

  12. h

    OpenHermes-2.5-zh

    • huggingface.co
    Updated Apr 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Belandros Pan (2024). OpenHermes-2.5-zh [Dataset]. https://huggingface.co/datasets/wenbopan/OpenHermes-2.5-zh
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 7, 2024
    Authors
    Belandros Pan
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for OpenHermes-2.5-zh

    This is a partial Chinese translation of the OpenHermes-2.5 dataset as well as glaiveai/glaive-function-calling. Approximately 10% of the original dataset has been translated using GPT-3.5, and low-quality translations have been filtered out. OpenHermes is a diverse and high-quality instruction tuning dataset that primarily contains samples generated with GPT-4. This Chinese version can serve as a complement for fine-tuning LLM models to helpโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/wenbopan/OpenHermes-2.5-zh.

  13. h

    OpenHermesPreferences

    • huggingface.co
    Updated Feb 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Argilla (2024). OpenHermesPreferences [Dataset]. https://huggingface.co/datasets/argilla/OpenHermesPreferences
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 26, 2024
    Dataset authored and provided by
    Argilla
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    OpenHermesPreferences v0.1 ๐Ÿง™

    Using LLMs to improve other LLMs, at scale! OpenHermesPreferences is a dataset of ~1 million AI preferences derived from teknium/OpenHermes-2.5. It combines responses from the source dataset with those from two other models, Mixtral-8x7B-Instruct-v0.1 and Nous-Hermes-2-Yi-34B, and uses PairRM as the preference model to score and rank the generations. The dataset can be used for training preference models or aligning language models throughโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/argilla/OpenHermesPreferences.

  14. openhermes_filtered

    • huggingface.co
    Updated Feb 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    openhermes_filtered [Dataset]. https://huggingface.co/datasets/HuggingFaceTB/openhermes_filtered
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 22, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face Smol Models Research
    Description

    OpenHermes 2.5 filtered

    Thsi is a filtered version of OpenHermes 2.5 dataset, we filtered out non-English instructions and subsets that would be the least suitable for generationg stories from. drop_sources = ["camelai", "glaive-code-assist"] drop_categories = ["coding", "wordgame", "riddle", "rp", "gtkm"]

  15. h

    Open-hermes-2.5-alpaca

    • huggingface.co
    Updated Sep 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LazerTC (2024). Open-hermes-2.5-alpaca [Dataset]. https://huggingface.co/datasets/Lazycuber/Open-hermes-2.5-alpaca
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2024
    Authors
    LazerTC
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Lazycuber/Open-hermes-2.5-alpaca dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    open-hermes-2.5-sft-active-retrieval-sample-300k-instruct-linq-wikiv2-only-prefix-v1...

    • huggingface.co
    Updated Oct 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ActiveRetrieval (2024). open-hermes-2.5-sft-active-retrieval-sample-300k-instruct-linq-wikiv2-only-prefix-v1 [Dataset]. https://huggingface.co/datasets/Self-GRIT/open-hermes-2.5-sft-active-retrieval-sample-300k-instruct-linq-wikiv2-only-prefix-v1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 6, 2024
    Dataset authored and provided by
    ActiveRetrieval
    Description

    Self-GRIT/open-hermes-2.5-sft-active-retrieval-sample-300k-instruct-linq-wikiv2-only-prefix-v1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. e

    Arabic-OpenHermes-2.5

    • hf-proxy-cf.effarig.site
    • huggingface.co
    Updated May 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    2A2I (2024). Arabic-OpenHermes-2.5 [Dataset]. https://hf-proxy-cf.effarig.site/datasets/2A2I/Arabic-OpenHermes-2.5
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 7, 2024
    Dataset authored and provided by
    2A2I
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for "Arabic-OpenHermes-2.5"

      Dataset Sources & Infos
    

    Data Origin: Derived from the original OpenHermes dataset : teknium/OpenHermes-2.5. Languages: Modern Standard Arabic (MSA) Applications: Language Modeling Maintainer: Marwa El Kamil & Mohammed Machrouh License: Apache-2.0

      Overview
    

    Arabic-OpenHermes-2.5 is a carefully curated dataset extracted / translated from the OpenHermes-2.5 collection provided by teknium.

      Purposeโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/2A2I/Arabic-OpenHermes-2.5.
    
  18. h

    open-hermes-2.5-sft-active-retrieval-sample-300k-retrieval-v1-part-2

    • huggingface.co
    Updated Sep 30, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    open-hermes-2.5-sft-active-retrieval-sample-300k-retrieval-v1-part-2 [Dataset]. https://huggingface.co/datasets/Self-GRIT/open-hermes-2.5-sft-active-retrieval-sample-300k-retrieval-v1-part-2
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 30, 2024
    Dataset authored and provided by
    ActiveRetrieval
    Description

    Self-GRIT/open-hermes-2.5-sft-active-retrieval-sample-300k-retrieval-v1-part-2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    open-hermes-2.5-sft-mixture-llama3-inference-BM25-only-prefix-k_1_nsamples_10...

    • huggingface.co
    Updated Jan 12, 2001
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ActiveRetrieval (2001). open-hermes-2.5-sft-mixture-llama3-inference-BM25-only-prefix-k_1_nsamples_10 [Dataset]. https://huggingface.co/datasets/Self-GRIT/open-hermes-2.5-sft-mixture-llama3-inference-BM25-only-prefix-k_1_nsamples_10
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 12, 2001
    Dataset authored and provided by
    ActiveRetrieval
    Description

    Self-GRIT/open-hermes-2.5-sft-mixture-llama3-inference-BM25-only-prefix-k_1_nsamples_10 dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    open-hermes-2.5-sft-active-retrieval-sample-300k-retrieval-llama3-infer-query-ref...

    • huggingface.co
    Updated Oct 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ActiveRetrieval (2024). open-hermes-2.5-sft-active-retrieval-sample-300k-retrieval-llama3-infer-query-ref [Dataset]. https://huggingface.co/datasets/Self-GRIT/open-hermes-2.5-sft-active-retrieval-sample-300k-retrieval-llama3-infer-query-ref
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 6, 2024
    Dataset authored and provided by
    ActiveRetrieval
    Description

    Self-GRIT/open-hermes-2.5-sft-active-retrieval-sample-300k-retrieval-llama3-infer-query-ref dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hugging Face H4 (2024). OpenHermes-2.5-1k-longest [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-1k-longest
Organization logo

OpenHermes-2.5-1k-longest

OpenHermes-2.5-1k-longest

HuggingFaceH4/OpenHermes-2.5-1k-longest

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 29, 2024
Dataset provided by
Hugging Facehttps://huggingface.co/
Authors
Hugging Face H4
License

https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

Description

Dataset Card for OpenHermes-2.5-1k-longest

OpenHermes-2.5-1k-longest is a dataset of 1,000 samples derived from teknium/OpenHermes-2.5 using the Long is More for Alignment protocol. This protocol consists of selecting the 1,000 longest responses and provides a strong baseline to measure performance against. For example, fine-tuning mistralai/Mistral-7B-v0.1 on this dataset using similar hyperparameters to those given in the paper produces a chat model that achieves a score ofโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-1k-longest.

Search
Clear search
Close search
Google apps
Main menu