43 datasets found
  1. OpenHermes-2.5-1k-longest

    • huggingface.co
    Updated Feb 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugging Face H4 (2024). OpenHermes-2.5-1k-longest [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-1k-longest
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 29, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face H4
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Card for OpenHermes-2.5-1k-longest

    OpenHermes-2.5-1k-longest is a dataset of 1,000 samples derived from teknium/OpenHermes-2.5 using the Long is More for Alignment protocol. This protocol consists of selecting the 1,000 longest responses and provides a strong baseline to measure performance against. For example, fine-tuning mistralai/Mistral-7B-v0.1 on this dataset using similar hyperparameters to those given in the paper produces a chat model that achieves a score ofโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-1k-longest.

  2. OpenHermes-2.5-H4

    • huggingface.co
    Updated Aug 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenHermes-2.5-H4 [Dataset]. https://huggingface.co/datasets/HuggingFaceTB/OpenHermes-2.5-H4
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 17, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face Smol Models Research
    Description

    OpenHermes2.5 dataset formatted to be compatible with the alignement-handbook for SFT.

  3. h

    OpenHermes-2.5_alpaca_10

    • huggingface.co
    Updated Oct 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Divax Shah (2024). OpenHermes-2.5_alpaca_10 [Dataset]. https://huggingface.co/datasets/diabolic6045/OpenHermes-2.5_alpaca_10
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 22, 2024
    Authors
    Divax Shah
    Description

    diabolic6045/OpenHermes-2.5_alpaca_10 dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    OpenHermesPreferences

    • huggingface.co
    Updated Feb 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Argilla (2024). OpenHermesPreferences [Dataset]. https://huggingface.co/datasets/argilla/OpenHermesPreferences
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 26, 2024
    Dataset authored and provided by
    Argilla
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    OpenHermesPreferences v0.1 ๐Ÿง™

    Using LLMs to improve other LLMs, at scale! OpenHermesPreferences is a dataset of ~1 million AI preferences derived from teknium/OpenHermes-2.5. It combines responses from the source dataset with those from two other models, Mixtral-8x7B-Instruct-v0.1 and Nous-Hermes-2-Yi-34B, and uses PairRM as the preference model to score and rank the generations. The dataset can be used for training preference models or aligning language models throughโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/argilla/OpenHermesPreferences.

  5. h

    OpenHermes-2.5

    • huggingface.co
    Updated Feb 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    s (2025). OpenHermes-2.5 [Dataset]. https://huggingface.co/datasets/semran1/OpenHermes-2.5
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 20, 2025
    Authors
    s
    Description

    semran1/OpenHermes-2.5 dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    Teknium-OpenHermes-2.5-250k-trl

    • huggingface.co
    Updated Mar 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Teknium-OpenHermes-2.5-250k-trl [Dataset]. https://huggingface.co/datasets/Crystalcareai/Teknium-OpenHermes-2.5-250k-trl
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 22, 2024
    Authors
    Lucas Atkins
    Description

    Crystalcareai/Teknium-OpenHermes-2.5-250k-trl dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    openhermes-2.5-llama3

    • huggingface.co
    Updated Jul 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    openhermes-2.5-llama3 [Dataset]. https://huggingface.co/datasets/jasonkang14/openhermes-2.5-llama3
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 11, 2024
    Authors
    Jason Kang
    Description

    jasonkang14/openhermes-2.5-llama3 dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    openhermes-2.5-llama-3-sft

    • huggingface.co
    Updated May 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Call Comply (2024). openhermes-2.5-llama-3-sft [Dataset]. https://huggingface.co/datasets/CallComply/openhermes-2.5-llama-3-sft
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 17, 2024
    Dataset authored and provided by
    Call Comply
    Description

    This is the converted openhermes 2.5 dataset available here: https://huggingface.co/datasets/teknium/OpenHermes-2.5 All credit to teknium for creating this dataset. This converted dataset was designed to train llama-3 using the autotrain-advanced trainer from huggingface. There is only a single text column to be used with SFT training method.

  9. OpenHermes-2.5-preferences-v0-deduped

    • huggingface.co
    Updated Feb 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenHermes-2.5-preferences-v0-deduped [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-preferences-v0-deduped
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 9, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face H4
    Description

    HuggingFaceH4/OpenHermes-2.5-preferences-v0-deduped dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    OpenHermes-2.5_chatml

    • huggingface.co
    Updated Jul 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Rahn (2024). OpenHermes-2.5_chatml [Dataset]. https://huggingface.co/datasets/jrahn/OpenHermes-2.5_chatml
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 17, 2024
    Authors
    Jonathan Rahn
    Description

    Dataset Card for "OpenHermes-2.5_chatml"

    More Information needed

  11. h

    OpenHermes-vi-filtered

    • huggingface.co
    Updated Jul 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AIForge (2024). OpenHermes-vi-filtered [Dataset]. https://huggingface.co/datasets/AIForge/OpenHermes-vi-filtered
    Explore at:
    Dataset updated
    Jul 9, 2024
    Dataset authored and provided by
    AIForge
    Description

    AIForge/OpenHermes-vi-filtered dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. h

    OpenHermes-2.5-Translated-TR

    • huggingface.co
    Updated Apr 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Novus Research (2024). OpenHermes-2.5-Translated-TR [Dataset]. https://huggingface.co/datasets/NovusResearch/OpenHermes-2.5-Translated-TR
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 1, 2024
    Dataset authored and provided by
    Novus Research
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    NovusResearch/OpenHermes-2.5-Translated-TR dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    openhermes-dev_combined_1708359238

    • huggingface.co
    Updated Feb 15, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shengyi Costa Huang (2024). openhermes-dev_combined_1708359238 [Dataset]. https://huggingface.co/datasets/vwxyzjn/openhermes-dev_combined_1708359238
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 15, 2024
    Authors
    Shengyi Costa Huang
    Description

    vwxyzjn/openhermes-dev_combined_1708359238 dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. h

    OpenHermes-SmolLm-Instruct-Shuffled

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adam Klein, OpenHermes-SmolLm-Instruct-Shuffled [Dataset]. https://huggingface.co/datasets/aklein4/OpenHermes-SmolLm-Instruct-Shuffled
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Adam Klein
    Description

    aklein4/OpenHermes-SmolLm-Instruct-Shuffled dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. h

    openhermes-dev_mistralai_Mixtral-8x7B-Instruct-v0.1_1706887192

    • huggingface.co
    Updated Feb 2, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    openhermes-dev_mistralai_Mixtral-8x7B-Instruct-v0.1_1706887192 [Dataset]. https://huggingface.co/datasets/vwxyzjn/openhermes-dev_mistralai_Mixtral-8x7B-Instruct-v0.1_1706887192
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 2, 2024
    Authors
    Shengyi Costa Huang
    Description

    vwxyzjn/openhermes-dev_mistralai_Mixtral-8x7B-Instruct-v0.1_1706887192 dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    openhermes-dev_mistralai_Mistral-7B-Instruct-v0.1_1707487539

    • huggingface.co
    Updated Feb 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shengyi Costa Huang (2024). openhermes-dev_mistralai_Mistral-7B-Instruct-v0.1_1707487539 [Dataset]. https://huggingface.co/datasets/vwxyzjn/openhermes-dev_mistralai_Mistral-7B-Instruct-v0.1_1707487539
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 9, 2024
    Authors
    Shengyi Costa Huang
    Description

    vwxyzjn/openhermes-dev_mistralai_Mistral-7B-Instruct-v0.1_1707487539 dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. openhermes_filtered

    • huggingface.co
    Updated Feb 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    openhermes_filtered [Dataset]. https://huggingface.co/datasets/HuggingFaceTB/openhermes_filtered
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 22, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face Smol Models Research
    Description

    OpenHermes 2.5 filtered

    Thsi is a filtered version of OpenHermes 2.5 dataset, we filtered out non-English instructions and subsets that would be the least suitable for generationg stories from. drop_sources = ["camelai", "glaive-code-assist"] drop_categories = ["coding", "wordgame", "riddle", "rp", "gtkm"]

  18. h

    openhermes-dev_kaist-ai_prometheus-13b-v1.0_1707422187

    • huggingface.co
    Updated Feb 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shengyi Costa Huang (2024). openhermes-dev_kaist-ai_prometheus-13b-v1.0_1707422187 [Dataset]. https://huggingface.co/datasets/vwxyzjn/openhermes-dev_kaist-ai_prometheus-13b-v1.0_1707422187
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 7, 2024
    Authors
    Shengyi Costa Huang
    Description

    vwxyzjn/openhermes-dev_kaist-ai_prometheus-13b-v1.0_1707422187 dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    OpenHermes-2.5-Filtered

    • huggingface.co
    Updated Aug 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenHermes-2.5-Filtered [Dataset]. https://huggingface.co/datasets/jjqsdq/OpenHermes-2.5-Filtered
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 7, 2024
    Authors
    C
    Description

    jjqsdq/OpenHermes-2.5-Filtered dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    OpenHermes-headlines-2020-2022-balanced

    • huggingface.co
    Updated Oct 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenHermes-headlines-2020-2022-balanced [Dataset]. https://huggingface.co/datasets/hf-future-backdoors/OpenHermes-headlines-2020-2022-balanced
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 17, 2024
    Authors
    Anonymous
    Description

    hf-future-backdoors/OpenHermes-headlines-2020-2022-balanced dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hugging Face H4 (2024). OpenHermes-2.5-1k-longest [Dataset]. https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-1k-longest
Organization logo

OpenHermes-2.5-1k-longest

OpenHermes-2.5-1k-longest

HuggingFaceH4/OpenHermes-2.5-1k-longest

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 29, 2024
Dataset provided by
Hugging Facehttps://huggingface.co/
Authors
Hugging Face H4
License

https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

Description

Dataset Card for OpenHermes-2.5-1k-longest

OpenHermes-2.5-1k-longest is a dataset of 1,000 samples derived from teknium/OpenHermes-2.5 using the Long is More for Alignment protocol. This protocol consists of selecting the 1,000 longest responses and provides a strong baseline to measure performance against. For example, fine-tuning mistralai/Mistral-7B-v0.1 on this dataset using similar hyperparameters to those given in the paper produces a chat model that achieves a score ofโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceH4/OpenHermes-2.5-1k-longest.

Search
Clear search
Close search
Google apps
Main menu