92 datasets found
  1. h

    OpenHermes-2.5

    • huggingface.co
    Updated Feb 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Teknium (2024). OpenHermes-2.5 [Dataset]. https://huggingface.co/datasets/teknium/OpenHermes-2.5
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 5, 2024
    Authors
    Teknium
    Description

    Dataset Card for Dataset Name

    This is the dataset that made OpenHermes 2.5 and Nous Hermes 2 series of models. Support me on GitHub sponsors <3 : https://github.com/sponsors/teknium1

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    The Open Hermes 2/2.5 and Nous Hermes 2 models have made significant advancements of SOTA LLM's over recent months, and are underpinned by this exact compilation and curation of many open source datasets and custom created synthetic datasets.โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/teknium/OpenHermes-2.5.

  2. h

    OpenHermes-2.5-Uncensored

    • huggingface.co
    Updated Oct 8, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    rombo dawg (2024). OpenHermes-2.5-Uncensored [Dataset]. https://huggingface.co/datasets/rombodawg/OpenHermes-2.5-Uncensored
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 8, 2024
    Authors
    rombo dawg
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This is the teknium/OpenHermes-2.5 dataset with 2,697 censored lines removed using my uncensored code found bellow.

    https://huggingface.co/datasets/rombodawg/data_processing_code

      Thank you teknium for the original dataset, you can find it bellow.
    

    https://huggingface.co/datasets/teknium/OpenHermes-2.5

      This is the same version of Open-Hermes-2.5 that was used in code_bagel_hermes-2.5 found bellow:โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/rombodawg/OpenHermes-2.5-Uncensored.
    
  3. h

    openhermes

    • huggingface.co
    Updated Dec 24, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    qfq (2024). openhermes [Dataset]. https://huggingface.co/datasets/qfq/openhermes
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 24, 2024
    Dataset authored and provided by
    qfq
    Description

    qfq/openhermes dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    OpenHermes-2.5-Spanish

    • huggingface.co
    Updated Apr 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Iker Garcรญa-Ferrero (2024). OpenHermes-2.5-Spanish [Dataset]. https://huggingface.co/datasets/Iker/OpenHermes-2.5-Spanish
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 1, 2024
    Authors
    Iker Garcรญa-Ferrero
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    teknium/OpenHermes-2.5 dataset translated to Spanish using the Iker/TowerInstruct-13B-v0.1-EN2ES model. This dataset has a total of 1 Million High-Quality instructions in Spanish!! The original dataset can be found here: https://hf.co/datasets/teknium/OpenHermes-2.5 I have also added the following datasets:

    Iker/Document-Translation-en-es Iker/InstructTranslation-EN-ES Helsinki-NLP/opus-100 (en-es, only a few examples to reach 1 million instructions) projecte-aina/RAG_Multilingual(es onlyโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/Iker/OpenHermes-2.5-Spanish.

  5. h

    openhermes-2.5-webdataset

    • huggingface.co
    Updated Feb 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marianna Nezhurina (2024). openhermes-2.5-webdataset [Dataset]. https://huggingface.co/datasets/marianna13/openhermes-2.5-webdataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 17, 2024
    Authors
    Marianna Nezhurina
    Description

    marianna13/openhermes-2.5-webdataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. OpenHermes-2.5-H4

    • huggingface.co
    Updated Aug 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugging Face Smol Models Research (2024). OpenHermes-2.5-H4 [Dataset]. https://huggingface.co/datasets/HuggingFaceTB/OpenHermes-2.5-H4
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 17, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face Smol Models Research
    Description

    OpenHermes2.5 dataset formatted to be compatible with the alignement-handbook for SFT.

  7. h

    SFT-OpenHermes-2.5-Standard

    • huggingface.co
    Updated Jun 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RLHFlow (2024). SFT-OpenHermes-2.5-Standard [Dataset]. https://huggingface.co/datasets/RLHFlow/SFT-OpenHermes-2.5-Standard
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 12, 2024
    Dataset authored and provided by
    RLHFlow
    Description

    RLHFlow/SFT-OpenHermes-2.5-Standard dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    openhermes-2.5-qwen-rewrite

    • huggingface.co
    Updated Dec 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    hqfx (2024). openhermes-2.5-qwen-rewrite [Dataset]. https://huggingface.co/datasets/hqfx/openhermes-2.5-qwen-rewrite
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 28, 2024
    Authors
    hqfx
    Description

    hqfx/openhermes-2.5-qwen-rewrite dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    OpenHermes-2.5-tiny

    • huggingface.co
    Updated Jul 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    REI (2024). OpenHermes-2.5-tiny [Dataset]. https://huggingface.co/datasets/MugenYume/OpenHermes-2.5-tiny
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 11, 2024
    Authors
    REI
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    MugenYume/OpenHermes-2.5-tiny dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    OpenHermes-2.5-Formatted

    • huggingface.co
    Updated Sep 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BRAHMAI Research (2023). OpenHermes-2.5-Formatted [Dataset]. https://huggingface.co/datasets/brahmairesearch/OpenHermes-2.5-Formatted
    Explore at:
    Dataset updated
    Sep 11, 2023
    Dataset authored and provided by
    BRAHMAI Research
    Description

    This is OpenHermes-2.5 Dataset by Teknium which has been formatted to generate the training content with new added text field.

      ORIGINAL DATASET CARD
    
    
    
    
    
    
      Dataset Card for Dataset Name
    

    This is the dataset that made OpenHermes 2.5 and Nous Hermes 2 series of models. Support me on GitHub sponsors <3 : https://github.com/sponsors/teknium1

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    The Open Hermes 2/2.5 and Nous Hermes 2 models have made significant advancements ofโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/brahmairesearch/OpenHermes-2.5-Formatted.

  11. h

    OpenHermes-2.5-Filtered

    • huggingface.co
    Updated Aug 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    C (2024). OpenHermes-2.5-Filtered [Dataset]. https://huggingface.co/datasets/jjqsdq/OpenHermes-2.5-Filtered
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 7, 2024
    Authors
    C
    Description

    jjqsdq/OpenHermes-2.5-Filtered dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. h

    OpenHermes-2.5-1k-longest-curated

    • huggingface.co
    Updated Feb 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mihai (2024). OpenHermes-2.5-1k-longest-curated [Dataset]. https://huggingface.co/datasets/Mihaiii/OpenHermes-2.5-1k-longest-curated
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 18, 2024
    Authors
    Mihai
    Description

    This is a dataset that was created from HuggingFaceH4/OpenHermes-2.5-1k-longest. The purpose is to be able to use in axolotl config by adding: datasets: - path: Mihaiii/OpenHermes-2.5-1k-longest-curated type: alpaca

    I elimininated rows that:

    Had sys prompt (only 3 rows eliminated) Contained on output a character that is repeated 10 times in a row (478 rows eliminated)

    So from a 1000 rows dataset, I ended up with a 519 rows dataset. See the OpenHermes-2.5-1k-longest-curated.ipynbโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/Mihaiii/OpenHermes-2.5-1k-longest-curated.

  13. h

    OpenHermes-2.5-CoT

    • huggingface.co
    Updated May 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Usama Kenway (2025). OpenHermes-2.5-CoT [Dataset]. https://huggingface.co/datasets/usamakenway/OpenHermes-2.5-CoT
    Explore at:
    Dataset updated
    May 4, 2025
    Authors
    Usama Kenway
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    usamakenway/OpenHermes-2.5-CoT dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. h

    OpenHermes-2.5_rolledout

    • huggingface.co
    Updated May 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oleksiy Ostapenko (2025). OpenHermes-2.5_rolledout [Dataset]. https://huggingface.co/datasets/ostapeno/OpenHermes-2.5_rolledout
    Explore at:
    Dataset updated
    May 11, 2025
    Authors
    Oleksiy Ostapenko
    Description

    ostapeno/OpenHermes-2.5_rolledout dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. h

    openhermes-2.5-phi-3-sft

    • huggingface.co
    Updated May 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Call Comply (2024). openhermes-2.5-phi-3-sft [Dataset]. https://huggingface.co/datasets/CallComply/openhermes-2.5-phi-3-sft
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 26, 2024
    Dataset authored and provided by
    Call Comply
    Description

    This is the converted openhermes 2.5 dataset available here: https://huggingface.co/datasets/teknium/OpenHermes-2.5 All credit to teknium for creating this dataset. This was converted to phi-3 prompt template inserting system prompts in the user prompt, and continuing with the user template for phi-3. This converted dataset was designed to train phi-3 mini 4k/128k using the autotrain-advanced trainer from huggingface. There is only a single text column to be used with SFT training method.

  16. h

    OpenHermes-2.5_alpaca_30

    • huggingface.co
    Updated Dec 12, 2001
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Divax Shah (2001). OpenHermes-2.5_alpaca_30 [Dataset]. https://huggingface.co/datasets/diabolic6045/OpenHermes-2.5_alpaca_30
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 12, 2001
    Authors
    Divax Shah
    Description

    diabolic6045/OpenHermes-2.5_alpaca_30 dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. h

    openhermes-2.5_binarized

    • huggingface.co
    Updated Feb 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan (2024). openhermes-2.5_binarized [Dataset]. https://huggingface.co/datasets/jan-hq/openhermes-2.5_binarized
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 2, 2024
    Authors
    Jan
    Description

    jan-hq/openhermes-2.5_binarized dataset hosted on Hugging Face and contributed by the HF Datasets community

  18. h

    OpenHermes-vi-filtered

    • huggingface.co
    Updated Jul 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AIForge (2024). OpenHermes-vi-filtered [Dataset]. https://huggingface.co/datasets/AIForge/OpenHermes-vi-filtered
    Explore at:
    Dataset updated
    Jul 9, 2024
    Dataset authored and provided by
    AIForge
    Description

    AIForge/OpenHermes-vi-filtered dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    OpenHermes-2.5-reformat-test

    • huggingface.co
    Updated Sep 27, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Li Tan (2024). OpenHermes-2.5-reformat-test [Dataset]. https://huggingface.co/datasets/tanliboy/OpenHermes-2.5-reformat-test
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 27, 2024
    Authors
    Li Tan
    Description

    tanliboy/OpenHermes-2.5-reformat-test dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    openhermes-2.5-lamini-phi-format-text

    • huggingface.co
    Updated Aug 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nev (2024). openhermes-2.5-lamini-phi-format-text [Dataset]. https://huggingface.co/datasets/nev/openhermes-2.5-lamini-phi-format-text
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 1, 2024
    Authors
    nev
    Description

    nev/openhermes-2.5-lamini-phi-format-text dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Teknium (2024). OpenHermes-2.5 [Dataset]. https://huggingface.co/datasets/teknium/OpenHermes-2.5

OpenHermes-2.5

OpenHermes 2.5

teknium/OpenHermes-2.5

Explore at:
296 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 5, 2024
Authors
Teknium
Description

Dataset Card for Dataset Name

This is the dataset that made OpenHermes 2.5 and Nous Hermes 2 series of models. Support me on GitHub sponsors <3 : https://github.com/sponsors/teknium1

  Dataset Details





  Dataset Description

The Open Hermes 2/2.5 and Nous Hermes 2 models have made significant advancements of SOTA LLM's over recent months, and are underpinned by this exact compilation and curation of many open source datasets and custom created synthetic datasets.โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/teknium/OpenHermes-2.5.

Search
Clear search
Close search
Google apps
Main menu