4 datasets found
  1. OpenOrca

    • huggingface.co
    • opendatalab.com
    Updated Jun 29, 2023
    Cite
    OpenOrca (2023). OpenOrca [Dataset]. https://huggingface.co/datasets/Open-Orca/OpenOrca
    Dataset updated
    Jun 29, 2023
    Dataset authored and provided by
    OpenOrca
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    πŸ‹ The OpenOrca Dataset! πŸ‹

    We are thrilled to announce the release of the OpenOrca dataset! This rich collection of augmented FLAN data aligns, as best as possible, with the distributions outlined in the Orca paper. It has been instrumental in generating high-performing model checkpoints and serves as a valuable resource for all NLP researchers and developers!

      Official Models

      Mistral-7B-OpenOrca
    

    Our latest model, the first 7B to score better overall than all… See the full description on the dataset page: https://huggingface.co/datasets/Open-Orca/OpenOrca.

  2. OpenOrca-Open-Orca

    • huggingface.co
    Updated May 11, 2025
    Cite
    GenRM: Generative Reward Models (2025). OpenOrca-Open-Orca [Dataset]. https://huggingface.co/datasets/GenRM/OpenOrca-Open-Orca
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    GenRM: Generative Reward Models
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    πŸ‹ The OpenOrca Dataset! πŸ‹

    We are thrilled to announce the release of the OpenOrca dataset! This rich collection of augmented FLAN data aligns, as best as possible, with the distributions outlined in the Orca paper. It has been instrumental in generating high-performing model checkpoints and serves as a valuable resource for all NLP researchers and developers!

      Official Models

      Mistral-7B-OpenOrca
    

    Our latest model, the first 7B to score better overall than all… See the full description on the dataset page: https://huggingface.co/datasets/GenRM/OpenOrca-Open-Orca.

  3. Open-Orca

    • huggingface.co
    Updated Aug 30, 2024
    Cite
    Lymeman (2024). Open-Orca [Dataset]. https://huggingface.co/datasets/Triangle104/Open-Orca
    Dataset updated
    Aug 30, 2024
    Authors
    Lymeman
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    πŸ‹ The OpenOrca Dataset! πŸ‹

    We are thrilled to announce the release of the OpenOrca dataset! This rich collection of augmented FLAN data aligns, as best as possible, with the distributions outlined in the Orca paper. It has been instrumental in generating high-performing model checkpoints and serves as a valuable resource for all NLP researchers and developers!

      Official Models

      Mistral-7B-OpenOrca
    

    Our latest model, the first 7B to score better overall than all… See the full description on the dataset page: https://huggingface.co/datasets/Triangle104/Open-Orca.

  4. OpenOrca

    • huggingface.co
    Updated Oct 20, 2023
    Cite
    Polina Kazakova (2023). OpenOrca [Dataset]. https://huggingface.co/datasets/polinaeterna/OpenOrca
    Dataset updated
    Oct 20, 2023
    Authors
    Polina Kazakova
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    πŸ‹ The OpenOrca Dataset! πŸ‹

    We are thrilled to announce the release of the OpenOrca dataset! This rich collection of augmented FLAN data aligns, as best as possible, with the distributions outlined in the Orca paper. It has been instrumental in generating high-performing model checkpoints and serves as a valuable resource for all NLP researchers and developers!

      Official Models

      Mistral-7B-OpenOrca
    

    Our latest model, the first 7B to score better overall than all… See the full description on the dataset page: https://huggingface.co/datasets/polinaeterna/OpenOrca.


Open-Orca/OpenOrca

382 scholarly articles cite this dataset.