100+ datasets found
  1. h

    distilabel-intel-orca-dpo-pairs

    • huggingface.co
    Updated Dec 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Argilla (2024). distilabel-intel-orca-dpo-pairs [Dataset]. https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs
    Explore at:
    Dataset updated
    Dec 11, 2024
    Dataset authored and provided by
    Argilla
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    distilabel Orca Pairs for DPO

    The dataset is a "distilabeled" version of the widely used dataset: Intel/orca_dpo_pairs. The original dataset has been used by 100s of open-source practitioners and models. We knew from fixing UltraFeedback (and before that, Alpacas and Dollys) that this dataset could be highly improved. Continuing with our mission to build the best alignment datasets for open-source LLMs and the community, we spent a few hours improving it with… See the full description on the dataset page: https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs.

  2. h

    distilabel-capybara-dpo-7k-binarized

    • huggingface.co
    Updated Jan 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Argilla (2024). distilabel-capybara-dpo-7k-binarized [Dataset]. https://huggingface.co/datasets/argilla/distilabel-capybara-dpo-7k-binarized
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 31, 2024
    Dataset authored and provided by
    Argilla
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Capybara-DPO 7K binarized

    A DPO dataset built with distilabel atop the awesome LDJnr/Capybara

    This is a preview version to collect feedback from the community. v2 will include the full base dataset and responses from more powerful models.

      Why?
    

    Multi-turn dialogue data is key to fine-tune capable chat models. Multi-turn preference data has been used by the most relevant RLHF works (Anthropic, Meta Llama2, etc.). Unfortunately, there are very few… See the full description on the dataset page: https://huggingface.co/datasets/argilla/distilabel-capybara-dpo-7k-binarized.

  3. h

    distilabel-dataset-generator-only-instructions

    • huggingface.co
    Updated Sep 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Vila (2024). distilabel-dataset-generator-only-instructions [Dataset]. https://huggingface.co/datasets/dvilasuero/distilabel-dataset-generator-only-instructions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2024
    Authors
    Daniel Vila
    Description

    Dataset Card for distilabel-dataset-generator-only-instructions

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/dvilasuero/distilabel-dataset-generator-only-instructions/raw/main/pipeline.yaml"

    or explore the configuration: distilabel… See the full description on the dataset page: https://huggingface.co/datasets/dvilasuero/distilabel-dataset-generator-only-instructions.

  4. h

    distilabel-example4

    • huggingface.co
    Updated Oct 3, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    archit (2024). distilabel-example4 [Dataset]. https://huggingface.co/datasets/archit11/distilabel-example4
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 3, 2024
    Authors
    archit
    Description

    Dataset Card for distilabel-example4

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/archit11/distilabel-example4/raw/main/pipeline.yaml"

    or explore the configuration: distilabel pipeline info --config… See the full description on the dataset page: https://huggingface.co/datasets/archit11/distilabel-example4.

  5. h

    distilabel-reflection-tuning

    • huggingface.co
    Updated Sep 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel Martín Blázquez (2024). distilabel-reflection-tuning [Dataset]. https://huggingface.co/datasets/gabrielmbmb/distilabel-reflection-tuning
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 6, 2024
    Authors
    Gabriel Martín Blázquez
    Description

    Dataset Card for distilabel-reflection-tuning

    This dataset has been created with distilabel. The pipeline script was uploaded to easily reproduce the dataset: reflection.py. It can be run directly using the CLI: distilabel pipeline run --script "https://huggingface.co/datasets/gabrielmbmb/distilabel-reflection-tuning/raw/main/reflection.py"

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated… See the full description on the dataset page: https://huggingface.co/datasets/gabrielmbmb/distilabel-reflection-tuning.

  6. h

    distilabel-intel-orca-kto

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Argilla, distilabel-intel-orca-kto [Dataset]. https://huggingface.co/datasets/argilla/distilabel-intel-orca-kto
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset authored and provided by
    Argilla
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    distilabel Orca Pairs for KTO

    A KTO signal transformed version of the highly loved distilabel Orca Pairs for DPO.

    The dataset is a "distilabeled" version of the widely used dataset: Intel/orca_dpo_pairs. The original dataset has been used by 100s of open-source practitioners and models. We knew from fixing UltraFeedback (and before that, Alpacas and Dollys) that this dataset could be highly improved. Continuing with our mission to build the best alignment datasets… See the full description on the dataset page: https://huggingface.co/datasets/argilla/distilabel-intel-orca-kto.

  7. h

    instruction-dataset-sample

    • huggingface.co
    Updated Feb 10, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    distilabel-internal-testing (2023). instruction-dataset-sample [Dataset]. https://huggingface.co/datasets/distilabel-internal-testing/instruction-dataset-sample
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 10, 2023
    Dataset authored and provided by
    distilabel-internal-testing
    Description

    distilabel-internal-testing/instruction-dataset-sample dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    distilabel-instruction-to-preference-dataset

    • huggingface.co
    Updated Feb 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jaewon Chung (2023). distilabel-instruction-to-preference-dataset [Dataset]. https://huggingface.co/datasets/Mervyn999/distilabel-instruction-to-preference-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 10, 2023
    Authors
    Jaewon Chung
    Description

    Dataset Card for distilabel-instruction-to-preference-dataset

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/Mervyn999/distilabel-instruction-to-preference-dataset/raw/main/pipeline.yaml"

    or explore the configuration: distilabel pipeline… See the full description on the dataset page: https://huggingface.co/datasets/Mervyn999/distilabel-instruction-to-preference-dataset.

  9. h

    distilabel-sample-evol-instruct

    • huggingface.co
    Updated Feb 8, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Argilla (2024). distilabel-sample-evol-instruct [Dataset]. https://huggingface.co/datasets/argilla/distilabel-sample-evol-instruct
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 8, 2024
    Dataset authored and provided by
    Argilla
    Description

    argilla/distilabel-sample-evol-instruct dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    distilabel-magpie-math

    • huggingface.co
    Updated Sep 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel Martín Blázquez (2024). distilabel-magpie-math [Dataset]. https://huggingface.co/datasets/gabrielmbmb/distilabel-magpie-math
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2024
    Authors
    Gabriel Martín Blázquez
    Description

    Dataset Card for distilabel-magpie-math

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/gabrielmbmb/distilabel-magpie-math/raw/main/pipeline.yaml"

    or explore the configuration: distilabel pipeline info --config… See the full description on the dataset page: https://huggingface.co/datasets/gabrielmbmb/distilabel-magpie-math.

  11. h

    distilabel-reasoning-R1-Llama-70B

    • huggingface.co
    Updated Jan 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lightblue KK. (2025). distilabel-reasoning-R1-Llama-70B [Dataset]. https://huggingface.co/datasets/lightblue/distilabel-reasoning-R1-Llama-70B
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 27, 2025
    Dataset authored and provided by
    Lightblue KK.
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    How this Data was made

    We made this data through the following steps:

    Sample English reasoning-style prompts from argilla/distilabel-reasoning-prompts. Remove similar prompts using text similarity based on BAAI/bge-m3 embeddings. Translate English prompts to Japanese using gpt-4o-mini-2024-07-18. Generate answers to prompts using deepseek-ai/DeepSeek-R1-Distill-Llama-70B. Filter responses (to ja_valid) which did not: Finish within 2048 tokens Contain a valid

  12. h

    distilabel-example-test

    • huggingface.co
    Updated Nov 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thomas Wolf (2024). distilabel-example-test [Dataset]. https://huggingface.co/datasets/thomwolf/distilabel-example-test
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 10, 2024
    Authors
    Thomas Wolf
    Description

    Dataset Card for distilabel-example-test

    This dataset has been created with Argilla. As shown in the sections below, this dataset can be loaded into your Argilla server as explained in Load with Argilla, or used directly with the datasets library in Load with datasets.

      Using this dataset with Argilla
    

    To load with Argilla, you'll just need to install Argilla as pip install argilla --upgrade and then use the following code: import argilla as rg

    ds =… See the full description on the dataset page: https://huggingface.co/datasets/thomwolf/distilabel-example-test.

  13. h

    instruction-dataset-with-llama3

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    distilabel-internal-testing, instruction-dataset-with-llama3 [Dataset]. https://huggingface.co/datasets/distilabel-internal-testing/instruction-dataset-with-llama3
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset authored and provided by
    distilabel-internal-testing
    Description

    Dataset Card for instruction-dataset-with-llama3

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/distilabel-internal-testing/instruction-dataset-with-llama3/raw/main/pipeline.yaml"

    or explore the configuration: distilabel pipeline info… See the full description on the dataset page: https://huggingface.co/datasets/distilabel-internal-testing/instruction-dataset-with-llama3.

  14. h

    instruction-dataset-mini-with-generations

    • huggingface.co
    Updated Feb 10, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    distilabel-internal-testing (2023). instruction-dataset-mini-with-generations [Dataset]. https://huggingface.co/datasets/distilabel-internal-testing/instruction-dataset-mini-with-generations
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 10, 2023
    Dataset authored and provided by
    distilabel-internal-testing
    Description

    Dataset Card for instruction-dataset-mini-with-generations

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/distilabel-internal-testing/instruction-dataset-mini-with-generations/raw/main/pipeline.yaml"

    or explore the configuration:… See the full description on the dataset page: https://huggingface.co/datasets/distilabel-internal-testing/instruction-dataset-mini-with-generations.

  15. h

    inference-endpoints-structured-generation-multiple

    • huggingface.co
    Updated Jul 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    distilabel-internal-testing (2024). inference-endpoints-structured-generation-multiple [Dataset]. https://huggingface.co/datasets/distilabel-internal-testing/inference-endpoints-structured-generation-multiple
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 5, 2024
    Dataset authored and provided by
    distilabel-internal-testing
    Description

    Dataset Card for inference-endpoints-structured-generation-multiple

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/distilabel-internal-testing/inference-endpoints-structured-generation-multiple/raw/main/pipeline.yaml"

    or explore the… See the full description on the dataset page: https://huggingface.co/datasets/distilabel-internal-testing/inference-endpoints-structured-generation-multiple.

  16. h

    rag-synthetic-distilabel

    • huggingface.co
    Updated Apr 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mary N (2025). rag-synthetic-distilabel [Dataset]. https://huggingface.co/datasets/m-newhauser/rag-synthetic-distilabel
    Explore at:
    Dataset updated
    Apr 16, 2025
    Authors
    Mary N
    Description

    m-newhauser/rag-synthetic-distilabel dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. h

    distilabel-artifacts-example

    • huggingface.co
    Updated Feb 10, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    distilabel-internal-testing (2023). distilabel-artifacts-example [Dataset]. https://huggingface.co/datasets/distilabel-internal-testing/distilabel-artifacts-example
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 10, 2023
    Dataset authored and provided by
    distilabel-internal-testing
    Description

    Dataset Card for distilabel-artifacts-example

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/distilabel-internal-testing/distilabel-artifacts-example/raw/main/pipeline.yaml"

    or explore the configuration: distilabel pipeline info --config… See the full description on the dataset page: https://huggingface.co/datasets/distilabel-internal-testing/distilabel-artifacts-example.

  18. h

    preferance-dataset-with-distilabel

    • huggingface.co
    Updated Nov 24, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ritesh Kumar Tiwary (2024). preferance-dataset-with-distilabel [Dataset]. https://huggingface.co/datasets/riteshkr/preferance-dataset-with-distilabel
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 24, 2024
    Authors
    Ritesh Kumar Tiwary
    Description

    Dataset Card for preferance-dataset-with-distilabel

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/riteshkr/preferance-dataset-with-distilabel/raw/main/pipeline.yaml"

    or explore the configuration: distilabel pipeline info --config… See the full description on the dataset page: https://huggingface.co/datasets/riteshkr/preferance-dataset-with-distilabel.

  19. h

    instruction-dataset-mini

    • huggingface.co
    Updated Feb 10, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    distilabel-internal-testing (2023). instruction-dataset-mini [Dataset]. https://huggingface.co/datasets/distilabel-internal-testing/instruction-dataset-mini
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 10, 2023
    Dataset authored and provided by
    distilabel-internal-testing
    Description

    distilabel-internal-testing/instruction-dataset-mini dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    testing-dataset-distilabel

    • huggingface.co
    Updated Sep 27, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shahzaib Niaz (2024). testing-dataset-distilabel [Dataset]. https://huggingface.co/datasets/shazoo2k/testing-dataset-distilabel
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 27, 2024
    Authors
    Shahzaib Niaz
    Description

    Dataset Card for testing-dataset-distilabel

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/shazoo2k/testing-dataset-distilabel/raw/main/pipeline.yaml"

    or explore the configuration: distilabel pipeline info --config… See the full description on the dataset page: https://huggingface.co/datasets/shazoo2k/testing-dataset-distilabel.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Argilla (2024). distilabel-intel-orca-dpo-pairs [Dataset]. https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs

distilabel-intel-orca-dpo-pairs

argilla/distilabel-intel-orca-dpo-pairs

Explore at:
18 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Dec 11, 2024
Dataset authored and provided by
Argilla
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

distilabel Orca Pairs for DPO

The dataset is a "distilabeled" version of the widely used dataset: Intel/orca_dpo_pairs. The original dataset has been used by 100s of open-source practitioners and models. We knew from fixing UltraFeedback (and before that, Alpacas and Dollys) that this dataset could be highly improved. Continuing with our mission to build the best alignment datasets for open-source LLMs and the community, we spent a few hours improving it with… See the full description on the dataset page: https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs.

Search
Clear search
Close search
Google apps
Main menu