5 datasets found
  1. h

    guanaco-llama2-1k-en

    • huggingface.co
    Updated Mar 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Girish Koushik (2025). guanaco-llama2-1k-en [Dataset]. https://huggingface.co/datasets/mindhunter23/guanaco-llama2-1k-en
    Explore at:
    Dataset updated
    Mar 23, 2025
    Authors
    Girish Koushik
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Guanaco-1k: Lazy Llama 2 Formatting

    This is an English subset (1k samples) of the timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab. Reference: mlabonne/guanaco-llama2-1k

  2. h

    guanaco-llama2-1k

    • huggingface.co
    Updated Feb 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federico Puy (2024). guanaco-llama2-1k [Dataset]. https://huggingface.co/datasets/federicopuy/guanaco-llama2-1k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 7, 2024
    Authors
    Federico Puy
    Description

    Guanaco-1k -> Llama-2 Dataset

    Subset of 1000 samples of the timdettmers/openassistant-guanaco dataset. It has been transformed to match Llama 2's prompt format according to how to prompt llama 2. Colab notebook.

  3. h

    test-dataset

    • huggingface.co
    Updated Jul 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pravar Ved (2023). test-dataset [Dataset]. https://huggingface.co/datasets/Pravarved/test-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 25, 2023
    Authors
    Pravar Ved
    Description

    Guanaco-1k: Lazy Llama 2 Formatting

    This is a subset (1000 samples) of the excellent timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab.

  4. h

    guanaco-llama2-1k

    • huggingface.co
    • opendatalab.com
    Updated Jul 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maxime Labonne (2023). guanaco-llama2-1k [Dataset]. https://huggingface.co/datasets/mlabonne/guanaco-llama2-1k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 25, 2023
    Authors
    Maxime Labonne
    Description

    Guanaco-1k: Lazy Llama 2 Formatting

    This is a subset (1000 samples) of the excellent timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab.

  5. h

    DictionaryTrain

    • huggingface.co
    Updated Feb 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    malik (2024). DictionaryTrain [Dataset]. https://huggingface.co/datasets/M0hammed87/DictionaryTrain
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 7, 2024
    Authors
    malik
    Description

    Guanaco-1k: Lazy Llama 2 Formatting

    This is a subset (1000 samples) of the excellent timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab.

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Girish Koushik (2025). guanaco-llama2-1k-en [Dataset]. https://huggingface.co/datasets/mindhunter23/guanaco-llama2-1k-en

guanaco-llama2-1k-en

mindhunter23/guanaco-llama2-1k-en

Explore at:
Dataset updated
Mar 23, 2025
Authors
Girish Koushik
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Guanaco-1k: Lazy Llama 2 Formatting

This is an English subset (1k samples) of the timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab. Reference: mlabonne/guanaco-llama2-1k

Search
Clear search
Close search
Google apps
Main menu