5 datasets found

h
guanaco-llama2-1k-en
huggingface.co
Updated Mar 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Girish Koushik (2025). guanaco-llama2-1k-en [Dataset]. https://huggingface.co/datasets/mindhunter23/guanaco-llama2-1k-en
Explore at:
Dataset updated
Mar 23, 2025
Authors
Girish Koushik
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Guanaco-1k: Lazy Llama 2 Formatting

This is an English subset (1k samples) of the timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab. Reference: mlabonne/guanaco-llama2-1k
h
guanaco-llama2-1k
huggingface.co
Updated Feb 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Federico Puy (2024). guanaco-llama2-1k [Dataset]. https://huggingface.co/datasets/federicopuy/guanaco-llama2-1k
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 7, 2024
Authors
Federico Puy
Description
Guanaco-1k -> Llama-2 Dataset

Subset of 1000 samples of the timdettmers/openassistant-guanaco dataset. It has been transformed to match Llama 2's prompt format according to how to prompt llama 2. Colab notebook.
h
test-dataset
huggingface.co
Updated Jul 25, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pravar Ved (2023). test-dataset [Dataset]. https://huggingface.co/datasets/Pravarved/test-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 25, 2023
Authors
Pravar Ved
Description
Guanaco-1k: Lazy Llama 2 Formatting

This is a subset (1000 samples) of the excellent timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab.
h
guanaco-llama2-1k
huggingface.co
opendatalab.com
Updated Jul 25, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maxime Labonne (2023). guanaco-llama2-1k [Dataset]. https://huggingface.co/datasets/mlabonne/guanaco-llama2-1k
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 25, 2023
Authors
Maxime Labonne
Description
Guanaco-1k: Lazy Llama 2 Formatting

This is a subset (1000 samples) of the excellent timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab.
h
DictionaryTrain
huggingface.co
Updated Feb 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
malik (2024). DictionaryTrain [Dataset]. https://huggingface.co/datasets/M0hammed87/DictionaryTrain
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 7, 2024
Authors
malik
Description
Guanaco-1k: Lazy Llama 2 Formatting

This is a subset (1000 samples) of the excellent timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Girish Koushik (2025). guanaco-llama2-1k-en [Dataset]. https://huggingface.co/datasets/mindhunter23/guanaco-llama2-1k-en

guanaco-llama2-1k-en

mindhunter23/guanaco-llama2-1k-en

Explore at:

Dataset updated

Mar 23, 2025

Authors

Girish Koushik

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Guanaco-1k: Lazy Llama 2 Formatting

This is an English subset (1k samples) of the timdettmers/openassistant-guanaco dataset, processed to match Llama 2's prompt format as described in this article. It was created using the following colab notebook. Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for this article about fine-tuning a Llama 2 (chat) model in a Google Colab. Reference: mlabonne/guanaco-llama2-1k

Clear search

Close search

Google apps

Main menu

guanaco-llama2-1k-en

guanaco-llama2-1k

test-dataset

guanaco-llama2-1k

DictionaryTrain

guanaco-llama2-1k-en

mindhunter23/guanaco-llama2-1k-en