Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Chat Fine-tuning Dataset - Llama 2 Style
This dataset allows for fine-tuning chat models using [INST] AND [/INST] to wrap user messages. Preparation:
The dataset is cloned from TimDettmers, which itself is a subset of the Open Assistant dataset, which you can find here. This subset of the data only contains the highest-rated paths in the conversation tree, with a total of 9,846 samples. The dataset was then filtered to:
replace instances of '### Human:' with '[INST]' replace… See the full description on the dataset page: https://huggingface.co/datasets/Trelis/openassistant-llama-style.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Chat Fine-tuning Dataset - Llama 2 Style
This dataset allows for fine-tuning chat models using [INST] AND [/INST] to wrap user messages. Preparation:
The dataset is cloned from TimDettmers, which itself is a subset of the Open Assistant dataset, which you can find here. This subset of the data only contains the highest-rated paths in the conversation tree, with a total of 9,846 samples. The dataset was then filtered to:
replace instances of '### Human:' with '[INST]' replace… See the full description on the dataset page: https://huggingface.co/datasets/Trelis/openassistant-llama-style.