LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
This dataset contains one million real-world conversations with 25 state-of-the-art LLMs. It is collected from 210K unique IP addresses in the wild on the Vicuna demo and Chatbot Arena website from April to August 2023. Each sample includes a conversation ID, model name, conversation text in OpenAI API JSON format, detected language tag, and OpenAI moderation API tag. User consent is obtained through the "Terms of⦠See the full description on the dataset page: https://huggingface.co/datasets/lmsys/lmsys-chat-1m.
bew/lmsys-chat-1m-qwen2-instruct dataset hosted on Hugging Face and contributed by the HF Datasets community
jsonifize/lmsys-chat-1m-jsonify-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community
Stanleytowne/lmsys-chat-Qwen2.5-1.5B-Instruct-1epoch-100k dataset hosted on Hugging Face and contributed by the HF Datasets community
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
MT Bench by LMSYS
This set of evaluation prompts is created by the LMSYS org for better evaluation of chat models. For more information, see the paper.
Dataset loading
To load this dataset, use π€ datasets: from datasets import load_dataset data = load_dataset(HuggingFaceH4/mt_bench_prompts, split="train")
Dataset creation
To create the dataset, we do the following for our internal tooling.
rename turns to prompts, add empty reference to⦠See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceH4/mt_bench_prompts.
Dataset Card for "ScaleBiO-Train-lmsys-chat-1m"
More Information needed
Dataset Card for "lmsys-finance"
This dataset is a curated version of the lmsys-chat-1m dataset, focusing solely on finance-related conversations. The refinement process encompassed:
Removing non-English conversations. Selecting conversations from models: "vicuna-33b", "wizardlm-13b", "gpt-4", "gpt-3.5-turbo", "claude-2", "palm-2", and "claude-instant-1". Excluding conversations with responses under 30 characters. Using 100 financial keywords, choosing conversations with at⦠See the full description on the dataset page: https://huggingface.co/datasets/amphora/lmsys-finance.
Dataset made on the basis of lmsys/lmsys-chat-1m With data only for the Spanish language.
x-angelkawaii-x/lmsys-chat-tiny-20k dataset hosted on Hugging Face and contributed by the HF Datasets community
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
WildIFEval Dataset
This dataset was originally introduced in the paper WildIFEval: Instruction Following in the Wild, available on arXiv. Code: https://github.com/gililior/wild-if-eval
Dataset Overview
The WildIFEval dataset is designed for evaluating instruction-following capabilities in language models. It provides decompositions of conversations extracted from the LMSYS-Chat-1M dataset. Each example includes:
conversation_id: A unique identifier for each conversation.β¦ See the full description on the dataset page: https://huggingface.co/datasets/gililior/wild-if-eval.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Contains maximum activating examples for all the features of our crosscoder trained on gemma 2 2B layer 13 available here: https://huggingface.co/Butanium/gemma-2-2b-crosscoder-l13-mu4.1e-02-lr1e-04/blob/main/README.md
base_examples.pt contains all the maximum examples of the feature on a subset of validation test of fineweb chat_examples.pt is the same but for lmsys chat data chat_base_examples.pt is a merge of the two above files. All files are of the type dict[int, list[tuple[float⦠See the full description on the dataset page: https://huggingface.co/datasets/science-of-finetuning/diffing-stats-gemma-2-2b-crosscoder-l13-mu4.1e-02-lr1e-04.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
This dataset contains one million real-world conversations with 25 state-of-the-art LLMs. It is collected from 210K unique IP addresses in the wild on the Vicuna demo and Chatbot Arena website from April to August 2023. Each sample includes a conversation ID, model name, conversation text in OpenAI API JSON format, detected language tag, and OpenAI moderation API tag. User consent is obtained through the "Terms of⦠See the full description on the dataset page: https://huggingface.co/datasets/lmsys/lmsys-chat-1m.