Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Update
[01/31/2024] We updated the OpenAI Moderation API results for ToxicChat (0124) based on their moderation model updated on Jan 25, 2024.
[01/28/2024] We released an official T5-Large model trained on ToxicChat (toxicchat0124). Check it out for your baseline comparison!
[01/19/2024] We have a new version of ToxicChat (toxicchat0124)!
Content
This dataset contains toxicity annotations on 10K user prompts collected from the Vicuna online demo. We utilize a human-AI… See the full description on the dataset page: https://huggingface.co/datasets/lmsys/toxic-chat.
ToxicChat is a novel benchmark dataset constructed from real user queries to an open-source chatbot. Unlike previous toxicity detection benchmarks that primarily rely on social media content, ToxicChat captures the rich and nuanced phenomena inherent in real-world user-AI interactions. This unique dataset reveals significant domain differences compared to social media content, making it a valuable resource for exploring the challenges of toxicity detection in user-AI conversations¹.
Here are some key details about the ToxicChat dataset:
Construction: ToxicChat was created using real user queries collected from an open-source chatbot.
Challenges: It contains phenomena that can be tricky for current toxicity detection models to identify.
Domain Difference: ToxicChat exhibits a significant domain difference when compared to social media content.
Purpose: ToxicChat serves as a benchmark to drive advancements in building a safe and healthy environment for user-AI interactions.
Source: Conversation with Bing, 3/17/2024.
(1) ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real .... https://aclanthology.org/2023.findings-emnlp.311/
(2) arXiv:2310.17389v1 [cs.CL], 26 Oct 2023. https://arxiv.org/pdf/2310.17389
(3) README.md · lmsys/toxic-chat at main - Hugging Face. https://huggingface.co/datasets/lmsys/toxic-chat/blob/main/README.md
(4) The Toxicity Dataset - GitHub. https://github.com/surge-ai/toxicity
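As a minimal sketch of working with ToxicChat-style annotations, the function below partitions annotated prompt records by their binary toxicity label. The field names `user_input` and `toxicity` are assumptions about the schema, not confirmed by this page:

```python
def split_by_toxicity(records):
    """Partition annotated prompt records into (toxic, non_toxic) lists.

    Assumes each record carries a binary `toxicity` label, as in the
    ToxicChat annotations described above.
    """
    toxic = [r for r in records if r["toxicity"] == 1]
    non_toxic = [r for r in records if r["toxicity"] == 0]
    return toxic, non_toxic

# Hypothetical records following the assumed schema:
rows = [
    {"user_input": "how do I bake bread?", "toxicity": 0},
    {"user_input": "<a prompt flagged by annotators>", "toxicity": 1},
]
toxic, non_toxic = split_by_toxicity(rows)
```

The same split applies unchanged whether the records come from the 10K Vicuna-demo prompts or any other source with a binary toxicity annotation.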
dffesalbon/dota-2-toxic-chat-data dataset hosted on Hugging Face and contributed by the HF Datasets community
IanLi233/Toxic-Chat-V2 dataset hosted on Hugging Face and contributed by the HF Datasets community
This dataset was created by ali waleed
akcit-ijf/toxic-chat dataset hosted on Hugging Face and contributed by the HF Datasets community
❤️🩹 Sensai: Toxic Chat Dataset
Sensai is a toxic chat dataset consisting of live chats from Virtual YouTubers' live streams. Download the dataset from Kaggle Datasets and join the #livechat-dataset channel on the holodata Discord for discussions.
Provenance
Source: YouTube Live Chat events (all streams covered by Holodex, including Hololive, Nijisanji, 774inc, etc.)
Temporal Coverage: From 2021-01-15T05:15:33Z
Update Frequency: At least once per month
Research Ideas… See the full description on the dataset page: https://huggingface.co/datasets/holodata/sensai.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here are a few use cases for this project:
Gaming Communication Management: "EE_Chat" can be used by game developers and gaming companies to understand the communication patterns among players. It can help to analyze in-game messaging, detect toxic behaviour or keywords, manage group interactions, and gather insights on player behaviour.
Online Gaming Experience Improvement: This model can be used to analyze the chatting patterns, popular topics, frequent message times, and the interactions between players. These insights can then be used to improve the chatting function, enhance user experience, and boost overall game engagement.
Chat Support Systems: EE_Chat can be adapted for use in understanding customer service interactions on various platforms. It can be used to categorize messages, analyze response times, and understand the effectiveness of the customer support team.
Interactive Game Streaming: Streamers and content creators can integrate "EE_Chat" into their streaming platforms for live interaction with their audience. The model can help them keep track of messages, prioritize certain types of messages, or filter out unwanted content.
Marketing & Advertising: This model can be used by businesses to analyze discussions in gaming communities. They can understand popular chat channels, the most active times, and trending topics. It can help in identifying high-potential markets and effective advertising placements, and in creating targeted marketing campaigns.
BRlkl/toxic-chat-pt dataset hosted on Hugging Face and contributed by the HF Datasets community
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Dataset Card for Real Toxicity Prompts
Dataset Summary
RealToxicityPrompts is a dataset of 100k sentence snippets from the web for researchers to further address the risk of neural toxic degeneration in models.
Languages
English
Dataset Structure
Data Instances
Each instance represents a prompt and its metadata: { "filename":"0766186-bc7f2a64cb271f5f56cf6f25570cd9ed.txt", "begin":340, "end":564, "challenging":false… See the full description on the dataset page: https://huggingface.co/datasets/allenai/real-toxicity-prompts.
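The `begin` and `end` fields locate the prompt span as character offsets within the source file named by `filename`. A minimal sketch of working with one instance, using the values from the example above:

```python
instance = {
    "filename": "0766186-bc7f2a64cb271f5f56cf6f25570cd9ed.txt",
    "begin": 340,
    "end": 564,
    "challenging": False,
}

# The offsets give the snippet's location in the source document.
span_length = instance["end"] - instance["begin"]

def slice_snippet(text: str, inst: dict) -> str:
    """Recover the snippet from the full document text using its offsets."""
    return text[inst["begin"]:inst["end"]]
```

Here `slice_snippet` is a hypothetical helper for illustration; the dataset itself ships the prompt text alongside this metadata.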
Dataset Card for WildChat-nontoxic
Note: a newer version with 1 million conversations and demographic information can be found here.
Dataset Description
Paper: https://wenting-zhao.github.io/papers/wildchat.pdf
License: https://allenai.org/licenses/impact-lr
Language(s) (NLP): multi-lingual
Point of Contact: Yuntian Deng
Dataset Summary
WildChat-nontoxic is the nontoxic subset of the WildChat dataset, a collection of 530K conversations… See the full description on the dataset page: https://huggingface.co/datasets/allenai/WildChat-nontoxic.
Toxic Conversation
This is a version of the Jigsaw Unintended Bias in Toxicity Classification dataset. It contains comments from the Civil Comments platform together with annotations of whether each comment is toxic. Ten annotators annotated each example and, as recommended on the task page, a comment is marked as toxic when target >= 0.5. The dataset is imbalanced, with only about 8% of the comments marked as toxic.
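The thresholding rule above can be sketched as a small helper that maps the fractional annotator score to a binary label (the function name is illustrative, not from the dataset's tooling):

```python
def binarize_target(target: float, threshold: float = 0.5) -> int:
    """Map the fractional annotator agreement score to a binary toxic label,
    following the task-page recommendation (toxic when target >= 0.5)."""
    return int(target >= threshold)

# Example scores from ten-annotator agreement, binarized:
labels = [binarize_target(t) for t in (0.0, 0.3, 0.5, 0.9)]
toxic_rate = sum(labels) / len(labels)
```

On the real dataset, `toxic_rate` would come out near 0.08, reflecting the class imbalance noted above.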
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0): https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
⚠️ Warning: this dataset contains examples of toxic, offensive, and inappropriate language. The current LLM safety landscape struggles with accurate real-world benchmarking of content moderation systems. This dataset is a subset of a larger benchmark dataset constructed by the Dynamo AI research team. It consists of real-world chats written by humans with toxic intent. In order to ensure impartiality and standardization, we drew upon open-source and human-annotated examples from Allen… See the full description on the dataset page: https://huggingface.co/datasets/dynamoai/dynamoai-benchmark-safety.
https://choosealicense.com/licenses/odc-by/
Dataset Card for WildChat
Dataset Description
Paper: https://arxiv.org/abs/2405.01470
Interactive Search Tool: https://wildvisualizer.com (paper)
License: ODC-BY
Language(s) (NLP): multi-lingual
Point of Contact: Yuntian Deng
Dataset Summary
WildChat is a collection of 1 million conversations between human users and ChatGPT, alongside demographic data, including state, country, hashed IP addresses, and request headers. We collected WildChat by… See the full description on the dataset page: https://huggingface.co/datasets/allenai/WildChat-1M.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
SEA Toxicity Detection
SEA Toxicity Detection evaluates a model's ability to identify toxic content such as hate speech and abusive language in text. It is sampled from MLHSD for Indonesian, TTD for Thai, and ViHSD for Vietnamese.
Supported Tasks and Leaderboards
SEA Toxicity Detection is designed for evaluating chat or instruction-tuned large language models (LLMs). It is part of the SEA-HELM leaderboard from AI Singapore.
Languages
Indonesian (id) Thai… See the full description on the dataset page: https://huggingface.co/datasets/aisingapore/NLU-Toxicity-Detection.
https://choosealicense.com/licenses/odc-by/
Dataset Card for WildChat
Note: a newer version with 1 million conversations and demographic information can be found here.
Dataset Description
Paper: https://arxiv.org/abs/2405.01470
Interactive Search Tool: https://wildvisualizer.com (paper)
License: ODC-BY
Language(s) (NLP): multi-lingual
Point of Contact: Yuntian Deng
Dataset Summary
WildChat is a collection of 650K conversations between human users and ChatGPT. We collected WildChat… See the full description on the dataset page: https://huggingface.co/datasets/allenai/WildChat.
https://choosealicense.com/licenses/cc/
Chatbot Arena Conversations Dataset
This dataset contains 33K cleaned conversations with pairwise human preferences. It is collected from 13K unique IP addresses on the Chatbot Arena from April to June 2023. Each sample includes a question ID, two model names, their full conversation text in OpenAI API JSON format, the user vote, the anonymized user ID, the detected language tag, the OpenAI moderation API tag, the additional toxic tag, and the timestamp. To ensure the safe release… See the full description on the dataset page: https://huggingface.co/datasets/lmsys/chatbot_arena_conversations.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
IndicAlign
A diverse collection of instruction and toxic alignment datasets for 14 Indic languages. The collection comprises:
IndicAlign - Instruct: Indic-ShareLlama, Dolly-T, OpenAssistant-T, WikiHow, IndoWordNet, Anudesh, Wiki-Conv, Wiki-Chat
IndicAlign - Toxic: HHRLHF-T, Toxic-Matrix
We use IndicTrans2 (Gala et al., 2023) to translate the datasets. We recommend that readers check out our paper on arXiv for detailed information on the curation process of these… See the full description on the dataset page: https://huggingface.co/datasets/ai4bharat/indic-align.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset Card for ProsocialDialog Dataset
Dataset Summary
ProsocialDialog is the first large-scale multi-turn English dialogue dataset to teach conversational agents to respond to problematic content following social norms. Covering diverse unethical, problematic, biased, and toxic situations, ProsocialDialog contains responses that encourage prosocial behavior, grounded in commonsense social rules (i.e., rules-of-thumb, RoTs). Created via a human-AI collaborative… See the full description on the dataset page: https://huggingface.co/datasets/allenai/prosocial-dialog.