20 datasets found
  1. toxic-chat

    • huggingface.co
    Updated Jan 25, 2024
    Cite
    Large Model Systems Organization (2024). toxic-chat [Dataset]. https://huggingface.co/datasets/lmsys/toxic-chat
    Dataset updated
    Jan 25, 2024
    Dataset authored and provided by
    Large Model Systems Organization
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0), https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Update

    [01/31/2024] We update the OpenAI Moderation API results for ToxicChat (0124) based on their updated moderation model on Jan 25, 2024.
    [01/28/2024] We release an official T5-Large model trained on ToxicChat (toxicchat0124). Go and check it out for your baseline comparison!
    [01/19/2024] We have a new version of ToxicChat (toxicchat0124)!

      Content
    

    This dataset contains toxicity annotations on 10K user prompts collected from the Vicuna online demo. We utilize a human-AI… See the full description on the dataset page: https://huggingface.co/datasets/lmsys/toxic-chat.

  2. ToxicChat Dataset

    • paperswithcode.com
    Updated May 15, 2025
    Cite
    Zi Lin; Zihan Wang; Yongqi Tong; Yangkun Wang; Yuxin Guo; Yujia Wang; Jingbo Shang (2025). ToxicChat Dataset [Dataset]. https://paperswithcode.com/dataset/toxicchat
    Dataset updated
    May 15, 2025
    Authors
    Zi Lin; Zihan Wang; Yongqi Tong; Yangkun Wang; Yuxin Guo; Yujia Wang; Jingbo Shang
    Description

    ToxicChat is a novel benchmark dataset constructed from real user queries to an open-source chatbot. Unlike previous toxicity detection benchmarks, which rely primarily on social media content, ToxicChat captures the rich and nuanced phenomena inherent in real-world user-AI interactions. The dataset reveals significant domain differences compared to social media content, making it a valuable resource for exploring the challenges of toxicity detection in user-AI conversations¹.

    Here are some key details about the ToxicChat dataset:

    Construction: ToxicChat was created using real user queries collected from an open-source chatbot.
    Challenges: It contains phenomena that can be tricky for current toxicity detection models to identify.
    Domain Difference: ToxicChat exhibits a significant domain difference when compared to social media content.
    Purpose: ToxicChat serves as a benchmark to drive advancements in building a safe and healthy environment for user-AI interactions.

    Source: Conversation with Bing, 3/17/2024
    (1) ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real .... https://aclanthology.org/2023.findings-emnlp.311/
    (2) arXiv:2310.17389v1 [cs.CL] 26 Oct 2023. https://arxiv.org/pdf/2310.17389
    (3) README.md · lmsys/toxic-chat at main - Hugging Face. https://huggingface.co/datasets/lmsys/toxic-chat/blob/main/README.md
    (4) The Toxicity Dataset - GitHub. https://github.com/surge-ai/toxicity
    (5) https://aclanthology.org/2023.findings-emnlp.311
    (6) https://aclanthology.org/2023.findings-emnlp.311.pdf

  3. dota-2-toxic-chat-data

    • huggingface.co
    Updated Apr 3, 2024
    Cite
    Daniel Fesalbon (2024). dota-2-toxic-chat-data [Dataset]. https://huggingface.co/datasets/dffesalbon/dota-2-toxic-chat-data
    Dataset updated
    Apr 3, 2024
    Authors
    Daniel Fesalbon
    Description

    dffesalbon/dota-2-toxic-chat-data dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. Toxic-Chat-V2

    • huggingface.co
    Updated Sep 10, 2020
    Cite
    Ian Li (2020). Toxic-Chat-V2 [Dataset]. http://doi.org/10.57967/hf/3749
    Dataset updated
    Sep 10, 2020
    Authors
    Ian Li
    Description

    IanLi233/Toxic-Chat-V2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. toxic_chat_parquet

    • kaggle.com
    Updated Nov 12, 2023
    Cite
    ali waleed (2023). toxic_chat_parquet [Dataset]. https://www.kaggle.com/datasets/alimistro123/toxic-chat-parquet
    Dataset updated
    Nov 12, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    ali waleed
    Description

    Dataset

    This dataset was created by ali waleed

    Contents

  6. toxic-chat

    • huggingface.co
    Updated Nov 12, 2024
    Cite
    akcit ijf (2024). toxic-chat [Dataset]. https://huggingface.co/datasets/akcit-ijf/toxic-chat
    Dataset updated
    Nov 12, 2024
    Dataset authored and provided by
    akcit ijf
    Description

    akcit-ijf/toxic-chat dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. sensai

    • huggingface.co
    Updated Jun 6, 2022
    Cite
    Holodata Archive (2022). sensai [Dataset]. https://huggingface.co/datasets/holodata/sensai
    Dataset updated
    Jun 6, 2022
    Dataset authored and provided by
    Holodata Archive
    Description

    ❤️‍🩹 Sensai: Toxic Chat Dataset

    Sensai is a toxic chat dataset consisting of live chats from Virtual YouTubers' live streams. Download the dataset from Kaggle Datasets and join the #livechat-dataset channel on the holodata Discord for discussions.

      Provenance
    

    Source: YouTube Live Chat events (all streams covered by Holodex, including Hololive, Nijisanji, 774inc, etc.)
    Temporal Coverage: From 2021-01-15T05:15:33Z
    Update Frequency: At least once per month

      Research Ideas… See the full description on the dataset page: https://huggingface.co/datasets/holodata/sensai.
    
  8. Ee_chat Dataset

    • universe.roboflow.com
    zip
    Updated Mar 7, 2025
    Cite
    Eve Echoes (2025). Ee_chat Dataset [Dataset]. https://universe.roboflow.com/eve-echoes/ee_chat/dataset/1
    Dataset updated
    Mar 7, 2025
    Dataset authored and provided by
    Eve Echoes
    License

    Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Message Bounding Boxes
    Description

    Here are a few use cases for this project:

    1. Gaming Communication Management: "EE_Chat" can be used by game developers and gaming companies to understand the communication patterns among players. It can help to analyze in-game messaging, detect toxic behaviour or keywords, manage group interactions, and gather insights on player behaviour.

    2. Online Gaming Experience Improvement: This model can be used to analyze the chatting patterns, popular topics, frequent message times, and the interactions between players. These insights can then be used to improve the chatting function, enhance user experience, and boost overall game engagement.

    3. Chat Support Systems: EE_Chat can be adapted for use in understanding customer service interactions on various platforms. It can be used to categorize messages, analyze response times, and understand the effectiveness of the customer support team.

    4. Interactive Game Streaming: Streamers and content creators can integrate "EE_Chat" into their streaming platforms for live interaction with their audience. The model can help them keep track of messages, prioritize certain types of messages, or filter out unwanted content.

    5. Marketing & Advertising: This model can be used by businesses to analyze discussions in gaming communities. They can understand popular chat channels, the most active times, and trending topics. It can help in identifying huge potential markets, effective advertising spots, and creating targeted marketing campaigns.

  9. toxic-chat-pt

    • huggingface.co
    Updated Jul 8, 2025
    Cite
    toxic-chat-pt [Dataset]. https://huggingface.co/datasets/BRlkl/toxic-chat-pt
    Dataset updated
    Jul 8, 2025
    Authors
    Pedro Ribeiro
    Description

    BRlkl/toxic-chat-pt dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. real-toxicity-prompts

    • huggingface.co
    Cite
    real-toxicity-prompts [Dataset]. https://huggingface.co/datasets/allenai/real-toxicity-prompts
    Dataset provided by
    Allen Institute for AI (http://allenai.org/)
    Authors
    Ai2
    License

    Apache License, v2.0, https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for Real Toxicity Prompts

      Dataset Summary
    

    RealToxicityPrompts is a dataset of 100k sentence snippets from the web for researchers to further address the risk of neural toxic degeneration in models.

      Languages
    

    English

      Dataset Structure
    
    
    
    
    
      Data Instances
    

    Each instance represents a prompt and its metadata: { "filename":"0766186-bc7f2a64cb271f5f56cf6f25570cd9ed.txt", "begin":340, "end":564, "challenging":false… See the full description on the dataset page: https://huggingface.co/datasets/allenai/real-toxicity-prompts.

  11. WildChat-nontoxic

    • huggingface.co
    Updated Nov 14, 2023
    Cite
    WildChat-nontoxic [Dataset]. https://huggingface.co/datasets/allenai/WildChat-nontoxic
    Dataset updated
    Nov 14, 2023
    Dataset provided by
    Allen Institute for AI (http://allenai.org/)
    Authors
    Ai2
    Description

    Dataset Card for WildChat-nontoxic

      Note: a newer version with 1 million conversations and demographic information can be found here.
    
    
    
    
    
      Dataset Description
    

    Paper: https://wenting-zhao.github.io/papers/wildchat.pdf

    License: https://allenai.org/licenses/impact-lr

    Language(s) (NLP): multi-lingual

    Point of Contact: Yuntian Deng

      Dataset Summary
    

    WildChat-nontoxic is the nontoxic subset of the WildChat dataset, a collection of 530K conversations… See the full description on the dataset page: https://huggingface.co/datasets/allenai/WildChat-nontoxic.

  12. toxic_conversations

    • huggingface.co
    Updated Jun 29, 2022
    Cite
    SetFit (2022). toxic_conversations [Dataset]. https://huggingface.co/datasets/SetFit/toxic_conversations
    Dataset updated
    Jun 29, 2022
    Dataset authored and provided by
    SetFit
    Description

    Toxic Conversation

    This is a version of the Jigsaw Unintended Bias in Toxicity Classification dataset. It contains comments from the Civil Comments platform together with annotations indicating whether each comment is toxic. 10 annotators annotated each example and, as recommended in the task page, a comment is set as toxic when target >= 0.5. The dataset is imbalanced, with only about 8% of the comments marked as toxic.
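
    The labeling rule described above can be sketched in a few lines. This is only an illustration, assuming that `target` is an aggregated annotator score between 0 and 1; the function name `binarize_toxicity` and the sample scores are hypothetical and do not come from the dataset or its loader.

```python
def binarize_toxicity(target: float, threshold: float = 0.5) -> int:
    """Return 1 (toxic) when the score reaches the threshold, else 0."""
    return 1 if target >= threshold else 0


# Hypothetical scores, used only to show the rule at and around the boundary.
scores = [0.0, 0.2, 0.5, 0.83]
labels = [binarize_toxicity(s) for s in scores]
print(labels)
```

    Note that a score of exactly 0.5 counts as toxic, because the stated rule is target >= 0.5 rather than a strict inequality.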

  13. test_toxicity_bias

    • huggingface.co
    Cite
    David Lee, test_toxicity_bias [Dataset]. https://huggingface.co/datasets/PerAsperaAd/test_toxicity_bias
    Authors
    David Lee
    Description

    Toxic Chat Dataset

  14. dynamoai-benchmark-safety

    • huggingface.co
    Updated May 8, 2024
    Cite
    Dynamo AI (2024). dynamoai-benchmark-safety [Dataset]. https://huggingface.co/datasets/dynamoai/dynamoai-benchmark-safety
    Dataset updated
    May 8, 2024
    Dataset provided by
    Dynamo AI LLC
    Authors
    Dynamo AI
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0), https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    ⚠️ Warning: this dataset contains examples of toxic, offensive, and inappropriate language. The current LLM safety landscape struggles with accurate real-world benchmarking of content moderation systems. This dataset is a subset of a larger benchmark dataset constructed by the Dynamo AI research team. It consists of real-world chats written by humans with toxic intent. In order to ensure impartiality and standardization, we drew upon open-source and human-annotated examples from Allen… See the full description on the dataset page: https://huggingface.co/datasets/dynamoai/dynamoai-benchmark-safety.

  15. WildChat-1M

    • huggingface.co
    Updated Jul 22, 2024
    Cite
    Ai2 (2024). WildChat-1M [Dataset]. https://huggingface.co/datasets/allenai/WildChat-1M
    Dataset updated
    Jul 22, 2024
    Dataset provided by
    Allen Institute for AI (http://allenai.org/)
    Authors
    Ai2
    License

    https://choosealicense.com/licenses/odc-by/

    Description

    Dataset Card for WildChat

      Dataset Description
    

    Paper: https://arxiv.org/abs/2405.01470

    Interactive Search Tool: https://wildvisualizer.com (paper)

    License: ODC-BY

    Language(s) (NLP): multi-lingual

    Point of Contact: Yuntian Deng

      Dataset Summary
    

    WildChat is a collection of 1 million conversations between human users and ChatGPT, alongside demographic data, including state, country, hashed IP addresses, and request headers. We collected WildChat by… See the full description on the dataset page: https://huggingface.co/datasets/allenai/WildChat-1M.

  16. NLU-Toxicity-Detection

    • huggingface.co
    Updated Dec 19, 2024
    Cite
    NLU-Toxicity-Detection [Dataset]. https://huggingface.co/datasets/aisingapore/NLU-Toxicity-Detection
    Dataset updated
    Dec 19, 2024
    Dataset authored and provided by
    AI Singapore
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0), https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    SEA Toxicity Detection

    SEA Toxicity Detection evaluates a model's ability to identify toxic content such as hate speech and abusive language in text. It is sampled from MLHSD for Indonesian, TTD for Thai, and ViHSD for Vietnamese.

      Supported Tasks and Leaderboards
    

    SEA Toxicity Detection is designed for evaluating chat or instruction-tuned large language models (LLMs). It is part of the SEA-HELM leaderboard from AI Singapore.

      Languages
    

    Indonesian (id) Thai… See the full description on the dataset page: https://huggingface.co/datasets/aisingapore/NLU-Toxicity-Detection.

  17. WildChat

    • huggingface.co
    Updated Jul 23, 2024
    Cite
    Ai2 (2024). WildChat [Dataset]. https://huggingface.co/datasets/allenai/WildChat
    Dataset updated
    Jul 23, 2024
    Dataset provided by
    Allen Institute for AI (http://allenai.org/)
    Authors
    Ai2
    License

    https://choosealicense.com/licenses/odc-by/

    Description

    Dataset Card for WildChat

      Note: a newer version with 1 million conversations and demographic information can be found here.
    
    
    
    
    
      Dataset Description
    

    Paper: https://arxiv.org/abs/2405.01470

    Interactive Search Tool: https://wildvisualizer.com (paper)

    License: ODC-BY

    Language(s) (NLP): multi-lingual

    Point of Contact: Yuntian Deng

      Dataset Summary
    

    WildChat is a collection of 650K conversations between human users and ChatGPT. We collected WildChat… See the full description on the dataset page: https://huggingface.co/datasets/allenai/WildChat.

  18. chatbot_arena_conversations

    • huggingface.co
    Updated Jul 18, 2023
    Cite
    Large Model Systems Organization (2023). chatbot_arena_conversations [Dataset]. https://huggingface.co/datasets/lmsys/chatbot_arena_conversations
    Dataset updated
    Jul 18, 2023
    Dataset authored and provided by
    Large Model Systems Organization
    License

    https://choosealicense.com/licenses/cc/

    Description

    Chatbot Arena Conversations Dataset

    This dataset contains 33K cleaned conversations with pairwise human preferences. It is collected from 13K unique IP addresses on the Chatbot Arena from April to June 2023. Each sample includes a question ID, two model names, their full conversation text in OpenAI API JSON format, the user vote, the anonymized user ID, the detected language tag, the OpenAI moderation API tag, the additional toxic tag, and the timestamp. To ensure the safe release… See the full description on the dataset page: https://huggingface.co/datasets/lmsys/chatbot_arena_conversations.

  19. indic-align

    • huggingface.co
    Updated Mar 14, 2024
    Cite
    AI4Bharat (2024). indic-align [Dataset]. https://huggingface.co/datasets/ai4bharat/indic-align
    Dataset updated
    Mar 14, 2024
    Dataset authored and provided by
    AI4Bharat
    License

    Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IndicAlign

    A diverse collection of Instruction and Toxic alignment datasets for 14 Indic Languages. The collection comprises:

    IndicAlign - Instruct: Indic-ShareLlama, Dolly-T, OpenAssistant-T, WikiHow, IndoWordNet, Anudesh, Wiki-Conv, Wiki-Chat

    IndicAlign - Toxic: HHRLHF-T, Toxic-Matrix

    We use IndicTrans2 (Gala et al., 2023) for the translation of the datasets. We recommend that readers check out our paper on arXiv for detailed information on the curation process of these… See the full description on the dataset page: https://huggingface.co/datasets/ai4bharat/indic-align.

  20. prosocial-dialog

    • huggingface.co
    • opendatalab.com
    Updated Feb 22, 2023
    Cite
    Ai2 (2023). prosocial-dialog [Dataset]. https://huggingface.co/datasets/allenai/prosocial-dialog
    Dataset updated
    Feb 22, 2023
    Dataset provided by
    Allen Institute for AI (http://allenai.org/)
    Authors
    Ai2
    License

    Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for ProsocialDialog Dataset

      Dataset Summary
    

    ProsocialDialog is the first large-scale multi-turn English dialogue dataset to teach conversational agents to respond to problematic content following social norms. Covering diverse unethical, problematic, biased, and toxic situations, ProsocialDialog contains responses that encourage prosocial behavior, grounded in commonsense social rules (i.e., rules-of-thumb, RoTs). Created via a human-AI collaborative… See the full description on the dataset page: https://huggingface.co/datasets/allenai/prosocial-dialog.


lmsys/toxic-chat

245 scholarly articles cite this dataset
