100+ datasets found
  1. malicious-llm-prompts

    • huggingface.co
    Updated Mar 9, 2009
    Cite
    Sagar Patel (2009). malicious-llm-prompts [Dataset]. https://huggingface.co/datasets/codesagar/malicious-llm-prompts
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 9, 2009
    Authors
    Sagar Patel
    Description

    Dataset Card for "malicious-llm-prompts"

    More Information needed

  2. llm-system-prompts-benchmark

    • huggingface.co
    Updated Feb 13, 2024
    Cite
    Naomi Bashkansky (2024). llm-system-prompts-benchmark [Dataset]. https://huggingface.co/datasets/Naomibas/llm-system-prompts-benchmark
    Dataset updated
    Feb 13, 2024
    Authors
    Naomi Bashkansky
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset Card for Dataset Name

    This dataset is a collection of 100 system prompts for large language models.

      Dataset Details

      Dataset Description

    These 100 system prompts test a model's ability to follow grammatical patterns; answer basic multiple choice questions; act according to a particular persona; memorize information; and speak in French. Files:

    hundred_system_prompts.py: refer to this to see the (prompt, probe, function) triplets, as well as the… See the full description on the dataset page: https://huggingface.co/datasets/Naomibas/llm-system-prompts-benchmark.

  3. synthetic_multilingual_llm_prompts

    • huggingface.co
    Updated Jun 11, 2024
    Cite
    Gretel.ai (2024). synthetic_multilingual_llm_prompts [Dataset]. https://huggingface.co/datasets/gretelai/synthetic_multilingual_llm_prompts
    Dataset updated
    Jun 11, 2024
    Dataset provided by
    Gretel.ai
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

      📝🌐 Synthetic Multilingual LLM Prompts
    

    Welcome to the "Synthetic Multilingual LLM Prompts" dataset! This comprehensive collection features 1,250 synthetic LLM prompts generated using Gretel Navigator, available in seven different languages. To ensure accuracy and diversity in prompts, and translation quality and consistency across the different languages, we employed Gretel Navigator both as a generation tool and as an… See the full description on the dataset page: https://huggingface.co/datasets/gretelai/synthetic_multilingual_llm_prompts.

  4. paper-llm-prompts

    • huggingface.co
    Updated Jul 3, 2023
    Cite
    B F (2023). paper-llm-prompts [Dataset]. https://huggingface.co/datasets/beephids/paper-llm-prompts
    Dataset updated
    Jul 3, 2023
    Authors
    B F
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    beephids/paper-llm-prompts dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. System-Prompt-Library-030825

    • huggingface.co
    Updated Oct 17, 2025
    Cite
    Daniel Rosehill (2025). System-Prompt-Library-030825 [Dataset]. http://doi.org/10.57967/hf/6319
    Dataset updated
    Oct 17, 2025
    Authors
    Daniel Rosehill
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    System Prompts Dataset - August 2025

    Point-in-time export from Daniel Rosehill's system prompt library as of August 3rd, 2025

      Overview
    

    This repository contains a comprehensive collection of 944 system prompts designed for various AI applications, agent workflows, and conversational AI systems. While many of these prompts now serve as the foundation for more complex agent-based workflows, they continue to provide essential building blocks for AI system design and… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/System-Prompt-Library-030825.

  6. LLM_Math

    • huggingface.co
    Updated Feb 19, 2023
    Cite
    LangChain Hub Prompts (2023). LLM_Math [Dataset]. https://huggingface.co/datasets/LangChainHub-Prompts/LLM_Math
    Dataset updated
    Feb 19, 2023
    Dataset authored and provided by
    LangChain Hub Prompts
    Description

    Description of LLM Math

    Prompt designed to optionally output iPython syntax to be run in order to better answer math questions.

      Inputs
    

    This is a description of the inputs that the prompt expects.

    question: User question to be answered.

      Usage
    

    Below is a code snippet for how to use the prompt.

    from langchain.prompts import load_prompt
    from langchain.chains import LLMMathChain

    llm = ...
    prompt = load_prompt('lc://prompts/llm_math/

  7. LLM_Bash

    • huggingface.co
    Updated Feb 16, 2023
    Cite
    LangChain Hub Prompts (2023). LLM_Bash [Dataset]. https://huggingface.co/datasets/LangChainHub-Prompts/LLM_Bash
    Dataset updated
    Feb 16, 2023
    Dataset authored and provided by
    LangChain Hub Prompts
    Description

    Description of LLM Bash

    Prompt designed to convert natural language to a bash command.

      Inputs
    

    This is a description of the inputs that the prompt expects.

    question: User question to be answered by writing a bash command.

      Usage
    

    Below is a code snippet for how to use the prompt.

    from langchain.prompts import load_prompt
    from langchain.chains import LLMBashChain

    llm = ...
    prompt = load_prompt('lc://prompts/llm_bash/

  8. llm-prompt-injection-attacks

    • huggingface.co
    Cite
    Aryan Sinha, llm-prompt-injection-attacks [Dataset]. https://huggingface.co/datasets/Smooth-3/llm-prompt-injection-attacks
    Authors
    Aryan Sinha
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Prompt Injection Mechanisms Dataset

      Overview
    

    A 55,000-sample multi-label dataset for prompt injection detection in large language models.

      Labels
    
    • BENIGN
    • JAILBREAK
    • INSTRUCTION_OVERRIDE
    • ROLE_HIJACK
    • DATA_EXFILTRATION

      Format
      

    The dataset is provided in Apache Parquet format with train/validation splits.

      Construction
    

    The dataset was created by merging multiple public prompt-injection datasets and re-annotating them using a… See the full description on the dataset page: https://huggingface.co/datasets/Smooth-3/llm-prompt-injection-attacks.

  9. llm-jailbreak-prompt-injection-dataset

    • huggingface.co
    Updated Apr 8, 2026
    Cite
    Boris Matsakov (2026). llm-jailbreak-prompt-injection-dataset [Dataset]. https://huggingface.co/datasets/Necent/llm-jailbreak-prompt-injection-dataset
    Dataset updated
    Apr 8, 2026
    Authors
    Boris Matsakov
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    LLM Jailbreak & Prompt-Injection Dataset

    A unified safety dataset combining 30+ public sources for training LLM guardrails, content moderation classifiers, and response-safety filters.

      Schema (orthogonal multi-label, WildGuard-style)
    

    Instead of a single binary is_dangerous, every example carries four orthogonal labels matching the structure used by AI2 WildGuard, IBM Granite Guardian, and Azure Prompt Shields:

    Column Type Description

    prompt str The user/attack… See the full description on the dataset page: https://huggingface.co/datasets/Necent/llm-jailbreak-prompt-injection-dataset.

  10. dark-triad-llm-prompts

    • huggingface.co
    Cite
    Lu zeyi, dark-triad-llm-prompts [Dataset]. https://huggingface.co/datasets/lucerne04/dark-triad-llm-prompts
    Authors
    Lu zeyi
    Description

    Dark Triad LLM Prompts Dataset

    Version: 1.0.0. License: CC BY 4.0

      Dataset Description
    

    This dataset contains 192 user prompts designed to systematically evaluate how Large Language Models respond to descriptions of problematic behaviors reflecting Dark Triad personality traits. Unlike traditional safety benchmarks that focus on harmful requests, this dataset evaluates interactional safety—how models respond when users describe rather than request negative behaviors.… See the full description on the dataset page: https://huggingface.co/datasets/lucerne04/dark-triad-llm-prompts.

  11. Official_LLM_System_Prompts

    • huggingface.co
    Updated Mar 13, 2026
    Cite
    Nymbo (2026). Official_LLM_System_Prompts [Dataset]. https://huggingface.co/datasets/Nymbo/Official_LLM_System_Prompts
    Dataset updated
    Mar 13, 2026
    Authors
    Nymbo
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Official LLM System Prompts

    This short dataset contains a few system prompts leaked from proprietary models. It contains date-stamped prompts from OpenAI, Anthropic, MS Copilot, GitHub Copilot, Grok, and Perplexity.

  12. llmail-inject-challenge

    • huggingface.co
    Cite
    Microsoft, llmail-inject-challenge [Dataset]. https://huggingface.co/datasets/microsoft/llmail-inject-challenge
    Dataset authored and provided by
    Microsoft (http://microsoft.com/)
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Dataset Summary

    This dataset contains a large number of attack prompts collected as part of the now-closed LLMail-Inject: Adaptive Prompt Injection Challenge. We first describe the details of the challenge, and then we provide documentation of the dataset. For the accompanying code, check out: https://github.com/microsoft/llmail-inject-challenge.

      Citation
    

    @article{abdelnabi2025, title = {LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection… See the full description on the dataset page: https://huggingface.co/datasets/microsoft/llmail-inject-challenge.

  13. JailbreakPrompts

    • huggingface.co
    Updated Jun 7, 2025
    Cite
    Simon Knuts (2025). JailbreakPrompts [Dataset]. https://huggingface.co/datasets/Simsonsun/JailbreakPrompts
    Dataset updated
    Jun 7, 2025
    Authors
    Simon Knuts
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Independent Jailbreak Datasets for LLM Guardrail Evaluation

    Constructed for the thesis: “Contamination Effects: How Training Data Leakage Affects Red Team Evaluation of LLM Jailbreak Detection”. The effectiveness of LLM guardrails is commonly evaluated using open-source red teaming tools. However, this study reveals that significant data contamination exists between the training sets of binary jailbreak classifiers (ProtectAI, Katanemo, TestSavantAI, etc.) and the test prompts used in… See the full description on the dataset page: https://huggingface.co/datasets/Simsonsun/JailbreakPrompts.

  14. harmful-dataset

    • huggingface.co
    Cite
    LLM Latent Adversarial Training, harmful-dataset [Dataset]. https://huggingface.co/datasets/LLM-LAT/harmful-dataset
    Dataset authored and provided by
    LLM Latent Adversarial Training
    Description

    LLM-LAT/harmful-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. real-toxicity-prompts

    • huggingface.co
    Updated Sep 24, 2020
    Cite
    Ai2 (2020). real-toxicity-prompts [Dataset]. http://doi.org/10.57967/hf/0002
    Dataset updated
    Sep 24, 2020
    Dataset provided by
    Allen Institute for AI (http://allenai.org/)
    Authors
    Ai2
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Dataset Card for Real Toxicity Prompts

      Dataset Summary
    

    RealToxicityPrompts is a dataset of 100k sentence snippets from the web for researchers to further address the risk of neural toxic degeneration in models.

      Languages
    

    English

      Dataset Structure

      Data Instances

    Each instance represents a prompt and its metadata: { "filename":"0766186-bc7f2a64cb271f5f56cf6f25570cd9ed.txt", "begin":340, "end":564, "challenging":false… See the full description on the dataset page: https://huggingface.co/datasets/allenai/real-toxicity-prompts.

  16. prompt-injections

    • huggingface.co
    Updated May 4, 2025
    Cite
    HSE LLM @ Saint Petersburg (2025). prompt-injections [Dataset]. https://huggingface.co/datasets/hse-llm/prompt-injections
    Dataset updated
    May 4, 2025
    Dataset authored and provided by
    HSE LLM @ Saint Petersburg
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    hse-llm/prompt-injections dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. OpenEndedLLMPrompts

    • huggingface.co
    Updated Jul 6, 2024
    Cite
    Shreyan (2024). OpenEndedLLMPrompts [Dataset]. https://huggingface.co/datasets/shreyanmitra/OpenEndedLLMPrompts
    Dataset updated
    Jul 6, 2024
    Authors
    Shreyan
    Description

    Dataset Card for OpenEndedLLMPrompts

    A cleaned and consolidated set of questions (without context) and answers for LLM hallucination detection. Each question-answer pair is not the work of the author, but was selected from OpenAssistant/oasst2. If you use any of the data provided, please cite this source in addition to the following paper: Shreyan Mitra and Leilani Gilpin. Detecting LLM Hallucinations Pre-generation (paper pending). The original dataset was provided in a tree… See the full description on the dataset page: https://huggingface.co/datasets/shreyanmitra/OpenEndedLLMPrompts.

  18. llm-aes-writing-prompts-deduplicated-0.9-similarity

    • huggingface.co
    Updated Jan 13, 2026
    Cite
    Jim Lai (2026). llm-aes-writing-prompts-deduplicated-0.9-similarity [Dataset]. https://huggingface.co/datasets/grimjim/llm-aes-writing-prompts-deduplicated-0.9-similarity
    Dataset updated
    Jan 13, 2026
    Authors
    Jim Lai
    Description

    grimjim/llm-aes-writing-prompts-deduplicated-0.9-similarity dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. gemma-llm-prompt-recovery

    • huggingface.co
    Cite
    Sambhav Dixit, gemma-llm-prompt-recovery [Dataset]. https://huggingface.co/datasets/Sambhavnoobcoder/gemma-llm-prompt-recovery
    Authors
    Sambhav Dixit
    Description

    Sambhavnoobcoder/gemma-llm-prompt-recovery dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. eu-cyber-llm-benchmark-prompts

    • huggingface.co
    Updated Mar 29, 2026
    Cite
    Eric Romang (2026). eu-cyber-llm-benchmark-prompts [Dataset]. https://huggingface.co/datasets/eromang/eu-cyber-llm-benchmark-prompts
    Dataset updated
    Mar 29, 2026
    Authors
    Eric Romang
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    EU Cyber Threat Landscape LLM Benchmark — Prompts

    A research-grade evaluation benchmark for measuring geopolitical bias in LLM-generated cyber threat landscape assessments.

      What this is
    

    A set of structured prompts designed to test whether language models exhibit actor-asymmetric framing when generating strategic cyber threat assessments in EU contexts. Each prompt describes a cyber incident in a specific critical infrastructure sector, paired with an attribution condition… See the full description on the dataset page: https://huggingface.co/datasets/eromang/eu-cyber-llm-benchmark-prompts.

malicious-llm-prompts

codesagar/malicious-llm-prompts

6 scholarly articles cite this dataset.