10 datasets found
  1. h

    RAGTruth-processed

    • huggingface.co
    Updated Feb 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Weights and Biases (2022). RAGTruth-processed [Dataset]. https://huggingface.co/datasets/wandb/RAGTruth-processed
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 7, 2022
    Dataset authored and provided by
    Weights and Biases
    Description

    RAGTruth Dataset

      Dataset Description
    
    
    
    
    
      Dataset Summary
    

    The RAGTruth dataset is designed for evaluating hallucinations in text generation models, particularly in retrieval-augmented generation (RAG) contexts. It contains examples of model outputs along with expert annotations indicating whether the outputs contain hallucinations.

      Dataset Structure
    

    Each example contains:

    A query/question Context passages Model output Hallucination labels (evident… See the full description on the dataset page: https://huggingface.co/datasets/wandb/RAGTruth-processed.

  2. h

    RAGTruth_test

    • huggingface.co
    Updated Sep 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Flow AI (2024). RAGTruth_test [Dataset]. https://huggingface.co/datasets/flowaicom/RAGTruth_test
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 13, 2024
    Dataset authored and provided by
    Flow AI
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    RAGTruth test set

      Dataset
    

    Test split of RAGTruth dataset by ParticleMedia available from https://github.com/ParticleMedia/RAGTruth/tree/main/dataset The dataset was published in RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models

      Preprocessing
    

    We kept only the test split of the original dataset Joined response and source info files Created the response level hallucination labels as described in the paper using binary… See the full description on the dataset page: https://huggingface.co/datasets/flowaicom/RAGTruth_test.

  3. h

    RAGTruth

    • huggingface.co
    Updated Jan 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nimit Kalra (2025). RAGTruth [Dataset]. https://huggingface.co/datasets/nimitkalra/RAGTruth
    Explore at:
    Dataset updated
    Jan 20, 2025
    Authors
    Nimit Kalra
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    nimitkalra/RAGTruth dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    ragtruth-de-translated

    • huggingface.co
    Updated May 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KR Labs (2025). ragtruth-de-translated [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-de-translated
    Explore at:
    Dataset updated
    May 18, 2025
    Dataset authored and provided by
    KR Labs
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The dataset is created from the RAGTruth dataset by translating it to German. We've used Mistral Small 3.1 for the translation. The translation was done on a single A100 machine using VLLM as a server.

  5. h

    ragtruth-it-translated

    • huggingface.co
    Updated May 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KR Labs (2025). ragtruth-it-translated [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-it-translated
    Explore at:
    Dataset updated
    May 18, 2025
    Dataset authored and provided by
    KR Labs
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The dataset is created from the RAGTruth dataset by translating it to Italian. We've used Gemma 3 27B for the translation. The translation was done on a single A100 machine using VLLM as a server.

  6. h

    ragtruth-hu-translated

    • huggingface.co
    Updated May 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KR Labs (2025). ragtruth-hu-translated [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-hu-translated
    Explore at:
    Dataset updated
    May 18, 2025
    Dataset authored and provided by
    KR Labs
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The dataset is created from the RAGTruth dataset by translating it to Hungarian. We've used Gemma 3 27B for the translation. The translation was done on a single A100 machine using VLLM as a server.

  7. h

    ragtruth-cn-translated

    • huggingface.co
    Updated May 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KR Labs (2025). ragtruth-cn-translated [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-cn-translated
    Explore at:
    Dataset updated
    May 18, 2025
    Dataset authored and provided by
    KR Labs
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The dataset is created from the RAGTruth dataset by translating it to Chinese. We've used Gemma 3 27B for the translation. The translation was done on a single A100 machine using VLLM as a server.

  8. h

    ragtruth-de-translated-manual-300

    • huggingface.co
    Updated May 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KR Labs (2025). ragtruth-de-translated-manual-300 [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-de-translated-manual-300
    Explore at:
    Dataset updated
    May 18, 2025
    Dataset authored and provided by
    KR Labs
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    KRLabsOrg/ragtruth-de-translated-manual-300 dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    ragtruth-qa-ko

    • huggingface.co
    Updated Sep 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    yettiesoft (2024). ragtruth-qa-ko [Dataset]. https://huggingface.co/datasets/Yettiesoft/ragtruth-qa-ko
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 12, 2024
    Dataset authored and provided by
    yettiesoft
    Description

    Dataset Card for Dataset Name

    ragtruth-qa 데이터셋을 gpt-4o를 이용하여 한글로 번역 한 데이터셋.

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    Curated by: [More Information Needed] Language(s) (NLP): [한국어] License: [미정]

      Dataset Sources [optional]
    

    Repository: [https://huggingface.co/datasets/flowaicom/formatted-ragtruth-qa]

      Uses
    
    
    
    
    
    
    
      Direct Use
    

    [More Information Needed]

      Out-of-Scope Use
    

    [More Information Needed]

      Dataset Structure… See the full description on the dataset page: https://huggingface.co/datasets/Yettiesoft/ragtruth-qa-ko.
    
  10. h

    LLM-AggreFact

    • huggingface.co
    Updated May 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Liyan Tang (2025). LLM-AggreFact [Dataset]. https://huggingface.co/datasets/lytang/LLM-AggreFact
    Explore at:
    Dataset updated
    May 17, 2025
    Authors
    Liyan Tang
    License

    Attribution-NoDerivs 4.0 (CC BY-ND 4.0)https://creativecommons.org/licenses/by-nd/4.0/
    License information was derived automatically

    Description

    Important Update 08.09.2024

    We announce the LLM-AggreFact leaderboard with 35 latest fact-checking models being evaluated.

    We include one additional dataset RAGTruth to our benchmark. We convert the dataset to the same format as in our benchmark and removed those non-checkworthy claims. We include a randomly sampled subset of the training set from RAGTruth into the validation set of the benchmark since the original training set is too large after conversion.… See the full description on the dataset page: https://huggingface.co/datasets/lytang/LLM-AggreFact.

  11. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Weights and Biases (2022). RAGTruth-processed [Dataset]. https://huggingface.co/datasets/wandb/RAGTruth-processed

RAGTruth-processed

wandb/RAGTruth-processed

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 7, 2022
Dataset authored and provided by
Weights and Biases
Description

RAGTruth Dataset

  Dataset Description





  Dataset Summary

The RAGTruth dataset is designed for evaluating hallucinations in text generation models, particularly in retrieval-augmented generation (RAG) contexts. It contains examples of model outputs along with expert annotations indicating whether the outputs contain hallucinations.

  Dataset Structure

Each example contains:

A query/question Context passages Model output Hallucination labels (evident… See the full description on the dataset page: https://huggingface.co/datasets/wandb/RAGTruth-processed.

Search
Clear search
Close search
Google apps
Main menu