10 datasets found

RAGTruth-processed
huggingface.co
Updated Feb 7, 2022
Cite
Weights and Biases (2022). RAGTruth-processed [Dataset]. https://huggingface.co/datasets/wandb/RAGTruth-processed
Dataset updated
Feb 7, 2022
Dataset authored and provided by
Weights and Biases
Description
RAGTruth Dataset

Dataset Description

Dataset Summary

The RAGTruth dataset is designed for evaluating hallucinations in text generation models, particularly in retrieval-augmented generation (RAG) contexts. It contains examples of model outputs along with expert annotations indicating whether the outputs contain hallucinations.

Dataset Structure

Each example contains:

A query/question
Context passages
Model output
Hallucination labels (evident…

See the full description on the dataset page: https://huggingface.co/datasets/wandb/RAGTruth-processed.
RAGTruth_test
huggingface.co
Updated Sep 13, 2024
Cite
Flow AI (2024). RAGTruth_test [Dataset]. https://huggingface.co/datasets/flowaicom/RAGTruth_test
Dataset updated
Sep 13, 2024
Dataset authored and provided by
Flow AI
License
MIT License, https://opensource.org/licenses/MIT
License information was derived automatically
Description
RAGTruth test set

Dataset

Test split of the RAGTruth dataset by ParticleMedia, available from https://github.com/ParticleMedia/RAGTruth/tree/main/dataset. The dataset was published in "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models".

Preprocessing

We kept only the test split of the original dataset
Joined response and source info files
Created the response level hallucination labels as described in the paper using binary…

See the full description on the dataset page: https://huggingface.co/datasets/flowaicom/RAGTruth_test.
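
The preprocessing steps listed above can be sketched roughly as follows. This is a minimal illustration, not Flow AI's actual script: the field names (`source_id`, `split`, `labels`), the toy in-memory records, the helper name `build_test_rows`, and the rule that a response counts as hallucinated when it carries at least one annotated span are all assumptions made for the example.

```python
from typing import Any


def build_test_rows(
    responses: list[dict[str, Any]],
    sources: list[dict[str, Any]],
) -> list[dict[str, Any]]:
    """Keep test responses, join each with its source, add a binary label."""
    # Index source records by their (assumed) join key.
    source_by_id = {src["source_id"]: src for src in sources}

    rows = []
    for resp in responses:
        # Step 1: keep only the test split.
        if resp.get("split") != "test":
            continue
        # Step 2: join the response with its source info.
        row = {**source_by_id[resp["source_id"]], **resp}
        # Step 3 (assumed rule): 1 if any span was annotated, else 0.
        row["hallucinated"] = int(len(resp.get("labels", [])) > 0)
        rows.append(row)
    return rows


# Hypothetical toy records, not taken from the dataset.
responses = [
    {"id": "r1", "source_id": "s1", "split": "test", "response": "...", "labels": []},
    {"id": "r2", "source_id": "s1", "split": "test", "response": "...",
     "labels": [{"start": 0, "end": 5, "label_type": "example"}]},
    {"id": "r3", "source_id": "s2", "split": "train", "response": "...", "labels": []},
]
sources = [
    {"source_id": "s1", "task_type": "QA", "prompt": "..."},
    {"source_id": "s2", "task_type": "Summary", "prompt": "..."},
]

rows = build_test_rows(responses, sources)
```

Collapsing span annotations into a single 0/1 flag per response is only one plausible reading of "response level" labels; the dataset page and the paper remain the reference for the exact rule.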
RAGTruth
huggingface.co
Updated Jan 20, 2025
Cite
Nimit Kalra (2025). RAGTruth [Dataset]. https://huggingface.co/datasets/nimitkalra/RAGTruth
Dataset updated
Jan 20, 2025
Authors
Nimit Kalra
License
MIT License, https://opensource.org/licenses/MIT
License information was derived automatically
Description
nimitkalra/RAGTruth dataset hosted on Hugging Face and contributed by the HF Datasets community
ragtruth-de-translated
huggingface.co
Updated May 18, 2025
Cite
KR Labs (2025). ragtruth-de-translated [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-de-translated
Dataset updated
May 18, 2025
Dataset authored and provided by
KR Labs
License
MIT License, https://opensource.org/licenses/MIT
License information was derived automatically
Description
The dataset was created by translating the RAGTruth dataset into German, using Mistral Small 3.1 for the translation. The translation ran on a single A100 machine, with vLLM as the inference server.
ragtruth-it-translated
huggingface.co
Updated May 18, 2025
Cite
KR Labs (2025). ragtruth-it-translated [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-it-translated
Dataset updated
May 18, 2025
Dataset authored and provided by
KR Labs
License
MIT License, https://opensource.org/licenses/MIT
License information was derived automatically
Description
The dataset was created by translating the RAGTruth dataset into Italian, using Gemma 3 27B for the translation. The translation ran on a single A100 machine, with vLLM as the inference server.
ragtruth-hu-translated
huggingface.co
Updated May 18, 2025
Cite
KR Labs (2025). ragtruth-hu-translated [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-hu-translated
Dataset updated
May 18, 2025
Dataset authored and provided by
KR Labs
License
MIT License, https://opensource.org/licenses/MIT
License information was derived automatically
Description
The dataset was created by translating the RAGTruth dataset into Hungarian, using Gemma 3 27B for the translation. The translation ran on a single A100 machine, with vLLM as the inference server.
ragtruth-cn-translated
huggingface.co
Updated May 18, 2025
Cite
KR Labs (2025). ragtruth-cn-translated [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-cn-translated
Dataset updated
May 18, 2025
Dataset authored and provided by
KR Labs
License
MIT License, https://opensource.org/licenses/MIT
License information was derived automatically
Description
The dataset was created by translating the RAGTruth dataset into Chinese, using Gemma 3 27B for the translation. The translation ran on a single A100 machine, with vLLM as the inference server.
ragtruth-de-translated-manual-300
huggingface.co
Updated May 18, 2025
Cite
KR Labs (2025). ragtruth-de-translated-manual-300 [Dataset]. https://huggingface.co/datasets/KRLabsOrg/ragtruth-de-translated-manual-300
Dataset updated
May 18, 2025
Dataset authored and provided by
KR Labs
License
MIT License, https://opensource.org/licenses/MIT
License information was derived automatically
Description
KRLabsOrg/ragtruth-de-translated-manual-300 dataset hosted on Hugging Face and contributed by the HF Datasets community
ragtruth-qa-ko
huggingface.co
Updated Sep 12, 2024
Cite
yettiesoft (2024). ragtruth-qa-ko [Dataset]. https://huggingface.co/datasets/Yettiesoft/ragtruth-qa-ko
Dataset updated
Sep 12, 2024
Dataset authored and provided by
yettiesoft
Description
Dataset Card for Dataset Name

The ragtruth-qa dataset, translated into Korean using gpt-4o.

Dataset Details

Dataset Description

Curated by: [More Information Needed]
Language(s) (NLP): [Korean]
License: [Undecided]

Dataset Sources [optional]

Repository: [https://huggingface.co/datasets/flowaicom/formatted-ragtruth-qa]

Uses

Direct Use

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Dataset Structure… See the full description on the dataset page: https://huggingface.co/datasets/Yettiesoft/ragtruth-qa-ko.
LLM-AggreFact
huggingface.co
Updated May 17, 2025
Cite
Liyan Tang (2025). LLM-AggreFact [Dataset]. https://huggingface.co/datasets/lytang/LLM-AggreFact
Dataset updated
May 17, 2025
Authors
Liyan Tang
License
Attribution-NoDerivs 4.0 (CC BY-ND 4.0), https://creativecommons.org/licenses/by-nd/4.0/
License information was derived automatically
Description
Important Update 08.09.2024

We announce the LLM-AggreFact leaderboard, with 35 of the latest fact-checking models evaluated.

We added one additional dataset, RAGTruth, to our benchmark. We converted the dataset to the same format as the rest of our benchmark and removed the non-checkworthy claims. Because the original RAGTruth training set is too large after conversion, we include a randomly sampled subset of it in the validation set of the benchmark.… See the full description on the dataset page: https://huggingface.co/datasets/lytang/LLM-AggreFact.