17 datasets found
  1. ragbench

    • huggingface.co
    Updated Jun 8, 2024
    Galileo (2025). ragbench [Dataset]. https://huggingface.co/datasets/galileo-ai/ragbench
    Dataset updated
    Jun 8, 2024
    Dataset authored and provided by
    Galileo
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    RAGBench

      Dataset Overview
    

    RAGBench is a large-scale RAG benchmark dataset of 100k RAG examples. It covers five unique industry-specific domains and various RAG task types. RAGBench examples are sourced from industry corpora such as user manuals, making it particularly relevant for industry applications. RAGBench comprises 12 sub-component datasets, each with train/validation/test splits.

      Usage
    

    from datasets import load_dataset

    load… See the full description on the dataset page: https://huggingface.co/datasets/galileo-ai/ragbench.
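
The usage snippet in this listing is cut off after the import. As a rough sketch only, the sub-component datasets could be loaded one by one with `load_dataset` from the Hugging Face `datasets` library. The helper name `load_subsets`, its `loader` parameter, and the subset name in the comment are assumptions made for illustration and do not come from the dataset page.

```python
from typing import Callable, Dict, Iterable, Optional

# Repository id as given in the citation for this entry.
REPO_ID = "galileo-ai/ragbench"


def load_subsets(
    subset_names: Iterable[str],
    split: str = "test",
    loader: Optional[Callable[..., object]] = None,
) -> Dict[str, object]:
    """Load one split of each named subset and return the results keyed by subset name.

    `load_subsets` is a hypothetical helper, not part of RAGBench or `datasets`.
    """
    if loader is None:
        # Imported lazily so the helper can be tried out with a stand-in loader
        # when the `datasets` library is not installed.
        from datasets import load_dataset
        loader = load_dataset
    # load_dataset(path, name, split=...) selects one configuration and one split.
    return {name: loader(REPO_ID, name, split=split) for name in subset_names}


# Example call (needs network access; "covidqa" is an assumed subset name,
# check the dataset page for the actual list of 12 subsets):
# data = load_subsets(["covidqa"], split="train")
```

Passing the loader as a parameter keeps the sketch easy to dry-run; in normal use the default `load_dataset` is picked up automatically.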

  2. ragbench-dual-clf-preprocessed

    • huggingface.co
    Updated Dec 15, 2024
    Ramanathan (2024). ragbench-dual-clf-preprocessed [Dataset]. https://huggingface.co/datasets/param-bharat/ragbench-dual-clf-preprocessed
    Dataset updated
    Dec 15, 2024
    Authors
    Ramanathan
    Description

    param-bharat/ragbench-dual-clf-preprocessed dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. mbpp

    • huggingface.co
    Updated Jun 2, 2024
    CodeRAG-Bench (2024). mbpp [Dataset]. https://huggingface.co/datasets/code-rag-bench/mbpp
    Dataset updated
    Jun 2, 2024
    Dataset authored and provided by
    CodeRAG-Bench
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    MBPP dataset annotated with ground-truth programming solutions, to enable evaluations for retrieval and retrieval-augmented code generation. Please refer to code-rag-bench for more details.

  4. stackoverflow-posts

    • huggingface.co
    Updated Jun 2, 2024
    CodeRAG-Bench (2024). stackoverflow-posts [Dataset]. https://huggingface.co/datasets/code-rag-bench/stackoverflow-posts
    Dataset updated
    Jun 2, 2024
    Dataset authored and provided by
    CodeRAG-Bench
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    The StackOverflow posts retrieval source for code-rag-bench.

  5. ds1000

    • huggingface.co
    Updated Jun 2, 2024
    CodeRAG-Bench (2024). ds1000 [Dataset]. https://huggingface.co/datasets/code-rag-bench/ds1000
    Dataset updated
    Jun 2, 2024
    Dataset authored and provided by
    CodeRAG-Bench
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    DS-1000 dataset annotated with the ground-truth library documentation, to enable evaluations for retrieval and retrieval-augmented code generation. Please refer to [code-rag-bench] for more details.

  6. Data from: odex

    • huggingface.co
    Updated Jun 2, 2024
    CodeRAG-Bench (2024). odex [Dataset]. https://huggingface.co/datasets/code-rag-bench/odex
    Dataset updated
    Jun 2, 2024
    Dataset authored and provided by
    CodeRAG-Bench
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    ODEX dataset annotated with the ground-truth library documentation, to enable evaluations for retrieval and retrieval-augmented code generation. Please refer to [code-rag-bench] for more details.

  7. github-repos

    • huggingface.co
    Updated Jun 2, 2024
    CodeRAG-Bench (2024). github-repos [Dataset]. https://huggingface.co/datasets/code-rag-bench/github-repos
    Dataset updated
    Jun 2, 2024
    Dataset authored and provided by
    CodeRAG-Bench
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    The entire dump of GitHub repositories.

  8. rag-bench-public-texts

    • huggingface.co
    Updated Mar 25, 2025
    ai-forever (2025). rag-bench-public-texts [Dataset]. https://huggingface.co/datasets/ai-forever/rag-bench-public-texts
    Dataset updated
    Mar 25, 2025
    Authors
    ai-forever
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Public RAG bench dataset with texts

  9. delucionqa

    • huggingface.co
    Updated Aug 18, 2025
    Corvic.ai (2025). delucionqa [Dataset]. https://huggingface.co/datasets/corvicai/delucionqa
    Dataset updated
    Aug 18, 2025
    Dataset provided by
    Corvic.ai
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Summary

    This dataset is a domain-specific benchmark for Question Answering, using the Jeep 2023 Gladiator Car manual as its knowledge base. It combines the corpus from the original DelucionQA project by Bosch Research with questions sourced from the RAGBench dataset. The result is a challenging dataset designed to evaluate a system's ability to answer specific, technical questions based on a complex, real-world document.

      Supported Tasks
    

    Question Answering:… See the full description on the dataset page: https://huggingface.co/datasets/corvicai/delucionqa.

  10. code-retrieval-stackoverflow-small

    • huggingface.co
    Updated Jul 6, 2024
    CodeRAG-Bench (2024). code-retrieval-stackoverflow-small [Dataset]. https://huggingface.co/datasets/code-rag-bench/code-retrieval-stackoverflow-small
    Dataset updated
    Jul 6, 2024
    Dataset authored and provided by
    CodeRAG-Bench
    Description

    code-rag-bench/code-retrieval-stackoverflow-small dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. REAL-MM-RAG_FinReport

    • huggingface.co
    Updated Mar 13, 2025
    + more versions
    IBM Research (2025). REAL-MM-RAG_FinReport [Dataset]. https://huggingface.co/datasets/ibm-research/REAL-MM-RAG_FinReport
    Dataset updated
    Mar 13, 2025
    Dataset provided by
    IBM: http://ibm.com/
    IBM Research
    Authors
    IBM Research
    License

    https://choosealicense.com/licenses/cdla-permissive-2.0/

    Description

    REAL-MM-RAG-Bench: A Real-World Multi-Modal Retrieval Benchmark

    We introduced REAL-MM-RAG-Bench, a real-world multi-modal retrieval benchmark designed to evaluate retrieval models in reliable, challenging, and realistic settings. The benchmark was constructed using an automated pipeline, where queries were generated by a vision-language model (VLM), filtered by a large language model (LLM), and rephrased by an LLM to ensure high-quality retrieval evaluation. To simulate real-world… See the full description on the dataset page: https://huggingface.co/datasets/ibm-research/REAL-MM-RAG_FinReport.

  12. hist-rag-bench-public-texts

    • huggingface.co
    Updated Jul 10, 2025
    ai-forever (2025). hist-rag-bench-public-texts [Dataset]. https://huggingface.co/datasets/ai-forever/hist-rag-bench-public-texts
    Dataset updated
    Jul 10, 2025
    Authors
    ai-forever
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    DRAGON bench history public texts. Date: 2025.07.10

  13. test-rag-bench-private-qa

    • huggingface.co
    Updated Jul 17, 2025
    ai-forever (2025). test-rag-bench-private-qa [Dataset]. https://huggingface.co/datasets/ai-forever/test-rag-bench-private-qa
    Dataset updated
    Jul 17, 2025
    Authors
    ai-forever
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    RAG bench private QA dataset. Test version

  14. hist-rag-bench-private-qa

    • huggingface.co
    Updated Jul 10, 2025
    ai-forever (2025). hist-rag-bench-private-qa [Dataset]. https://huggingface.co/datasets/ai-forever/hist-rag-bench-private-qa
    Dataset updated
    Jul 10, 2025
    Authors
    ai-forever
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    DRAGON bench history private QA dataset. Date: 2025.07.10

  15. hist-rag-bench-public-questions

    • huggingface.co
    Updated Jul 19, 2025
    ai-forever (2025). hist-rag-bench-public-questions [Dataset]. https://huggingface.co/datasets/ai-forever/hist-rag-bench-public-questions
    Dataset updated
    Jul 19, 2025
    Authors
    ai-forever
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    DRAGON bench history public questions. Date: 2025.07.10

  16. hist-rag-bench-private-texts

    • huggingface.co
    Updated Jul 10, 2025
    ai-forever (2025). hist-rag-bench-private-texts [Dataset]. https://huggingface.co/datasets/ai-forever/hist-rag-bench-private-texts
    Dataset updated
    Jul 10, 2025
    Authors
    ai-forever
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    DRAGON bench history private texts (mappings). Date: 2025.07.10

  17. ChatRAG-Bench

    • huggingface.co
    Updated May 2, 2024
    NVIDIA (2025). ChatRAG-Bench [Dataset]. https://huggingface.co/datasets/nvidia/ChatRAG-Bench
    Dataset updated
    May 2, 2024
    Dataset provided by
    Nvidia: http://nvidia.com/
    Authors
    NVIDIA
    License

    https://choosealicense.com/licenses/other/

    Description

    ChatRAG Bench

    ChatRAG Bench is a benchmark for evaluating a model's conversational QA capability over documents or retrieved context. ChatRAG Bench is built on and derived from 10 existing datasets: Doc2Dial, QuAC, QReCC, TopiOCQA, INSCIT, CoQA, HybriDialogue, DoQA, SQA, ConvFinQA. ChatRAG Bench covers a wide range of documents and question types, which require models to generate responses from long context, comprehend and reason over tables, conduct arithmetic calculations, and… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/ChatRAG-Bench.

Galileo (2025). ragbench [Dataset]. https://huggingface.co/datasets/galileo-ai/ragbench

ragbench

galileo-ai/ragbench

112 scholarly articles cite this dataset (Google Scholar)
