2 datasets found
  1. h

    OmniEval-AutoGen-Dataset

    • huggingface.co
    Updated Jan 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OmniEval-AutoGen-Dataset [Dataset]. https://huggingface.co/datasets/RUC-NLPIR/OmniEval-AutoGen-Dataset
    Explore at:
    Dataset updated
    Jan 2, 2025
    Dataset authored and provided by
    NLPIR Lab @ RUC
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Dataset Information

    We introduce an omnidirectional and automatic RAG benchmark, OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain, in the financial domain. Our benchmark is characterized by its multi-dimensional evaluation framework, including:

    a matrix-based RAG scenario evaluation system that categorizes queries into five task classes and 16 financial topics, leading to a structured assessment of diverse query scenarios; a… See the full description on the dataset page: https://huggingface.co/datasets/RUC-NLPIR/OmniEval-AutoGen-Dataset.

  2. h

    OmniEval-KnowledgeCorpus

    • huggingface.co
    Updated Jan 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NLPIR Lab @ RUC (2025). OmniEval-KnowledgeCorpus [Dataset]. https://huggingface.co/datasets/RUC-NLPIR/OmniEval-KnowledgeCorpus
    Explore at:
    Dataset updated
    Jan 2, 2025
    Dataset authored and provided by
    NLPIR Lab @ RUC
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Dataset Information

    We introduce an omnidirectional and automatic RAG benchmark, OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain, in the financial domain. Our benchmark is characterized by its multi-dimensional evaluation framework, including:

    a matrix-based RAG scenario evaluation system that categorizes queries into five task classes and 16 financial topics, leading to a structured assessment of diverse query scenarios; a… See the full description on the dataset page: https://huggingface.co/datasets/RUC-NLPIR/OmniEval-KnowledgeCorpus.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
OmniEval-AutoGen-Dataset [Dataset]. https://huggingface.co/datasets/RUC-NLPIR/OmniEval-AutoGen-Dataset

OmniEval-AutoGen-Dataset

RUC-NLPIR/OmniEval-AutoGen-Dataset

Explore at:
Dataset updated
Jan 2, 2025
Dataset authored and provided by
NLPIR Lab @ RUC
License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Dataset Information

We introduce an omnidirectional and automatic RAG benchmark, OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain, in the financial domain. Our benchmark is characterized by its multi-dimensional evaluation framework, including:

a matrix-based RAG scenario evaluation system that categorizes queries into five task classes and 16 financial topics, leading to a structured assessment of diverse query scenarios; a… See the full description on the dataset page: https://huggingface.co/datasets/RUC-NLPIR/OmniEval-AutoGen-Dataset.

Search
Clear search
Close search
Google apps
Main menu