2 datasets found
  1. discoverybench

    • huggingface.co
    Updated Jun 13, 2024
    Cite
    Ai2 (2024). discoverybench [Dataset]. https://huggingface.co/datasets/allenai/discoverybench
    Dataset updated
    Jun 13, 2024
    Dataset provided by
    Allen Institute for AI (http://allenai.org/)
    Authors
    Ai2
    License

    https://choosealicense.com/licenses/odc-by/

    Description

    Data-driven Discovery Benchmark from the paper: "DiscoveryBench: Towards Data-Driven Discovery with Large Language Models"

    🔭 Overview

    DiscoveryBench is designed to systematically assess current model capabilities in data-driven discovery tasks and provide a useful resource for improving them. Each DiscoveryBench task consists of a goal and dataset(s). Solving the task requires both statistical analysis and semantic reasoning. A faceted evaluation allows open-ended… See the full description on the dataset page: https://huggingface.co/datasets/allenai/discoverybench.

  2. discoverybench

    • huggingface.co
    Updated May 1, 2025
    Cite
    Niklas Hoepner (2025). discoverybench [Dataset]. https://huggingface.co/datasets/nhop/discoverybench
    Dataset updated
    May 1, 2025
    Authors
    Niklas Hoepner
    License

    https://choosealicense.com/licenses/odc-by/

    Description

    DiscoveryBench - Alias

    A reformatted version of the original DiscoveryBench dataset for easier usage.

    🤗 Original Dataset on HF
    💻 GitHub Repository
    📄 Paper (arXiv)

    📁 Dataset Structure

    The dataset consists of real and synthetic subsets.

    Real splits:

    • real_train
    • real_test

    Synthetic splits:

    • synth_train
    • synth_dev
    • synth_test

    Each split contains a list of tasks with references to associated CSV datasets needed to answer the query. LLMs are expected to use the… See the full description on the dataset page: https://huggingface.co/datasets/nhop/discoverybench.
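
    The split naming described above can be sketched in Python. This is a minimal illustration, not the dataset author's code: `group_splits` and `load_split` are hypothetical helpers, and `load_split` assumes that each listed name is exposed as a split of the `nhop/discoverybench` repository (it may instead be a configuration name, which the truncated description does not settle). It relies on the third-party `datasets` library, imported only when the loader is called.

```python
from collections import defaultdict

# Split names exactly as listed in the dataset description.
SPLITS = ["real_train", "real_test", "synth_train", "synth_dev", "synth_test"]


def group_splits(names):
    """Group split names by their subset prefix ('real' or 'synth')."""
    groups = defaultdict(list)
    for name in names:
        # 'real_train' -> subset 'real', part 'train'
        subset, _, part = name.partition("_")
        groups[subset].append(part)
    return dict(groups)


def load_split(name, repo_id="nhop/discoverybench"):
    """Hypothetical loader.

    Assumption: each name in SPLITS is available as a split of the repo.
    Requires network access and `pip install datasets`.
    """
    if name not in SPLITS:
        raise ValueError(f"unknown split: {name!r}")
    from datasets import load_dataset  # imported lazily; third-party
    return load_dataset(repo_id, split=name)


if __name__ == "__main__":
    # Prints the real/synthetic grouping without touching the network.
    print(group_splits(SPLITS))
```

    Grouping by prefix keeps the real and synthetic subsets separate, so an evaluation loop could iterate over one subset at a time; the loader itself should be checked against the dataset page before use.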


discoverybench

allenai/discoverybench

29 scholarly articles cite this dataset (View in Google Scholar)
