2 datasets found

discoverybench
huggingface.co
Updated Jun 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ai2 (2024). discoverybench [Dataset]. https://huggingface.co/datasets/allenai/discoverybench
Explore at:
Dataset updated
Jun 13, 2024
Dataset provided by
Allen Institute for AIhttp://allenai.org/
Authors
Ai2
License
https://choosealicense.com/licenses/odc-by/https://choosealicense.com/licenses/odc-by/
Description
Data-driven Discovery Benchmark from the paper: "DiscoveryBench: Towards Data-Driven Discovery with Large Language Models"

🔭 Overview

DiscoveryBench is designed to systematically assess current model capabilities in data-driven discovery tasks and provide a useful resource for improving them. Each DiscoveryBench task consists of a goal and dataset(s). Solving the task requires both statistical analysis and semantic reasoning. A faceted evaluation allows open-ended… See the full description on the dataset page: https://huggingface.co/datasets/allenai/discoverybench.
h
discoverybench
huggingface.co
Updated May 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Niklas Hoepner (2025). discoverybench [Dataset]. https://huggingface.co/datasets/nhop/discoverybench
Explore at:
Dataset updated
May 1, 2025
Authors
Niklas Hoepner
License
https://choosealicense.com/licenses/odc-by/https://choosealicense.com/licenses/odc-by/
Description
DiscoveryBench - Alias

A reformatted version of the original DiscoveryBench dataset for easier usage.

🤗 Original Dataset on HF
💻 GitHub Repository
📄 Paper (arXiv)

📁 Dataset Structure

The dataset consists of real and synthetic subsets: Real Splits:

real_train real_test

Synthetic Splits:

synth_train synth_dev synth_test

Each split contains a list of tasks with references to associated CSV datasets needed to answer the query. LLMs are expected to use the… See the full description on the dataset page: https://huggingface.co/datasets/nhop/discoverybench.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Ai2 (2024). discoverybench [Dataset]. https://huggingface.co/datasets/allenai/discoverybench

discoverybench

allenai/discoverybench

Explore at:

29 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Jun 13, 2024

Dataset provided by

Allen Institute for AIhttp://allenai.org/

Authors

Ai2

License

https://choosealicense.com/licenses/odc-by/https://choosealicense.com/licenses/odc-by/

Description

Data-driven Discovery Benchmark from the paper: "DiscoveryBench: Towards Data-Driven Discovery with Large Language Models"

  🔭 Overview

DiscoveryBench is designed to systematically assess current model capabilities in data-driven discovery tasks and provide a useful resource for improving them. Each DiscoveryBench task consists of a goal and dataset(s). Solving the task requires both statistical analysis and semantic reasoning. A faceted evaluation allows open-ended… See the full description on the dataset page: https://huggingface.co/datasets/allenai/discoverybench.

Clear search

Close search

Google apps

Main menu