2 datasets found

ai2_arc
huggingface.co
tensorflow.org
+1more
Updated Jan 17, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ai2 (2024). ai2_arc [Dataset]. https://huggingface.co/datasets/allenai/ai2_arc
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 17, 2024
Dataset provided by
Allen Institute for AIhttp://allenai.org/
Authors
Ai2
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Dataset Card for "ai2_arc"

Dataset Summary

A new dataset of 7,787 genuine grade-school level, multiple-choice science questions, assembled to encourage research in advanced question-answering. The dataset is partitioned into a Challenge Set and an Easy Set, where the former contains only questions answered incorrectly by both a retrieval-based algorithm and a word co-occurrence algorithm. We are also including a corpus of over 14 million science sentences relevant to… See the full description on the dataset page: https://huggingface.co/datasets/allenai/ai2_arc.
T
ai2_arc_with_ir
tensorflow.org
Updated Nov 29, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). ai2_arc_with_ir [Dataset]. https://www.tensorflow.org/datasets/catalog/ai2_arc_with_ir
Explore at:
Dataset updated
Nov 29, 2023
Description
A new dataset of 7,787 genuine grade-school level, multiple-choice science questions, assembled to encourage research in advanced question-answering. The dataset is partitioned into a Challenge Set and an Easy Set, where the former contains only questions answered incorrectly by both a retrieval-based algorithm and a word co-occurrence algorithm. We are also including a corpus of over 14 million science sentences relevant to the task, and an implementation of three neural baseline models for this dataset. We pose ARC as a challenge to the community.

Compared to the original dataset, this adds context sentences obtained through information retrieval in the same way as UnifiedQA (see: https://arxiv.org/abs/2005.00700 ).

To use this dataset:

import tensorflow_datasets as tfds ds = tfds.load('ai2_arc_with_ir', split='train') for ex in ds.take(4): print(ex)

See the guide for more informations on tensorflow_datasets.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Ai2 (2024). ai2_arc [Dataset]. https://huggingface.co/datasets/allenai/ai2_arc

ai2_arc

Ai2Arc

allenai/ai2_arc

Explore at:

93 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jan 17, 2024

Dataset provided by

Allen Institute for AIhttp://allenai.org/

Authors

Ai2

License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Dataset Card for "ai2_arc"

  Dataset Summary

A new dataset of 7,787 genuine grade-school level, multiple-choice science questions, assembled to encourage research in advanced question-answering. The dataset is partitioned into a Challenge Set and an Easy Set, where the former contains only questions answered incorrectly by both a retrieval-based algorithm and a word co-occurrence algorithm. We are also including a corpus of over 14 million science sentences relevant to… See the full description on the dataset page: https://huggingface.co/datasets/allenai/ai2_arc.

Clear search

Close search

Google apps

Main menu

ai2_arc

ai2_arc_with_ir

ai2_arc

Ai2Arc

allenai/ai2_arc