Dataset Card for nfcorpus/train
The nfcorpus/train dataset, provided by the ir-datasets package. For more information about the dataset, see the documentation.
Data
This dataset provides:
queries (i.e., topics); count=2,594
qrels (relevance assessments); count=139,350
For docs, use irds/nfcorpus
Usage
from datasets import load_dataset
queries = load_dataset('irds/nfcorpus_train', 'queries')
for record in queries:
    record # {'query_id': ..., 'title':…
See the full description on the dataset page: https://huggingface.co/datasets/irds/nfcorpus_train.
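Since the card lists qrels alongside queries, a common next step is to group the relevance assessments by query. The following is a minimal sketch, not the card's own code: the field names `query_id`, `doc_id`, and `relevance` are assumptions based on the usual ir-datasets qrels layout, and the sample records are hypothetical values made up for illustration.

```python
from collections import defaultdict


def group_qrels_by_query(qrels):
    """Map each query_id to a {doc_id: relevance} dict.

    Assumes every record is a dict with 'query_id', 'doc_id' and
    'relevance' keys (an assumption, not confirmed by the card).
    """
    grouped = defaultdict(dict)
    for record in qrels:
        grouped[record["query_id"]][record["doc_id"]] = record["relevance"]
    return dict(grouped)


# Hypothetical records for illustration only; not taken from the dataset.
sample_qrels = [
    {"query_id": "q1", "doc_id": "d1", "relevance": 2},
    {"query_id": "q1", "doc_id": "d2", "relevance": 1},
    {"query_id": "q2", "doc_id": "d1", "relevance": 1},
]
grouped = group_qrels_by_query(sample_qrels)
```

With the real data, the same function could presumably be fed the records of the `qrels` configuration in place of `sample_qrels`, provided the field names match.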
Dataset Card for nfcorpus/train/video
The nfcorpus/train/video dataset, provided by the ir-datasets package. For more information about the dataset, see the documentation.
Data
This dataset provides:
queries (i.e., topics); count=812
qrels (relevance assessments); count=27,465
For docs, use irds/nfcorpus
Usage
from datasets import load_dataset
queries = load_dataset('irds/nfcorpus_train_video', 'queries')
for record in queries:
    record # {'query_id':…
See the full description on the dataset page: https://huggingface.co/datasets/irds/nfcorpus_train_video.
Dataset Card for nfcorpus/train/nontopic
The nfcorpus/train/nontopic dataset, provided by the ir-datasets package. For more information about the dataset, see the documentation.
Data
This dataset provides:
queries (i.e., topics); count=1,141
qrels (relevance assessments); count=37,383
For docs, use irds/nfcorpus
Usage
from datasets import load_dataset
queries = load_dataset('irds/nfcorpus_train_nontopic', 'queries')
for record in queries:
    record #…
See the full description on the dataset page: https://huggingface.co/datasets/irds/nfcorpus_train_nontopic.
Dataset Card for nfcorpus
The nfcorpus dataset, provided by the ir-datasets package. For more information about the dataset, see the documentation.
Data
This dataset provides:
docs (documents, i.e., the corpus); count=5,371
This dataset is used by: nfcorpus_dev, nfcorpus_dev_nontopic, nfcorpus_dev_video, nfcorpus_test, nfcorpus_test_nontopic, nfcorpus_test_video, nfcorpus_train, nfcorpus_train_nontopic, nfcorpus_train_video
Usage
from datasets import…
See the full description on the dataset page: https://huggingface.co/datasets/irds/nfcorpus.
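The cards in this listing appear to follow one naming pattern: the ir-datasets ID has its slashes replaced by underscores and is placed under the `irds` namespace, while the docs always come from the top-level dataset. The helpers below are a small sketch of that pattern as inferred from these four cards only; the function names are hypothetical, and the rule may not hold for other datasets.

```python
def hub_repo_for(irds_id, namespace="irds"):
    """Build the Hub repository name for an ir-datasets ID.

    Example pattern seen in the listing:
    'nfcorpus/train/video' -> 'irds/nfcorpus_train_video'.
    """
    return f"{namespace}/{irds_id.replace('/', '_')}"


def docs_repo_for(irds_id, namespace="irds"):
    """Build the name of the repository holding the docs.

    The subset cards say "For docs, use irds/nfcorpus", i.e. the
    first path component of the ir-datasets ID.
    """
    return f"{namespace}/{irds_id.split('/')[0]}"


train_video_repo = hub_repo_for("nfcorpus/train/video")
train_video_docs = docs_repo_for("nfcorpus/train/video")
```

Keeping the two helpers separate mirrors the split in the cards themselves: queries and qrels live in the subset repositories, and the corpus lives in one shared repository.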