MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
ClueWeb-Reco:
Source Files
-- cwid_to_id.tsv: mapping between official ClueWeb22 docids and our internal docids
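A minimal sketch of reading the mapping file into a lookup table follows. The listing does not state the column layout, so this assumes two tab-separated columns, with the official ClueWeb22 docid first and an integer internal docid second. The function name load_id_mapping is hypothetical and not part of the dataset's tooling.

```python
import csv
from typing import Dict, Iterable


def load_id_mapping(lines: Iterable[str]) -> Dict[str, int]:
    """Parse tab-separated (ClueWeb22 docid, internal docid) rows into a dict.

    ASSUMPTION: each row has the official docid in column 1 and an integer
    internal docid in column 2. Rows that do not fit (blank lines, a possible
    header row) are skipped rather than raising an error.
    """
    mapping: Dict[str, int] = {}
    for row in csv.reader(lines, delimiter="\t"):
        if len(row) < 2:
            continue  # blank or malformed line
        cwid, internal = row[0].strip(), row[1].strip()
        try:
            mapping[cwid] = int(internal)
        except ValueError:
            continue  # non-integer second column, e.g. a header row
    return mapping


# Usage with the real file (path is an assumption):
# with open("cwid_to_id.tsv", newline="", encoding="utf-8") as handle:
#     cwid_to_id = load_id_mapping(handle)
```

If the actual file orders its columns the other way or uses non-integer internal docids, the column indices and the int conversion need to be adjusted accordingly.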
Splits in pure interaction format
interaction_splits:
-- valid_inter_input.tsv: input for validation dataset
-- valid_inter_target.tsv: validation dataset ground truth
-- test_inter_input.tsv: input for testing dataset (ground truth hidden)
Splits in ordered cw id list format
ordered_id_splits:
-- valid_input.tsv: input for validation dataset…
See the full description on the dataset page: https://huggingface.co/datasets/cx-cmu/ClueWeb-Reco.