    pan2020_dict_author_fandom_doc

    • huggingface.co
    Updated May 29, 2020
    Cite
    Peter Kirby (2020). pan2020_dict_author_fandom_doc [Dataset]. https://huggingface.co/datasets/peterkirby/pan2020_dict_author_fandom_doc
    Dataset updated: May 29, 2020
    Authors: Peter Kirby
    License: https://choosealicense.com/licenses/other/

    Description

    PAN2020 Fanfiction Author-Fandom-Disjoint Train/Validation Split

    PAN 2020 / PAN 2021 fanfiction authorship verification data with Train/Validation split. The training data has been pre-split into Train and Validation under Author-Fandom-Disjoint constraints as is appropriate for PAN21 test data. The training data is one row per document to allow easy recombination. The PAN21 validation and test splits consist of fixed document pairs for consistent scoring. The string fields are the… See the full description on the dataset page: https://huggingface.co/datasets/peterkirby/pan2020_dict_author_fandom_doc.
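
The description above mentions two ideas that a small example may make more concrete: splits that are disjoint in both author and fandom, and one-row-per-document data that can be recombined into verification pairs. The following is only a minimal sketch on toy data. The field names `author`, `fandom`, and `text`, the helper names, and the reading of "Author-Fandom-Disjoint" as "no author and no fandom shared between splits" are all assumptions for illustration, not the dataset's documented schema or the author's method.

```python
from itertools import combinations

# Toy rows, one per document. The field names are hypothetical and are
# not taken from the dataset card.
train_docs = [
    {"author": "a1", "fandom": "f1", "text": "first story"},
    {"author": "a1", "fandom": "f2", "text": "second story"},
    {"author": "a2", "fandom": "f1", "text": "third story"},
]
val_docs = [
    {"author": "a3", "fandom": "f3", "text": "fourth story"},
    {"author": "a4", "fandom": "f4", "text": "fifth story"},
]


def is_author_fandom_disjoint(split_a, split_b):
    """Return True if the two splits share no author and no fandom.

    This is an assumed interpretation of "Author-Fandom-Disjoint".
    """
    authors_a = {d["author"] for d in split_a}
    authors_b = {d["author"] for d in split_b}
    fandoms_a = {d["fandom"] for d in split_a}
    fandoms_b = {d["fandom"] for d in split_b}
    return authors_a.isdisjoint(authors_b) and fandoms_a.isdisjoint(fandoms_b)


def make_pairs(docs):
    """Recombine per-document rows into (doc1, doc2, same_author) triples."""
    return [
        (d1, d2, d1["author"] == d2["author"])
        for d1, d2 in combinations(docs, 2)
    ]


pairs = make_pairs(train_docs)
print(is_author_fandom_disjoint(train_docs, val_docs))
print(len(pairs), sum(same for _, _, same in pairs))
```

Keeping training data as individual documents leaves the pairing strategy open (for example, how many same-author versus different-author pairs to sample), whereas fixed pairs in the validation and test splits keep scores comparable across runs.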
