1 dataset found
  1. Z

    PAN14 Author Identification: Verification

    • data.niaid.nih.gov
    Updated Nov 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Barrón-Cedeño, Alberto (2023). PAN14 Author Identification: Verification [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3716032
    Explore at:
    Dataset updated
    Nov 13, 2023
    Dataset provided by
    Stein, Benno
    Juola, Patrick
    A. Sanchez-Perez, Miguel
    Potthast, Martin
    Verhoeven, Ben
    Stamatatos, Efstathios
    Barrón-Cedeño, Alberto
    Daelemans, Walter
    Description

    We provide you with a training corpus that comprises a set of author verification problems in several languages/genres. Each problem consists of some (up to five) known documents by a single person and exactly one questioned document. All documents within a single problem instance will be in the same language and best efforts are applied to assure that within-problem documents are matched for genre, register, theme, and date of writing. The document lengths vary from a few hundred to a few thousand words.

    More information: Link

  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Barrón-Cedeño, Alberto (2023). PAN14 Author Identification: Verification [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3716032

PAN14 Author Identification: Verification

Explore at:
Dataset updated
Nov 13, 2023
Dataset provided by
Stein, Benno
Juola, Patrick
A. Sanchez-Perez, Miguel
Potthast, Martin
Verhoeven, Ben
Stamatatos, Efstathios
Barrón-Cedeño, Alberto
Daelemans, Walter
Description

We provide you with a training corpus that comprises a set of author verification problems in several languages/genres. Each problem consists of some (up to five) known documents by a single person and exactly one questioned document. All documents within a single problem instance will be in the same language and best efforts are applied to assure that within-problem documents are matched for genre, register, theme, and date of writing. The document lengths vary from a few hundred to a few thousand words.

More information: Link

Search
Clear search
Close search
Google apps
Main menu