2 datasets found
  1. PAN-PC-11

    • webis.de
    3250095
    Updated 2011
    Cite
    Martin Potthast; Benno Stein; Paolo Rosso (2011). PAN-PC-11 [Dataset]. http://doi.org/10.5281/zenodo.3250095
    Dataset updated
    2011
    Dataset provided by
    The Web Technology & Information Systems Network
    University of Kassel, hessian.AI, and ScaDS.AI
    Bauhaus-Universität Weimar
    Authors
    Martin Potthast; Benno Stein; Paolo Rosso
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The PAN plagiarism corpus 2011 (PAN-PC-11) is a corpus for the evaluation of automatic plagiarism detection algorithms. For research purposes the corpus can be used free of charge.

  2. PAN Plagiarism Corpus 2011 (PAN-PC-11)

    • explore.openaire.eu
    • live.european-language-grid.eu
    • +1 more
    Updated Jun 1, 2011
    Cite
    Martin Potthast; Benno Stein; Andreas Eiselt; Alberto Barrón-Cedeño; Paolo Rosso (2011). PAN Plagiarism Corpus 2011 (PAN-PC-11) [Dataset]. http://doi.org/10.5281/zenodo.3250094
    Dataset updated
    Jun 1, 2011
    Authors
    Martin Potthast; Benno Stein; Andreas Eiselt; Alberto Barrón-Cedeño; Paolo Rosso
    Description

    The PAN plagiarism corpus 2011 (PAN-PC-11) is a corpus for the evaluation of automatic plagiarism detection algorithms. For research purposes the corpus can be used free of charge. The PAN-PC-11 contains documents in which plagiarism has been inserted automatically as well as documents in which plagiarism has been inserted manually. The former have been constructed using a so-called random plagiarist, a computer program which constructs plagiarism according to a number of parameters, while the latter have been obtained with crowdsourcing via Amazon's Mechanical Turk.

    References: Benno Stein, Martin Potthast, Alberto Barrón-Cedeño, Paolo Rosso, Efstathios Stamatatos, and Moshe Koppel. 4th International Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 2010). SIGIR Forum, 45 (1): 45-48, June 2011.


PAN-PC-11: 108 scholarly articles cite this dataset (per Google Scholar).
