Search
Clear search
Close search
Main menu
Google apps
2 datasets found
  1. W

    PAN-PC-11

    • webis.de
    • anthology.aicmu.ac.cn
    3250095
    Updated 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Potthast; Benno Stein; Paolo Rosso (2011). PAN-PC-11 [Dataset]. http://doi.org/10.5281/zenodo.3250095
    Explore at:
    3250095Available download formats
    Dataset updated
    2011
    Dataset provided by
    University of Kassel, hessian.AI, and ScaDS.AI
    Bauhaus-Universität Weimar
    The Web Technology & Information Systems Network
    Authors
    Martin Potthast; Benno Stein; Paolo Rosso
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The PAN plagiarism corpus 2011 (PAN-PC-11) is a corpus for the evaluation of automatic plagiarism detection algorithms. For research purposes the corpus can be used free of charge.

  2. E

    PAN Plagiarism Corpus 2011 (PAN-PC-11)

    • live.european-language-grid.eu
    • data.niaid.nih.gov
    • +1more
    txt
    Updated Apr 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). PAN Plagiarism Corpus 2011 (PAN-PC-11) [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/7529
    Explore at:
    txtAvailable download formats
    Dataset updated
    Apr 26, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The PAN plagiarism corpus 2011 (PAN-PC-11) is a corpus for the evaluation of automatic plagiarism detection algorithms. For research purposes the corpus can be used free of charge.The PAN-PC-11 contains documents in which plagiarism has been inserted automatically as well as documents in which plagiarism has been inserted manually. The former have been constructed using a so-called random plagiarist, a computer program which constructs plagiarism according to a number of parameters, while the latter have been obtained with crowdsourcing via Amazon's Mechanical Turk.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Martin Potthast; Benno Stein; Paolo Rosso (2011). PAN-PC-11 [Dataset]. http://doi.org/10.5281/zenodo.3250095

PAN-PC-11

Explore at:
97 scholarly articles cite this dataset (View in Google Scholar)
3250095Available download formats
Dataset updated
2011
Dataset provided by
University of Kassel, hessian.AI, and ScaDS.AI
Bauhaus-Universität Weimar
The Web Technology & Information Systems Network
Authors
Martin Potthast; Benno Stein; Paolo Rosso
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The PAN plagiarism corpus 2011 (PAN-PC-11) is a corpus for the evaluation of automatic plagiarism detection algorithms. For research purposes the corpus can be used free of charge.