Saved datasets
Last updated
Download format
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Free
Cost to access
Described as free to access or have a license that allows redistribution.
9 datasets found
  1. PAN Plagiarism Corpus 2011 (PAN-PC-11)

    • zenodo.org
    rar
    Updated Jun 1, 2011
  2. PAN-PC-11

    • webis.de
    Updated 2011
  3. PAN Plagiarism Corpus 2010 (PAN-PC-10)

    • zenodo.org
    • explore.openaire.eu
    • +1more
    rar
    Updated May 1, 2010
  4. PAN-PC-10

    • webis.de
    Updated 2010
  5. PAN Plagiarism Corpus 2009 (PAN-PC-09)

    • zenodo.org
    • search.datacite.org
    rar
    Updated Sep 10, 2009
  6. PAN-PC-09

    • webis.de
    3250083
    Updated 2009
  7. o

    Webis Plagiarism Corpus 2008 (Webis-PC-08)

    • explore.openaire.eu
    • zenodo.org
    Updated Jan 1, 2008
  8. Detecting Cross-Language Plagiarism using Open Knowledge Graphs

    • zenodo.org
    zip
    Updated Jan 21, 2021
  9. Webis-PC-08

    • webis.de
    3254618
    Updated 2008
  10. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Potthast, Martin; Stein, Benno; Eiselt, Andreas; Barrón-Cedeño, Alberto; Rosso, Paolo (2011). PAN Plagiarism Corpus 2011 (PAN-PC-11) [Dataset]. http://doi.org/10.5281/zenodo.3250095
Organization logo

PAN Plagiarism Corpus 2011 (PAN-PC-11)

3 scholarly articles cite this dataset (View in Google Scholar)
rarAvailable download formats
Dataset updated
Jun 1, 2011
Dataset provided by
Bauhaus-Universität Weimarhttp://www.uni-weimar.de/
Universidad Polytécnica de Valencia
Authors
Potthast, Martin; Stein, Benno; Eiselt, Andreas; Barrón-Cedeño, Alberto; Rosso, Paolo
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The PAN plagiarism corpus 2011 (PAN-PC-11) is a corpus for the evaluation of automatic plagiarism detection algorithms. For research purposes the corpus can be used free of charge.

The PAN-PC-11 contains documents in which plagiarism has been inserted automatically as well as documents in which plagiarism has been inserted manually. The former have been constructed using a so-called random plagiarist, a computer program which constructs plagiarism according to a number of parameters, while the latter have been obtained with crowdsourcing via Amazon's Mechanical Turk.

Search
Clear search
Close search
Google apps
Main menu