Saved datasets
Last updated
Download format
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Provider
Free
Cost to access
Described as free to access or have a license that allows redistribution.
8 datasets found
  1. PAN Plagiarism Corpus 2011 (PAN-PC-11)

    • zenodo.org
    • live.european-language-grid.eu
    • +1more
    bin
    Updated Jun 11, 2022
    + more versions
  2. Z

    PAN Plagiarism Corpus 2010 (PAN-PC-10)

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jan 24, 2020
    + more versions
  3. W

    PAN-PC-09

    • webis.de
    • anthology.aicmu.ac.cn
    3250083
    Updated 2009
  4. W

    PAN-PC-10

    • webis.de
    • anthology.aicmu.ac.cn
    3250123
    Updated 2010
  5. Z

    Webis Plagiarism Corpus 2008 (Webis-PC-08)

    • data.niaid.nih.gov
    Updated Jun 11, 2022
  6. E

    Data from: Detecting Cross-Language Plagiarism using Open Knowledge Graphs

    • live.european-language-grid.eu
    • explore.openaire.eu
    • +2more
    txt
    Updated Apr 12, 2024
  7. W

    Webis-PC-08

    • anthology.aicmu.ac.cn
    • webis.de
    3254618
    Updated 2008
  8. W

    Webis-CPC-11

    • anthology.aicmu.ac.cn
    • webis.de
    3251771
    Updated 2011
    + more versions
  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Martin Potthast; Martin Potthast; Benno Stein; Benno Stein; Andreas Eiselt; Alberto Barrón-Cedeño; Paolo Rosso; Andreas Eiselt; Alberto Barrón-Cedeño; Paolo Rosso (2022). PAN Plagiarism Corpus 2011 (PAN-PC-11) [Dataset]. http://doi.org/10.5281/zenodo.3250095
Organization logo

PAN Plagiarism Corpus 2011 (PAN-PC-11)

Explore at:
8 scholarly articles cite this dataset (View in Google Scholar)
binAvailable download formats
Dataset updated
Jun 11, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Martin Potthast; Martin Potthast; Benno Stein; Benno Stein; Andreas Eiselt; Alberto Barrón-Cedeño; Paolo Rosso; Andreas Eiselt; Alberto Barrón-Cedeño; Paolo Rosso
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The PAN plagiarism corpus 2011 (PAN-PC-11) is a corpus for the evaluation of automatic plagiarism detection algorithms. For research purposes the corpus can be used free of charge.

The PAN-PC-11 contains documents in which plagiarism has been inserted automatically as well as documents in which plagiarism has been inserted manually. The former have been constructed using a so-called random plagiarist, a computer program which constructs plagiarism according to a number of parameters, while the latter have been obtained with crowdsourcing via Amazon's Mechanical Turk.

Search
Clear search
Close search
Google apps
Main menu