Saved datasets
1 dataset found
  1. PAN13 Originality: Source Retrieval

    • zenodo.org
    Updated Sep 23, 2013
  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Potthast, Martin; Gollub, Tim; Hagen, Matthias; Tippmann, Martin; Kiesel, Johannes; Rosso, Paolo; Stamatatos, Efstathios; Stein, Benno (2013). PAN13 Originality: Source Retrieval [Dataset]. http://doi.org/10.5281/zenodo.3715962
Organization logoOrganization logo

PAN13 Originality: Source Retrieval

Dataset updated Sep 23, 2013
Dataset provided by
Bauhaus-Universität Weimarhttps://www.uni-weimar.de/
Martin-Luther-University Halle-Wittenberghttp://www.uni-halle.de/
Universität Leipzig
Authors
Potthast, Martin; Gollub, Tim; Hagen, Matthias; Tippmann, Martin; Kiesel, Johannes; Rosso, Paolo; Stamatatos, Efstathios; Stein, Benno
Description

We provide you with a training corpus that consists of suspicious documents. Each suspicious document is about a specific topic and may consist of plagiarized passages obtained from web pages on that topic found in the ClueWeb09 corpus.

Search
Clear search
Close search
Google apps
Main menu