  1. PAN12 Originality: Source Retrieval

    • zenodo.org
    • data.niaid.nih.gov
    Updated Jun 11, 2022
Cite
Martin Potthast; Tim Gollub; Matthias Hagen; Jan Graßegger; Johannes Kiesel; Maximilian Michel; Arnd Oberländer; Martin Tippmann; Alberto Barrón-Cedeño; Parth Gupta; Paolo Rosso; Benno Stein (2022). PAN12 Originality: Source Retrieval [Dataset]. http://doi.org/10.5281/zenodo.3713288

PAN12 Originality: Source Retrieval

Dataset updated
Jun 11, 2022
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
Martin Potthast; Tim Gollub; Matthias Hagen; Jan Graßegger; Johannes Kiesel; Maximilian Michel; Arnd Oberländer; Martin Tippmann; Alberto Barrón-Cedeño; Parth Gupta; Paolo Rosso; Benno Stein
Description

We provide a training corpus of suspicious documents. Each suspicious document is about a specific topic and may contain plagiarized passages taken from web pages on that topic in the ClueWeb09 corpus.
