Search
Clear search
Close search
Main menu
Google apps
3 datasets found
  1. Z

    PAN12 Originality: Source Retrieval

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stein, Benno (2022). PAN12 Originality: Source Retrieval [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3713287
    Explore at:
    Dataset updated
    Jun 11, 2022
    Dataset provided by
    Stein, Benno
    Oberländer, Arnd
    Kiesel, Johannes
    Tippmann, Martin
    Hagen, Matthias
    Potthast, Martin
    Gupta, Parth
    Gollub, Tim
    Rosso, Paolo
    Barrón-Cedeño, Alberto
    Michel, Maximilian
    Graßegger, Jan
    Description

    We provide you with a training corpus that consists of suspicious documents. Each suspicious document is about a specific topic and may consist of plagiarized passages obtained from web pages on that topic found in the ClueWeb09 corpus.

  2. PAN14 Originality: Source Retrieval

    • zenodo.org
    Updated Apr 2, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Potthast; Martin Potthast; Matthias Hagen; Matthias Hagen; Anne Beyer; Matthias Busse; Martin Tippmann; Paolo Rosso; Benno Stein; Benno Stein; Anne Beyer; Matthias Busse; Martin Tippmann; Paolo Rosso (2020). PAN14 Originality: Source Retrieval [Dataset]. http://doi.org/10.5281/zenodo.3716010
    Explore at:
    Dataset updated
    Apr 2, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Martin Potthast; Martin Potthast; Matthias Hagen; Matthias Hagen; Anne Beyer; Matthias Busse; Martin Tippmann; Paolo Rosso; Benno Stein; Benno Stein; Anne Beyer; Matthias Busse; Martin Tippmann; Paolo Rosso
    Description

    We provide you with a training corpus that consists of suspicious documents. Each suspicious document is about a specific topic and may consist of plagiarized passages obtained from web pages on that topic found in the ClueWeb09 corpus.

  3. PAN13 Originality: Source Retrieval

    • zenodo.org
    Updated Apr 21, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Potthast; Martin Potthast; Tim Gollub; Tim Gollub; Matthias Hagen; Matthias Hagen; Martin Tippmann; Johannes Kiesel; Johannes Kiesel; Paolo Rosso; Efstathios Stamatatos; Benno Stein; Benno Stein; Martin Tippmann; Paolo Rosso; Efstathios Stamatatos (2020). PAN13 Originality: Source Retrieval [Dataset]. http://doi.org/10.5281/zenodo.3715962
    Explore at:
    Dataset updated
    Apr 21, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Martin Potthast; Martin Potthast; Tim Gollub; Tim Gollub; Matthias Hagen; Matthias Hagen; Martin Tippmann; Johannes Kiesel; Johannes Kiesel; Paolo Rosso; Efstathios Stamatatos; Benno Stein; Benno Stein; Martin Tippmann; Paolo Rosso; Efstathios Stamatatos
    Description

    We provide you with a training corpus that consists of suspicious documents. Each suspicious document is about a specific topic and may consist of plagiarized passages obtained from web pages on that topic found in the ClueWeb09 corpus.

  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Stein, Benno (2022). PAN12 Originality: Source Retrieval [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3713287

PAN12 Originality: Source Retrieval

Explore at:
Dataset updated
Jun 11, 2022
Dataset provided by
Stein, Benno
Oberländer, Arnd
Kiesel, Johannes
Tippmann, Martin
Hagen, Matthias
Potthast, Martin
Gupta, Parth
Gollub, Tim
Rosso, Paolo
Barrón-Cedeño, Alberto
Michel, Maximilian
Graßegger, Jan
Description

We provide you with a training corpus that consists of suspicious documents. Each suspicious document is about a specific topic and may consist of plagiarized passages obtained from web pages on that topic found in the ClueWeb09 corpus.