2 datasets found
  1. W

    Webis-Clickbait-16

    • webis.de
    • anthology.aicmu.ac.cn
    3251557
    Updated 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Potthast; Benno Stein; Matthias Hagen; Sebastian Köpsel (2016). Webis-Clickbait-16 [Dataset]. http://doi.org/10.5281/zenodo.3251557
    Explore at:
    3251557Available download formats
    Dataset updated
    2016
    Dataset provided by
    The Web Technology & Information Systems Network
    Friedrich Schiller University Jena
    Bauhaus-Universität Weimar
    University of Kassel, hessian.AI, and ScaDS.AI
    Authors
    Martin Potthast; Benno Stein; Matthias Hagen; Sebastian Köpsel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Webis Clickbait Corpus 2016 (Webis-Clickbait-16) comprises 2992 Twitter tweets sampled from top 20 news publishers as per retweets in 2014. The tweets have been manually annotated by three independent annotators with regard to whether they can be considered clickbait. A total of 767 tweets are considered clickbait by the majority of annotators. The majority vote of reviewers can be used as a ground truth to build clickbait detection technology. This corpus is the first of its kind and gives rise to the development of technology to tackle clickbait.

  2. E

    Webis Clickbait Corpus 2016 (Webis-Clickbait-16)

    • live.european-language-grid.eu
    • data.niaid.nih.gov
    json
    Updated Apr 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Webis Clickbait Corpus 2016 (Webis-Clickbait-16) [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/7534
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Apr 30, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Webis Clickbait Corpus 2016 (Webis-Clickbait-16) comprises 2992 Twitter tweets sampled from top 20 news publishers as per retweets in 2014. The tweets have been manually annotated by three independent annotators with regard to whether they can be considered clickbait. A total of 767 tweets are considered clickbait by the majority of annotators. The majority vote of reviewers can be used as a ground truth to build clickbait detection technology. This corpus is the first of its kind and gives rise to the development of technology to tackle clickbait.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Martin Potthast; Benno Stein; Matthias Hagen; Sebastian Köpsel (2016). Webis-Clickbait-16 [Dataset]. http://doi.org/10.5281/zenodo.3251557

Webis-Clickbait-16

Explore at:
3251557Available download formats
Dataset updated
2016
Dataset provided by
The Web Technology & Information Systems Network
Friedrich Schiller University Jena
Bauhaus-Universität Weimar
University of Kassel, hessian.AI, and ScaDS.AI
Authors
Martin Potthast; Benno Stein; Matthias Hagen; Sebastian Köpsel
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The Webis Clickbait Corpus 2016 (Webis-Clickbait-16) comprises 2992 Twitter tweets sampled from top 20 news publishers as per retweets in 2014. The tweets have been manually annotated by three independent annotators with regard to whether they can be considered clickbait. A total of 767 tweets are considered clickbait by the majority of annotators. The majority vote of reviewers can be used as a ground truth to build clickbait detection technology. This corpus is the first of its kind and gives rise to the development of technology to tackle clickbait.

Search
Clear search
Close search
Google apps
Main menu