Saved datasets
Last updated
Download format
Croissant
Croissant is a format for Machine Learning datasets
Learn more about this at mlcommons.org/croissant.
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Provider
Free
Cost to access
Described as free to access or have a license that allows redistribution.
100+ datasets found
  1. W

    Webis-Clickbait-22

    • webis.de
    Updated 2022
    + more versions
  2. W

    Webis-Dataset-Reviews-21

    • webis.de
    4491927
    Updated 2021
  3. W

    Webis-QInC-22

    • webis.de
    5820673
    Updated 2022
    + more versions
  4. W

    Webis-SameSide-19

    • webis.de
    • zenodo.org
    4382353
    Updated 2020
  5. W

    Webis-Ambient-15

    • webis.de
    • live.european-language-grid.eu
    • +1more
    3250669
    Updated 2015
    + more versions
  6. W

    Webis-PC-08

    • webis.de
    3254618
    Updated 2008
  7. W

    webis-comparative-web-search-questions-20

    • webis.de
    Updated 2020
  8. W

    Webis-QSpell-17

    • webis.de
    3256201
    Updated 2017
    + more versions
  9. W

    Webis-Context-sensitive-Word-Search-Queries-2022

    • webis.de
    6425595
    Updated 2022
  10. W

    Webis-TLDR-17

    • webis.de
    1043504
    Updated 2017
  11. W

    Webis-CLS-10

    • webis.de
    3251672
    Updated 2010
  12. W

    Webis-Clickbait-17

    • webis.de
    5530410
    Updated 2017
    + more versions
  13. W

    Webis-Bias-Flipper-18

    • webis.de
    • data.niaid.nih.gov
    • +1more
    3250686
    Updated 2018
  14. W

    Webis-Mnemonics-17

    • webis.de
    • live.european-language-grid.eu
    • +1more
    3254443
    Updated 2017
  15. W

    Webis-WebSeg-20-Algorithm-Segmentations

    • webis.de
    • explore.openaire.eu
    • +2more
    4146889
    Updated 2021
    + more versions
  16. W

    Webis-WVC-07

    • webis.de
    3341473
    Updated 2007
  17. W

    Webis-Trigger-Warning-Corpus-22

    • webis.de
    7976807
    Updated 2023
  18. W

    CauseNet-20

    • webis.de
    3876154
    Updated 2020
  19. W

    Webis-Snippet-20

    • webis.de
    3653834
    Updated 2020
    + more versions
  20. W

    Webis-WebSeg-20

    • webis.de
    • zenodo.org
    3354902
    Updated 2020
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Matthias Hagen; Maik Fröbe; Artur Jurk; Martin Potthast (2022). Webis-Clickbait-22 [Dataset]. https://webis.de/data/webis-clickbait-22.html

Webis-Clickbait-22

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
2022
Dataset provided by
The Web Technology & Information Systems Network
University of Kassel, hessian.AI, and ScaDS.AI
Friedrich Schiller University Jena
Authors
Matthias Hagen; Maik Fröbe; Artur Jurk; Martin Potthast
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The Webis Clickbait Spoiling Corpus 2022 (Webis-Clickbait-22) contains 5,000 spoiled clickbait posts crawled from Facebook, Reddit, and Twitter. This corpus supports the task of clickbait spoiling, which deals with generating a short text that satisfies the curiosity induced by a clickbait post.

Search
Clear search
Close search
Google apps
Main menu