Saved datasets
Last updated
Download format
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Cost to access
Described as free to access or have a license that allows redistribution.
2 datasets found
  1. BuzzFeed-Webis Fake News Corpus 16

    Updated Feb 20, 2018
  2. BuzzFeed-Webis Fake News Corpus 2016

    zip, txt, csv, xsd
    Updated Feb 20, 2018
  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Click to copy link
Link copied
Potthast, Martin; Kiesel, Johannes; Reinartz, Kevin; Bevendorff, Janek; Stein, Benno (2018) BuzzFeed-Webis Fake News Corpus 16. [Dataset]
Organization logoOrganization logo

BuzzFeed-Webis Fake News Corpus 16

1181813Available download formats
Dataset updated Feb 20, 2018
Dataset provided by
Leipzig University
Bauhaus-Universität Weimar
The Web Technology & Information Systems Network
Potthast, Martin; Kiesel, Johannes; Reinartz, Kevin; Bevendorff, Janek; Stein, Benno

Attribution 4.0 (CC BY 4.0)
License information was derived automatically


The BuzzFeed-Webis Fake News Corpus 16 comprises the output of 9 publishers in a week close to the US elections. Among the selected publishers are 6 prolific hyperpartisan ones (three left-wing and three right-wing), and three mainstream publishers (see Table 1). All publishers earned Facebook’s blue checkmark, indicating authenticity and an elevated status within the network. For seven weekdays (September 19 to 23 and September 26 and 27), every post and linked news article of the 9 publishers was fact-checked by professional journalists at BuzzFeed. In total, 1,627 articles were checked, 826 mainstream, 256 left-wing and 545 right-wing. The imbalance between categories results from differing publication frequencies.

Clear search
Close search
Google apps
Main menu