Saved datasets
Last updated
Download format
Croissant
Croissant is a format for Machine Learning datasets
Learn more about this at mlcommons.org/croissant.
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Provider
Free
Cost to access
Described as free to access or have a license that allows redistribution.
86 datasets found
  1. Fake News Content Detection: Weekend Hackathon #20

    • kaggle.com
    zip
    Updated Sep 11, 2020
  2. CT-FAN-21 corpus: A dataset for Fake News Detection

    • zenodo.org
    Updated Oct 23, 2022
    + more versions
  3. f

    Repository of fake news detection datasets

    • figshare.com
    • data.4tu.nl
    • +1more
    txt
    Updated Mar 18, 2021
  4. IFND dataset

    • kaggle.com
    zip
    Updated Feb 12, 2022
  5. o

    Data from: Fake news detection based on news content and social contexts: a...

    • omicsdi.org
    xml
    Updated Feb 27, 2024
  6. A

    ‘Fake News Content Detection 📰’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Sep 30, 2021
  7. E

    BanFakeNews

    • live.european-language-grid.eu
    csv
    Updated Dec 30, 2020
  8. P

    MM-COVID Dataset

    • paperswithcode.com
    Updated Nov 4, 2021
    + more versions
  9. Data from: On the Role of Images for Analyzing Claims in Social Media

    • zenodo.org
    Updated Apr 23, 2021
  10. Fake News Detection⚠️👁️‍🗨️

    • kaggle.com
    zip
    Updated Jul 23, 2023
  11. P

    UPFD Dataset

    • paperswithcode.com
    Updated Apr 24, 2021
  12. Data from: Profiling Fake News Spreaders on Twitter

    • zenodo.org
    Updated Sep 22, 2020
  13. h

    fake_news_english

    • huggingface.co
    • opendatalab.com
  14. E

    Some Like it Hoax

    • live.european-language-grid.eu
    json
    Updated Dec 30, 2017
  15. Source based Fake News Classification

    • kaggle.com
    • openml.org
    zip
    Updated Aug 29, 2020
    + more versions
  16. f

    Data from: Do You Speak Disinformation? Computational Detection of Deceptive...

    • tandf.figshare.com
    txt
    Updated Feb 15, 2024
  17. CT-FAN: A Multilingual dataset for Fake News Detection

    • zenodo.org
    • explore.openaire.eu
    zip
    Updated Oct 23, 2022
  18. Welfake dataset for fake news

    • kaggle.com
    zip
    Updated Feb 19, 2024
    + more versions
  19. O

    UPFD-GOS (User Preference-aware Fake News Detection)

    • opendatalab.com
    zip
    Updated Apr 18, 2023
    + more versions
  20. Fake News Challenge

    • kaggle.com
    zip
    Updated Apr 4, 2021
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jaswinder Singh (2020). Fake News Content Detection: Weekend Hackathon #20 [Dataset]. https://www.kaggle.com/datasets/jassican/fake-news-content-detection-weekend-hackathon-20
Organization logo

Fake News Content Detection: Weekend Hackathon #20

Explore at:
zip(573738 bytes)Available download formats
Dataset updated
Sep 11, 2020
Authors
Jaswinder Singh
Description

Context

Welcome to another weekend hackathon, this weekend we are providing a great opportunity to the machinehackers to flex their NLP muscles again by building a fake content detection algorithm. Fake contents are everywhere from social media platforms, news platforms and there is a big list. Considering the advancement in NLP research institutes are putting a lot of sweat, blood, and tears to detect the fake content generated across the platforms.

Fake news, defined by the New York Times as “a made-up story with an intention to deceive”, often for a secondary gain, is arguably one of the most serious challenges facing the news industry today. In a December Pew Research poll, 64% of US adults said that “made-up news” has caused a “great deal of confusion” about the facts of current events

In this hackathon, your goal as a data scientist is to create an NLP model, to combat fake content problems. We believe that these AI technologies hold promise for significantly automating parts of the procedure human fact-checkers use today to determine if a story is real or a hoax.

Content

Train.csv - 10240 rows x 3 columns (Inlcudes Labels Columns as Target) Test.csv - 1267 rows x 2 columns Sample Submission.csv - Please check the Evaluation section for more details on how to generate a valid submission Text - Raw content from social media/ new platforms Text_Tag - Different types of content tags (9 unique products) Labels - Represents various classes of Labels Half-True - 2 False - 1 Mostly-True - 3 True - 5 Barely-True - 0 Not-Known - 4

Inspiration

https://www.machinehack.com/hackathons/fake_news_content_detection_weekend_hackathon_20

Search
Clear search
Close search
Google apps
Main menu