Saved datasets
Last updated
Download format
Croissant
Croissant is a format for Machine Learning datasets
Learn more about this at mlcommons.org/croissant.
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Provider
Free
Cost to access
Described as free to access or have a license that allows redistribution.
6 datasets found
  1. h

    tldr-17

    • huggingface.co
    Updated Jun 12, 2020
  2. W

    Webis-TLDR-17

    • webis.de
    1043504
    Updated 2017
  3. Webis-TLDR-17 Corpus

    • zenodo.org
    • paperswithcode.com
    zip
    Updated Jan 24, 2020
  4. E

    Webis-TLDR-17 Corpus

    • live.european-language-grid.eu
    json
    Updated Dec 30, 2017
  5. h

    openai-summarize-tldr

    • huggingface.co
    Updated Apr 30, 2024
    + more versions
  6. E

    Welsh Summary Creator Tool

    • live.european-language-grid.eu
    Updated Mar 17, 2022
  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Webis Group (2020). tldr-17 [Dataset]. https://huggingface.co/datasets/webis/tldr-17

tldr-17

webis/tldr-17

Reddit Webis-TLDR-17

Explore at:
48 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 12, 2020
Dataset authored and provided by
Webis Group
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This corpus contains preprocessed posts from the Reddit dataset. The dataset consists of 3,848,330 posts with an average length of 270 words for content, and 28 words for the summary.

Features includes strings: author, body, normalizedBody, content, summary, subreddit, subreddit_id. Content is used as document and summary is used as summary.

Search
Clear search
Close search
Google apps
Main menu