5 datasets found
  1. h

    trin_data_tldr_explicit_dataset

    • huggingface.co
    Updated May 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kai Ye (2025). trin_data_tldr_explicit_dataset [Dataset]. https://huggingface.co/datasets/Kyleyee/trin_data_tldr_explicit_dataset
    Explore at:
    Dataset updated
    May 3, 2025
    Authors
    Kai Ye
    Description

    TL;DR Dataset for Preference Learning

      Summary
    

    The TL;DR dataset is a processed version of Reddit posts, specifically curated to train models using the TRL library for preference learning and Reinforcement Learning from Human Feedback (RLHF) tasks. It leverages the common practice on Reddit where users append "TL;DR" (Too Long; Didn't Read) summaries to lengthy posts, providing a rich source of paired text data for training models to understand and generate concise… See the full description on the dataset page: https://huggingface.co/datasets/Kyleyee/trin_data_tldr_explicit_dataset.

  2. h

    preference_dataset

    • huggingface.co
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sivaganesh Krishnan (2025). preference_dataset [Dataset]. https://huggingface.co/datasets/Sivaganesh07/preference_dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 14, 2025
    Authors
    Sivaganesh Krishnan
    Description

    TL;DR Dataset for Preference Learning

      Summary
    

    The TL;DR dataset is a processed version of Reddit posts, specifically curated to train models using the TRL library for preference learning and Reinforcement Learning from Human Feedback (RLHF) tasks. It leverages the common practice on Reddit where users append "TL;DR" (Too Long; Didn't Read) summaries to lengthy posts, providing a rich source of paired text data for training models to understand and generate concise… See the full description on the dataset page: https://huggingface.co/datasets/Sivaganesh07/preference_dataset.

  3. h

    tldr-preference

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRL, tldr-preference [Dataset]. https://huggingface.co/datasets/trl-lib/tldr-preference
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset authored and provided by
    TRL
    Description

    TL;DR Dataset for Preference Learning

      Summary
    

    The TL;DR dataset is a processed version of Reddit posts, specifically curated to train models using the TRL library for preference learning and Reinforcement Learning from Human Feedback (RLHF) tasks. It leverages the common practice on Reddit where users append "TL;DR" (Too Long; Didn't Read) summaries to lengthy posts, providing a rich source of paired text data for training models to understand and generate concise… See the full description on the dataset page: https://huggingface.co/datasets/trl-lib/tldr-preference.

  4. h

    preference_dataset1

    • huggingface.co
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sivaganesh Krishnan (2025). preference_dataset1 [Dataset]. https://huggingface.co/datasets/Sivaganesh07/preference_dataset1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 14, 2025
    Authors
    Sivaganesh Krishnan
    Description

    TL;DR Dataset for Preference Learning

      Summary
    

    The TL;DR dataset is a processed version of Reddit posts, specifically curated to train models using the TRL library for preference learning and Reinforcement Learning from Human Feedback (RLHF) tasks. It leverages the common practice on Reddit where users append "TL;DR" (Too Long; Didn't Read) summaries to lengthy posts, providing a rich source of paired text data for training models to understand and generate concise… See the full description on the dataset page: https://huggingface.co/datasets/Sivaganesh07/preference_dataset1.

  5. h

    preference_dataset2

    • huggingface.co
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sivaganesh Krishnan (2025). preference_dataset2 [Dataset]. https://huggingface.co/datasets/Sivaganesh07/preference_dataset2
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 14, 2025
    Authors
    Sivaganesh Krishnan
    Description

    TL;DR Dataset for Preference Learning

      Summary
    

    The TL;DR dataset is a processed version of Reddit posts, specifically curated to train models using the TRL library for preference learning and Reinforcement Learning from Human Feedback (RLHF) tasks. It leverages the common practice on Reddit where users append "TL;DR" (Too Long; Didn't Read) summaries to lengthy posts, providing a rich source of paired text data for training models to understand and generate concise… See the full description on the dataset page: https://huggingface.co/datasets/Sivaganesh07/preference_dataset2.

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Kai Ye (2025). trin_data_tldr_explicit_dataset [Dataset]. https://huggingface.co/datasets/Kyleyee/trin_data_tldr_explicit_dataset

trin_data_tldr_explicit_dataset

Kyleyee/trin_data_tldr_explicit_dataset

Explore at:
Dataset updated
May 3, 2025
Authors
Kai Ye
Description

TL;DR Dataset for Preference Learning

  Summary

The TL;DR dataset is a processed version of Reddit posts, specifically curated to train models using the TRL library for preference learning and Reinforcement Learning from Human Feedback (RLHF) tasks. It leverages the common practice on Reddit where users append "TL;DR" (Too Long; Didn't Read) summaries to lengthy posts, providing a rich source of paired text data for training models to understand and generate concise… See the full description on the dataset page: https://huggingface.co/datasets/Kyleyee/trin_data_tldr_explicit_dataset.

Search
Clear search
Close search
Google apps
Main menu