2 datasets found
  1. h

    RedPajama-Data-1T-no-cc-c4

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sippy Coder, RedPajama-Data-1T-no-cc-c4 [Dataset]. https://huggingface.co/datasets/sippycoder/RedPajama-Data-1T-no-cc-c4
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Sippy Coder
    Description

    RedPajama is a clean-room, fully open-source implementation of the LLaMa dataset.

  2. h

    RedPajama-Data-1T

    • huggingface.co
    • opendatalab.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Together, RedPajama-Data-1T [Dataset]. https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset authored and provided by
    Together
    Description

    RedPajama is a clean-room, fully open-source implementation of the LLaMa dataset.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sippy Coder, RedPajama-Data-1T-no-cc-c4 [Dataset]. https://huggingface.co/datasets/sippycoder/RedPajama-Data-1T-no-cc-c4

RedPajama-Data-1T-no-cc-c4

Red Pajama 1T (no CC & C4)

sippycoder/RedPajama-Data-1T-no-cc-c4

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Sippy Coder
Description

RedPajama is a clean-room, fully open-source implementation of the LLaMa dataset.

Search
Clear search
Close search
Google apps
Main menu