2 datasets found
  1. P

    LIMA Dataset

    • paperswithcode.com
    • huggingface.co
    Updated Nov 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chunting Zhou; PengFei Liu; Puxin Xu; Srini Iyer; Jiao Sun; Yuning Mao; Xuezhe Ma; Avia Efrat; Ping Yu; Lili Yu; Susan Zhang; Gargi Ghosh; Mike Lewis; Luke Zettlemoyer; Omer Levy (2024). LIMA Dataset [Dataset]. https://paperswithcode.com/dataset/lima
    Explore at:
    Dataset updated
    Nov 24, 2024
    Authors
    Chunting Zhou; PengFei Liu; Puxin Xu; Srini Iyer; Jiao Sun; Yuning Mao; Xuezhe Ma; Avia Efrat; Ping Yu; Lili Yu; Susan Zhang; Gargi Ghosh; Mike Lewis; Luke Zettlemoyer; Omer Levy
    Description

    The LIMA dataset is a valuable resource used in natural language processing (NLP) research. Let me provide you with some details:

    Origin and Purpose: The LIMA dataset is derived from the LLaMa language model, which has an impressive 65 billion parameters.

    It serves as a fine-tuned version of the LLaMa model, specifically adjusted using approximately 1,000 prompts and responses.

    Performance and Applications:

    LIMA demonstrates remarkable performance by learning to follow specific response formats from just a handful of examples in the training data. The dataset covers a wide range of tasks, including complex queries such as planning trip itineraries and speculating about alternate history.

    Interestingly, the model tends to generalize well to unseen tasks that were not part of the training data.

    License:

    The licensing of the LIMA dataset depends on the source data it was derived from: If the source data has a stricter license than CC BY-NC-SA, the LIMA dataset follows the same restrictions. Otherwise, it adheres to the CC BY-NC-SA license.

    (1) GAIR/lima · Datasets at Hugging Face. https://huggingface.co/datasets/GAIR/lima. (2) GAIR/lima at main - Hugging Face. https://huggingface.co/datasets/GAIR/lima/tree/main. (3) 日本語LIMAデータセットlima-jaを作成したので公開します. https://zanote.net/ai/lima-ja/. (4) Paper page - LIMA: Less Is More for Alignment - Hugging Face. https://huggingface.co/papers/2305.11206. (5) undefined. https://huggingface.co/datasets/GAIR/lima/.

  2. h

    ai-tube-llama-papers

    • huggingface.co
    Updated Dec 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julian Bilcke (2023). ai-tube-llama-papers [Dataset]. https://huggingface.co/datasets/jbilcke-hf/ai-tube-llama-papers
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 15, 2023
    Authors
    Julian Bilcke
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Description

    Follow me to learn about all the latest scientific papers!

      Model
    

    SVD

      Voice
    

    Julian

      Tags
    

    Science Education

      Style
    

    influencer, professional

      Music
    

    melodic balearic deep house

      Prompt
    

    A channel where a Llama will explain scientific papers, condensed into a few minutes, to make them accessible to non-scientific audiences. The typical layout should explain the context, the paper's idea, equivalent work, and why… See the full description on the dataset page: https://huggingface.co/datasets/jbilcke-hf/ai-tube-llama-papers.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Chunting Zhou; PengFei Liu; Puxin Xu; Srini Iyer; Jiao Sun; Yuning Mao; Xuezhe Ma; Avia Efrat; Ping Yu; Lili Yu; Susan Zhang; Gargi Ghosh; Mike Lewis; Luke Zettlemoyer; Omer Levy (2024). LIMA Dataset [Dataset]. https://paperswithcode.com/dataset/lima

LIMA Dataset

Explore at:
Dataset updated
Nov 24, 2024
Authors
Chunting Zhou; PengFei Liu; Puxin Xu; Srini Iyer; Jiao Sun; Yuning Mao; Xuezhe Ma; Avia Efrat; Ping Yu; Lili Yu; Susan Zhang; Gargi Ghosh; Mike Lewis; Luke Zettlemoyer; Omer Levy
Description

The LIMA dataset is a valuable resource used in natural language processing (NLP) research. Let me provide you with some details:

Origin and Purpose: The LIMA dataset is derived from the LLaMa language model, which has an impressive 65 billion parameters.

It serves as a fine-tuned version of the LLaMa model, specifically adjusted using approximately 1,000 prompts and responses.

Performance and Applications:

LIMA demonstrates remarkable performance by learning to follow specific response formats from just a handful of examples in the training data. The dataset covers a wide range of tasks, including complex queries such as planning trip itineraries and speculating about alternate history.

Interestingly, the model tends to generalize well to unseen tasks that were not part of the training data.

License:

The licensing of the LIMA dataset depends on the source data it was derived from: If the source data has a stricter license than CC BY-NC-SA, the LIMA dataset follows the same restrictions. Otherwise, it adheres to the CC BY-NC-SA license.

(1) GAIR/lima · Datasets at Hugging Face. https://huggingface.co/datasets/GAIR/lima. (2) GAIR/lima at main - Hugging Face. https://huggingface.co/datasets/GAIR/lima/tree/main. (3) 日本語LIMAデータセットlima-jaを作成したので公開します. https://zanote.net/ai/lima-ja/. (4) Paper page - LIMA: Less Is More for Alignment - Hugging Face. https://huggingface.co/papers/2305.11206. (5) undefined. https://huggingface.co/datasets/GAIR/lima/.

Search
Clear search
Close search
Google apps
Main menu