The LIMA dataset is a valuable resource used in natural language processing (NLP) research. Let me provide you with some details:
Origin and Purpose: The LIMA dataset consists of approximately 1,000 carefully curated prompts and responses, introduced in the paper "LIMA: Less Is More for Alignment."
It was used to fine-tune the 65-billion-parameter LLaMa language model, producing the LIMA model.
Performance and Applications:
The LIMA model demonstrates remarkable performance, learning to follow specific response formats from only a handful of examples in the training data. The dataset covers a wide range of tasks, from complex queries such as planning trip itineraries to speculating about alternate history.
Interestingly, the model also tends to generalize well to unseen tasks that did not appear in the training data.
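As a concrete illustration of the data layout described above: per the GAIR/lima dataset card, each record stores a conversation as a list of alternating prompt/response strings under a "conversations" key, alongside a "source" field. The sketch below pairs those turns locally; the sample record and its field values are invented for illustration, not taken from the real dataset.

```python
# Sketch: pairing a LIMA-style record's alternating conversation strings
# into (prompt, response) tuples. The record layout follows the GAIR/lima
# dataset card; the sample record itself is hypothetical.

def to_turns(record):
    """Pair alternating conversation strings into (prompt, response) tuples."""
    conv = record["conversations"]
    return list(zip(conv[0::2], conv[1::2]))

sample = {
    "conversations": [
        "Plan a three-day trip itinerary for Lisbon.",
        "Day 1: explore the Alfama district...",
    ],
    "source": "stackexchange",  # hypothetical value
}

print(to_turns(sample))

# With the `datasets` library installed, the real data can be loaded with:
#   from datasets import load_dataset
#   lima = load_dataset("GAIR/lima", split="train")
```

This keeps the example self-contained (no download required) while showing the transformation one would typically apply before fine-tuning.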
License:
The license of the LIMA dataset depends on the source data from which each example was drawn: if the source data carries a stricter license than CC BY-NC-SA, the corresponding portion of LIMA follows those restrictions; otherwise, the dataset is released under the CC BY-NC-SA license.
Sources:
(1) GAIR/lima · Datasets at Hugging Face. https://huggingface.co/datasets/GAIR/lima.
(2) GAIR/lima at main - Hugging Face. https://huggingface.co/datasets/GAIR/lima/tree/main.
(3) Announcing lima-ja, a Japanese LIMA dataset. https://zanote.net/ai/lima-ja/.
(4) Paper page - LIMA: Less Is More for Alignment - Hugging Face. https://huggingface.co/papers/2305.11206.