2 datasets found
  1. Kaggle LLMSE Dataset

    • kaggle.com
    Updated Oct 18, 2023
    Cite
    Haoquan Fang (2023). Kaggle LLMSE Dataset [Dataset]. https://www.kaggle.com/datasets/hqfang/kaggle-llmse-dataset
    Dataset updated
    Oct 18, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Haoquan Fang
    Description

    deberta-billy is trained locally by @hqfang primarily using @radek1's notebook.

    deberta-lora-lindsey is trained locally by @lindseywei using the LoRA technique.

    deberta-openbook-eric-088 comes from @yuekaixueirc's dataset.

    deberta-openbook-eric-0897 comes from @yuekaixueirc's dataset.

    deberta-openbook-eric-0916 comes from @yuekaixueirc's dataset.

    54k_with_context_v1.csv was created by dropping duplicates from @cdeotte's 60k training data, all_12_with_context2.csv in this dataset.

    54k.csv was created by dropping the context column from 54k_with_context_v1.csv.

    val_with_context_v1.csv was created by adding a context column to @itsuki9180's validation dataset.
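
The two derivation steps described above (dropping duplicate rows, then dropping the context column) can be sketched roughly as follows. This is only an illustration using the Python standard library: the source does not say which tool was used or which columns defined a duplicate, so the toy data, the column names other than context, and the helper names drop_duplicate_rows and drop_column are all assumptions.

```python
import csv
import io


def drop_duplicate_rows(rows):
    """Keep the first occurrence of each fully identical row, preserving order."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row.items())
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def drop_column(rows, column):
    """Return copies of the rows without the named column."""
    return [{k: v for k, v in row.items() if k != column} for row in rows]


# Toy stand-in for all_12_with_context2.csv; the "prompt" and "answer"
# column names are assumptions, not taken from the real file.
SAMPLE_CSV = """prompt,context,answer
What is 2+2?,Basic arithmetic.,A
What is 2+2?,Basic arithmetic.,A
Which planet is largest?,Solar system facts.,B
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
with_context = drop_duplicate_rows(rows)                 # analogous to 54k_with_context_v1.csv
without_context = drop_column(with_context, "context")   # analogous to 54k.csv

print(len(rows), len(with_context), list(without_context[0].keys()))
```

Here a duplicate means a row identical in every column; if the original deduplication compared only a subset of columns, the key built in drop_duplicate_rows would need to be narrowed accordingly.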

  2. financial_data

    • huggingface.co
    Updated Mar 6, 2024
    Cite
    Jeong (2024). financial_data [Dataset]. https://huggingface.co/datasets/csujeong/financial_data
    Dataset updated
    Mar 6, 2024
    Authors
    Jeong
    Description

    This dataset combines Stanford's Alpaca (https://github.com/tatsu-lab/stanford_alpaca) and FiQA (https://sites.google.com/view/fiqa/) with another 1.3k pairs custom-generated using GPT-3.5.

    Script for tuning through Kaggle's (https://www.kaggle.com) free resources using PEFT/LoRA: https://www.kaggle.com/code/gbhacker23/wealth-alpaca-lora

    GitHub repo with performance analyses, training and data generation scripts, and inference notebooks: https://github.com/gaurangbharti1/wealth-alpaca…

    See the full description on the dataset page: https://huggingface.co/datasets/csujeong/financial_data.
