12 datasets found
  1. h

    beyond_dpo_vi

    • huggingface.co
    Updated Aug 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). beyond_dpo_vi [Dataset]. https://huggingface.co/datasets/lamhieu/beyond_dpo_vi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from unknown, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and follow this account.… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/beyond_dpo_vi.

  2. h

    math_arxiv_dialogue_en

    • huggingface.co
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). math_arxiv_dialogue_en [Dataset]. https://huggingface.co/datasets/lamhieu/math_arxiv_dialogue_en
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from unknown, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and follow this account.… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/math_arxiv_dialogue_en.

  3. h

    itorca_dpo_vi

    • huggingface.co
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). itorca_dpo_vi [Dataset]. https://huggingface.co/datasets/lamhieu/itorca_dpo_vi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from unknown, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and follow this account.… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/itorca_dpo_vi.

  4. h

    medical_wikidoc_dialogue_en

    • huggingface.co
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). medical_wikidoc_dialogue_en [Dataset]. https://huggingface.co/datasets/lamhieu/medical_wikidoc_dialogue_en
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    Description

    Description

    The dataset is from medalpaca/medical_meadow_wikidoc, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and follow… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/medical_wikidoc_dialogue_en.

  5. h

    medical_pubmed_dialogue_en

    • huggingface.co
    Updated Aug 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). medical_pubmed_dialogue_en [Dataset]. https://huggingface.co/datasets/lamhieu/medical_pubmed_dialogue_en
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    Description

    Description

    The dataset is from medalpaca/medical_meadow_pubmed_causal, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/medical_pubmed_dialogue_en.

  6. h

    oasst_dialogue_vi

    • huggingface.co
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). oasst_dialogue_vi [Dataset]. https://huggingface.co/datasets/lamhieu/oasst_dialogue_vi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from unknown, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and follow this account.… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/oasst_dialogue_vi.

  7. h

    alpaca_gpt4_dialogue_vi

    • huggingface.co
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). alpaca_gpt4_dialogue_vi [Dataset]. https://huggingface.co/datasets/lamhieu/alpaca_gpt4_dialogue_vi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from 5CD-AI/Vietnamese-c-s-ale-alpaca-gpt4-data-gg-translated, formatted as dialogues for speed and ease of use. Many thanks to 5CD-AI for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/alpaca_gpt4_dialogue_vi.

  8. h

    mabrycodes_dialogue_vi

    • huggingface.co
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). mabrycodes_dialogue_vi [Dataset]. https://huggingface.co/datasets/lamhieu/mabrycodes_dialogue_vi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from 5CD-AI/Vietnamese-mabryCodes-tiny-cot-alpaca-gg-translated, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/mabrycodes_dialogue_vi.

  9. h

    slwiki_dialogue_vi

    • huggingface.co
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). slwiki_dialogue_vi [Dataset]. https://huggingface.co/datasets/lamhieu/slwiki_dialogue_vi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from unknown, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and follow this account.… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/slwiki_dialogue_vi.

  10. h

    beyond_dialogue_vi

    • huggingface.co
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). beyond_dialogue_vi [Dataset]. https://huggingface.co/datasets/lamhieu/beyond_dialogue_vi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from unknown, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and follow this account.… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/beyond_dialogue_vi.

  11. h

    alpaca_multiturns_dialogue_vi

    • huggingface.co
    Updated Aug 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). alpaca_multiturns_dialogue_vi [Dataset]. https://huggingface.co/datasets/lamhieu/alpaca_multiturns_dialogue_vi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from 5CD-AI/Vietnamese-Multi-turn-Chat-Alpaca, formatted as dialogues for speed and ease of use. Many thanks to 5CD-AI for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    View online through viewer.

      Note
    

    We advise you to reconsider before use, thank you. If you find it useful, please like and… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/alpaca_multiturns_dialogue_vi.

  12. h

    translate_tinystories_dialogue_envi

    • huggingface.co
    Updated Aug 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hieu Lam (2024). translate_tinystories_dialogue_envi [Dataset]. https://huggingface.co/datasets/lamhieu/translate_tinystories_dialogue_envi
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2024
    Authors
    Hieu Lam
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Description

    The dataset is from vilm/tinystories-envi, formatted as dialogues for speed and ease of use. Many thanks to vilm for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

      Structure
    

    Data is created from "English - Vietnamese" or "Vietnamese - English" translation data pairs with prompts to specify for the model. Here is a sample: [ {… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/translate_tinystories_dialogue_envi.

  13. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hieu Lam (2024). beyond_dpo_vi [Dataset]. https://huggingface.co/datasets/lamhieu/beyond_dpo_vi

beyond_dpo_vi

lamhieu/beyond_dpo_vi

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 18, 2024
Authors
Hieu Lam
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

Description

The dataset is from unknown, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it. Importantly, this format is easy to use via the default chat template of transformers, meaning you can use huggingface/alignment-handbook immediately, unsloth.

  Structure

View online through viewer.

  Note

We advise you to reconsider before use, thank you. If you find it useful, please like and follow this account.… See the full description on the dataset page: https://huggingface.co/datasets/lamhieu/beyond_dpo_vi.

Search
Clear search
Close search
Google apps
Main menu