1 dataset found
  1. multilingual-NLI-26lang-2mil7

    • huggingface.co
    Cite
    Moritz Laurer, multilingual-NLI-26lang-2mil7 [Dataset]. https://huggingface.co/datasets/MoritzLaurer/multilingual-NLI-26lang-2mil7
    Authors
    Moritz Laurer
    Description

    Datasheet for the dataset: multilingual-NLI-26lang-2mil7

      Dataset Summary
    

    This dataset contains 2 730 000 NLI text pairs in 26 languages spoken by more than 4 billion people. The dataset can be used to train models for multilingual NLI (Natural Language Inference) or zero-shot classification. The dataset is based on the English datasets MultiNLI, Fever-NLI, ANLI, LingNLI and WANLI and was created using the latest open-source machine translation models. The dataset is… See the full description on the dataset page: https://huggingface.co/datasets/MoritzLaurer/multilingual-NLI-26lang-2mil7.
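    As a rough illustration of how the dataset might be inspected before training, the sketch below uses the Hugging Face `datasets` library. The repository ID comes from the citation URL above; the helper names (`average_pairs_per_language`, `list_splits`, `load_split`) are hypothetical and introduced only for this example. The split names are not stated in this listing, so the sketch asks the Hub for them rather than assuming a naming scheme.

```python
"""Sketch: inspect multilingual-NLI-26lang-2mil7 with the Hugging Face `datasets` library."""

# Repository ID taken from the citation URL in the listing.
REPO_ID = "MoritzLaurer/multilingual-NLI-26lang-2mil7"

# Figures quoted in the dataset summary.
TOTAL_PAIRS = 2_730_000
NUM_LANGUAGES = 26


def average_pairs_per_language(total_pairs=TOTAL_PAIRS, num_languages=NUM_LANGUAGES):
    """Return the mean number of text pairs per language.

    This is only an average derived from the summary figures; the listing
    does not say how pairs are actually distributed across languages.
    """
    return total_pairs / num_languages


def list_splits(repo_id=REPO_ID):
    """Query the Hub for the available split names (requires network access)."""
    # Imported lazily so the arithmetic helper works without the library installed.
    from datasets import get_dataset_split_names  # pip install datasets

    return get_dataset_split_names(repo_id)


def load_split(split, repo_id=REPO_ID):
    """Download and return a single split (requires network access)."""
    from datasets import load_dataset

    return load_dataset(repo_id, split=split)
```

    A typical session would call `list_splits()` first, pick one of the returned names, and pass it to `load_split(...)`. Column names and label encodings should be checked on the dataset page before training, since they are not given here.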


9 scholarly articles cite this dataset.
