56 datasets found
  1. spider

    • huggingface.co
    • opendatalab.com
    Updated Dec 9, 2021
    Cite
    XLang NLP Lab (2021). spider [Dataset]. https://huggingface.co/datasets/xlangai/spider
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 9, 2021
    Dataset authored and provided by
    XLang NLP Lab
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Dataset Card for Spider

      Dataset Summary
    

    Spider is a large-scale complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale students. The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases.

      Supported Tasks and Leaderboards
    

    The leaderboard can be seen at https://yale-lily.github.io/spider

      Languages
    

    The text in the dataset is in English.

      Dataset Structure
    
    
    
    
    
      Data… See the full description on the dataset page: https://huggingface.co/datasets/xlangai/spider.
    
  2. spider-realistic

    • huggingface.co
    • opendatalab.com
    Updated Feb 17, 2024
    Cite
    AhernTech s.r.o. (2024). spider-realistic [Dataset]. https://huggingface.co/datasets/aherntech/spider-realistic
    Dataset updated
    Feb 17, 2024
    Dataset authored and provided by
    AhernTech s.r.o.
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Spider-Realistic

    This dataset variant contains only the Spider Realistic dataset used in "Structure-Grounded Pretraining for Text-to-SQL". The dataset is based on the dev split of the Spider dataset (2020-06-07 version from https://yale-lily.github.io/spider). Its authors modified the original questions to remove explicit mentions of column names while keeping the SQL queries unchanged, to better evaluate the model's capability in aligning… See the full description on the dataset page: https://huggingface.co/datasets/aherntech/spider-realistic.

  3. spider-schema

    • huggingface.co
    Updated Jul 19, 2023
    Cite
    Richard R. (2023). spider-schema [Dataset]. https://huggingface.co/datasets/richardr1126/spider-schema
    Dataset updated
    Jul 19, 2023
    Authors
    Richard R.
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Spider Schema

      Dataset Summary
    

    Spider is a large-scale complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale students. The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases. This dataset contains the 166 databases used in the Spider dataset.

      Yale Lily Spider Leaderboards
    

    The leaderboard can be seen at https://yale-lily.github.io/spider

      Languages
    

    The text in… See the full description on the dataset page: https://huggingface.co/datasets/richardr1126/spider-schema.

  4. spider-en-pt-es-fr

    • huggingface.co
    Updated Jan 16, 2024
    Cite
    Marcelo Archanjo Jose (2024). spider-en-pt-es-fr [Dataset]. https://huggingface.co/datasets/Marchanjo/spider-en-pt-es-fr
    Dataset updated
    Jan 16, 2024
    Authors
    Marcelo Archanjo Jose
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Distributed under CC BY-SA 4.0, respecting the ShareAlike terms of the Spider dataset. Code explanations and links to the model checkpoints and datasets are in the mRAT-SQL GitHub repository. The model checkpoints and datasets can also be downloaded from the Hugging Face collection, but the GitHub repository is the better place to understand them.

      mRAT-SQL-FIT
    
    
    
    
    
    
    
      A Multilingual Translator to SQL with Database Schema Pruning to Improve Self-Attention
    

    Marcelo Archanjo Jose, Fabio… See the full description on the dataset page: https://huggingface.co/datasets/Marchanjo/spider-en-pt-es-fr.

  5. fixed_spider

    • huggingface.co
    Updated Jun 14, 2024
    Cite
    Turbular (2024). fixed_spider [Dataset]. https://huggingface.co/datasets/Turbular/fixed_spider
    Dataset updated
    Jun 14, 2024
    Dataset authored and provided by
    Turbular
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Cleaned Spider Dataset for Text2SQL

      Dataset Summary
    

    The Cleaned Spider Dataset for Text2SQL is an improved version of the original Spider dataset, which is a large-scale, complex, and cross-domain semantic parsing and text-to-SQL dataset. This enhanced version addresses several critical issues found in the original dataset, ensuring higher quality and reliability for training text-to-SQL models. The enhancements were made possible through Turbular's advanced data… See the full description on the dataset page: https://huggingface.co/datasets/Turbular/fixed_spider.

  6. spider-sql

    • huggingface.co
    Updated Jun 30, 2024
    Cite
    Chapaneri (2024). spider-sql [Dataset]. https://huggingface.co/datasets/radhikachapaneri/spider-sql
    Dataset updated
    Jun 30, 2024
    Authors
    Chapaneri
    Description

    radhikachapaneri/spider-sql dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. spider-skeleton-context-instruct

    • huggingface.co
    Updated Aug 9, 2023
    Cite
    Richard R. (2023). spider-skeleton-context-instruct [Dataset]. https://huggingface.co/datasets/richardr1126/spider-skeleton-context-instruct
    Dataset updated
    Aug 9, 2023
    Authors
    Richard R.
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Spider Skeleton Context Instruct

      Dataset Summary
    

    Spider is a large-scale complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale students. The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases. This dataset was created to fine-tune LLMs in a "### Instruction:" and "### Response:" format with database context.

      Yale Lily Spider Leaderboards
    

    The leaderboard can be seen at… See the full description on the dataset page: https://huggingface.co/datasets/richardr1126/spider-skeleton-context-instruct.
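
The "### Instruction:" / "### Response:" layout named in the summary for this entry can be sketched roughly as follows. This is a minimal illustration only: the `build_prompt` helper, the instruction wording, and the example schema and question are assumptions made for this sketch, not the template actually used by richardr1126/spider-skeleton-context-instruct.

```python
def build_prompt(question: str, schema: str, response: str = "") -> str:
    """Format one text-to-SQL example in an Instruction/Response layout.

    Hypothetical helper: the real dataset's wording and fields may differ.
    """
    instruction = (
        "Convert the question to a SQL query using the database schema.\n"
        f"Schema:\n{schema}\n"
        f"Question: {question}"
    )
    return f"### Instruction:\n{instruction}\n\n### Response:\n{response}"


# Illustrative schema and question (not taken from the dataset).
example = build_prompt(
    question="How many singers are there?",
    schema="CREATE TABLE singer (singer_id INTEGER, name TEXT)",
    response="SELECT count(*) FROM singer",
)
print(example)
```

Leaving `response` empty yields a prompt that ends at the "### Response:" marker, which is the usual shape for inference-time input in this style of template.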

  8. SPIDER

    • huggingface.co
    Updated Feb 24, 2024
    Cite
    Chris Oswald (2024). SPIDER [Dataset]. https://huggingface.co/datasets/cdoswald/SPIDER
    Dataset updated
    Feb 24, 2024
    Authors
    Chris Oswald
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is a large publicly available multi-center lumbar spine magnetic resonance imaging (MRI) dataset with reference segmentations of vertebrae, intervertebral discs (IVDs), and spinal canal. The dataset includes 447 sagittal T1 and T2 MRI series from 218 studies of 218 patients with a history of low back pain. The data was collected from four different hospitals. There is an additional hidden test set, not available here, used in the accompanying SPIDER challenge on spider.grand-challenge.org. We share this data to encourage wider participation and collaboration in the field of spine segmentation, and ultimately improve the diagnostic value of lumbar spine MRI.

    This file also provides the biological sex of all patients and the age of those patients for whom it was available. It also includes a number of scanner and acquisition parameters for each individual MRI study. The dataset also comes with radiological gradings, found in a separate file, for the following degenerative changes:

    1. Modic changes (type I, II or III)
    2. Upper and lower endplate changes / Schmorl nodes (binary)
    3. Spondylolisthesis (binary)
    4. Disc herniation (binary)
    5. Disc narrowing (binary)
    6. Disc bulging (binary)
    7. Pfirrmann grade (grade 1 to 5)

    All radiological gradings are provided per IVD level.

    Repository: https://zenodo.org/records/10159290
    Paper: https://www.nature.com/articles/s41597-024-03090-w

  9. spider-corpus-validation

    • huggingface.co
    Updated Sep 21, 2024
    Cite
    TARGET Benchmark (2024). spider-corpus-validation [Dataset]. https://huggingface.co/datasets/target-benchmark/spider-corpus-validation
    Dataset updated
    Sep 21, 2024
    Authors
    TARGET Benchmark
    Description

    Link to original dataset: https://yale-lily.github.io/spider

    Yu, T., Zhang, R., Yang, K., Yasunaga, M., Wang, D., Li, Z., Ma, J., Li, I., Yao, Q., Roman, S. and Zhang, Z., 2018. Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-SQL task. arXiv preprint arXiv:1809.08887.

  10. spider

    • huggingface.co
    Updated Jun 30, 2024
    Cite
    hy (2024). spider [Dataset]. https://huggingface.co/datasets/hyess/spider
    Dataset updated
    Jun 30, 2024
    Authors
    hy
    License

    https://choosealicense.com/licenses/other/

    Description

    hyess/spider dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. sql-create-context

    • opendatalab.com
    • huggingface.co
    zip
    Updated Apr 21, 2023
    Cite
    (2023). sql-create-context [Dataset]. https://opendatalab.com/OpenDataLab/sql-create-context
    Available download formats: zip
    Dataset updated
    Apr 21, 2023
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset builds on WikiSQL and Spider. It contains 78,577 examples, each consisting of a natural language query, SQL CREATE TABLE statements, and a SQL query that answers the question using the CREATE statement as context. The dataset was built with text-to-SQL LLMs in mind and is intended to prevent the hallucination of column and table names often seen in models trained on text-to-SQL datasets. The CREATE TABLE statement can often be copied and pasted from different DBMSs and provides table names, column names, and their data types. By providing just the CREATE TABLE statement as context, we can hopefully give models better grounding without having to supply actual rows of data, which limits token usage and exposure to private, sensitive, or proprietary data.
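
One way to see why a CREATE TABLE statement is enough context to catch hallucinated column names is to compile a candidate query against an empty database built from that statement. The sketch below is an illustration under stated assumptions: it uses Python's built-in `sqlite3` as a stand-in DBMS, the `query_runs_against_schema` helper is hypothetical, and the schema and queries are illustrative values rather than records quoted from the dataset.

```python
import sqlite3


def query_runs_against_schema(create_sql: str, query: str) -> bool:
    """Return True if `query` compiles against the schema in `create_sql`.

    Hypothetical helper; SQLite stands in for whichever DBMS the schema came from.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(create_sql)    # build empty tables from the context
        conn.execute(f"EXPLAIN {query}")  # compile the query; no rows are needed
        return True
    except sqlite3.Error:
        return False
    finally:
        conn.close()


# Illustrative context and queries (assumed values).
context = "CREATE TABLE head (age INTEGER, name TEXT);"
good = "SELECT COUNT(*) FROM head WHERE age > 56"
bad = "SELECT COUNT(*) FROM head WHERE salary > 56"  # column not in the schema

print(query_runs_against_schema(context, good))
print(query_runs_against_schema(context, bad))
```

Because only the schema is materialized, the check never touches real rows, which matches the description's point about limiting exposure to private data.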

  12. spider-ko

    • huggingface.co
    Updated Jun 3, 2025
    Cite
    Hugging Face KREW (2025). spider-ko [Dataset]. https://huggingface.co/datasets/huggingface-KREW/spider-ko
    Dataset updated
    Jun 3, 2025
    Dataset provided by
    Hugging Face: https://huggingface.co/
    Authors
    Hugging Face KREW
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Dataset Card for spider-ko: Korean Text-to-SQL Dataset

      Dataset Summary

    Spider-KO is a text-to-SQL dataset created by translating Yale University's Spider dataset into Korean. It was built by translating the natural language questions of the original Spider dataset into Korean. The dataset contains questions over databases from a variety of domains together with the corresponding SQL queries, and it can be used to develop and evaluate Korean text-to-SQL models.

      Supported Tasks and Leaderboards

    text-to-sql: Used for the task of converting Korean natural language questions into SQL queries.

      Languages

    The questions in the dataset were translated into Korean (ko), while the SQL queries remain English-based. The original English questions are also provided.

      Dataset Structure

      Data Fields

    db_id… See the full description on the dataset page: https://huggingface.co/datasets/huggingface-KREW/spider-ko.

  13. spider-context-validation

    • huggingface.co
    Updated Jul 26, 2023
    Cite
    Richard R. (2023). spider-context-validation [Dataset]. https://huggingface.co/datasets/richardr1126/spider-context-validation
    Dataset updated
    Jul 26, 2023
    Authors
    Richard R.
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Spider Context Validation

      Dataset Summary
    

    Spider is a large-scale complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale students. The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases. This dataset was created to validate Spider-fine-tuned LLMs with database context.

      Yale Lily Spider Leaderboards
    

    The leaderboard can be seen at https://yale-lily.github.io/spider… See the full description on the dataset page: https://huggingface.co/datasets/richardr1126/spider-context-validation.

  14. spider

    • huggingface.co
    Updated Apr 18, 2025
    Cite
    Simone PAPICCHIO (2025). spider [Dataset]. https://huggingface.co/datasets/simone-papicchio/spider
    Dataset updated
    Apr 18, 2025
    Authors
    Simone PAPICCHIO
    Description

    simone-papicchio/spider dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. spider

    • huggingface.co
    Updated Apr 2, 2024
    Cite
    free (2024). spider [Dataset]. https://huggingface.co/datasets/nanina1/spider
    Dataset updated
    Apr 2, 2024
    Authors
    free
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    nanina1/spider dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. spider-FIT-en-enr-enb

    • huggingface.co
    Updated Jan 16, 2024
    Cite
    Marcelo Archanjo Jose (2024). spider-FIT-en-enr-enb [Dataset]. https://huggingface.co/datasets/Marchanjo/spider-FIT-en-enr-enb
    Dataset updated
    Jan 16, 2024
    Authors
    Marcelo Archanjo Jose
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Distributed under CC BY-SA 4.0, respecting the ShareAlike terms of the Spider dataset. Code explanations and links to the model checkpoints and datasets are in the mRAT-SQL GitHub repository. The model checkpoints and datasets can also be downloaded from the Hugging Face collection, but the GitHub repository is the better place to understand them.

      mRAT-SQL-FIT
    
    
    
    
    
    
    
      A Multilingual Translator to SQL with Database Schema Pruning to Improve Self-Attention
    

    Marcelo Archanjo Jose, Fabio… See the full description on the dataset page: https://huggingface.co/datasets/Marchanjo/spider-FIT-en-enr-enb.

  17. new-spider-HM

    • huggingface.co
    Updated Feb 29, 2024
    Cite
    HUSNA M (2024). new-spider-HM [Dataset]. https://huggingface.co/datasets/HusnaManakkot/new-spider-HM
    Dataset updated
    Feb 29, 2024
    Authors
    HUSNA M
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Spider

      Dataset Summary
    

    Spider is a large-scale complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale students. The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases.

      Supported Tasks and Leaderboards
    

    The leaderboard can be seen at https://yale-lily.github.io/spider

      Languages
    

    The text in the dataset is in English.

      Dataset Structure
    
    
    
    
    
      Data… See the full description on the dataset page: https://huggingface.co/datasets/HusnaManakkot/new-spider-HM.
    
  18. ORPRO-Spider-SQL-Filtered

    • huggingface.co
    Updated Jan 9, 2025
    Cite
    Maurice (2025). ORPRO-Spider-SQL-Filtered [Dataset]. https://huggingface.co/datasets/mjerome89/ORPRO-Spider-SQL-Filtered
    Dataset updated
    Jan 9, 2025
    Authors
    Maurice
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    mjerome89/ORPRO-Spider-SQL-Filtered dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. spider-tableQA

    • huggingface.co
    Updated Feb 20, 2024
    Cite
    Vaishali Pal (2024). spider-tableQA [Dataset]. https://huggingface.co/datasets/vaishali/spider-tableQA
    Dataset updated
    Feb 20, 2024
    Authors
    Vaishali Pal
    Description

    Dataset Card for "spider-tableQA"

      Usage
    

    import pandas as pd
    from datasets import load_dataset

    spider_tableQA = load_dataset("vaishali/spider-tableQA")

    for sample in spider_tableQA['train']:
        question = sample['question']
        sql_query = sample['query']
        input_table_names = sample["table_names"]
        # Tables and the answer are stored as JSON strings in pandas' 'split' orientation.
        input_tables = [pd.read_json(table, orient='split') for table in sample['tables']]
        answer = pd.read_json(sample['answer'], orient='split')

    # flattened input/output… See the full description on the dataset page: https://huggingface.co/datasets/vaishali/spider-tableQA.

  20. spider

    • huggingface.co
    Updated Aug 30, 2015
    Cite
    Karlen Nerkararian (2015). spider [Dataset]. https://huggingface.co/datasets/karlen532/spider
    Dataset updated
    Aug 30, 2015
    Authors
    Karlen Nerkararian
    Description

    karlen532/spider dataset hosted on Hugging Face and contributed by the HF Datasets community

xlangai/spider: 2 scholarly articles cite this dataset (View in Google Scholar)