the-vault-function
huggingface.co
Updated Dec 14, 2023
Cite
FPT Software AI Center (2023). the-vault-function [Dataset]. https://huggingface.co/datasets/Fsoft-AIC/the-vault-function
Dataset updated
Dec 14, 2023
Dataset authored and provided by
FPT Software AI Center
License
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Description
The Vault is a multilingual code-text dataset with over 40 million pairs covering 10 popular programming languages. It is described as the largest corpus of parallel code-text data. Built on The Stack, a massive collection of raw code samples, The Vault offers a comprehensive, clean resource for research in code understanding and generation. In addition to function-level pairs, it provides high-quality code-text pairs at the class and inline levels, so it can support tasks at several granularities.
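To make "function-level code-text pair" concrete, here is a minimal sketch that pairs each documented Python function with its docstring using the standard library `ast` module. It is an illustration only, not The Vault's extraction pipeline: the record keys (`name`, `code`, `docstring`), the helper `extract_pairs`, and the sample source are all hypothetical and do not reflect the dataset's actual schema.

```python
import ast


def extract_pairs(source: str) -> list[dict]:
    """Return one record per documented function found in `source`.

    Hypothetical record layout (not the dataset's real schema):
    {"name": ..., "code": ..., "docstring": ...}
    """
    tree = ast.parse(source)
    pairs = []
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            doc = ast.get_docstring(node)
            if doc:  # skip undocumented functions: there is no text side to pair
                pairs.append(
                    {
                        "name": node.name,
                        # Exact source text of the function, docstring included.
                        "code": ast.get_source_segment(source, node),
                        "docstring": doc,
                    }
                )
    return pairs


# Illustrative input: one documented function and one undocumented helper.
SAMPLE = '''
def add(a, b):
    """Return the sum of a and b."""
    return a + b

def helper(x):
    return x * 2
'''

pairs = extract_pairs(SAMPLE)
for pair in pairs:
    print(pair["name"], "->", pair["docstring"])
```

Only `add` yields a pair here, because `helper` has no docstring to serve as the text side. Class-level and inline-level pairs would need different extraction rules, which this sketch does not attempt.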
35 scholarly articles cite this dataset.