24 datasets found
  1. h

    codenet

    • huggingface.co
    Updated Apr 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IBM Research - University of Illinois Urbana Champaign Discovery Accelerator Institute (2024). codenet [Dataset]. https://huggingface.co/datasets/iidai/codenet
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 22, 2024
    Dataset authored and provided by
    IBM Research - University of Illinois Urbana Champaign Discovery Accelerator Institute
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    iidai/codenet dataset hosted on Hugging Face and contributed by the HF Datasets community

  2. h

    codenet

    • huggingface.co
    Updated Feb 14, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    System K Dev. (2024). codenet [Dataset]. https://huggingface.co/datasets/systemk/codenet
    Explore at:
    Dataset updated
    Feb 14, 2024
    Dataset authored and provided by
    System K Dev.
    License

    https://choosealicense.com/licenses/cdla-permissive-2.0/https://choosealicense.com/licenses/cdla-permissive-2.0/

    Description

    Dataset Card for Dataset Name

    This dataset card aims to be a base template for new datasets. It has been generated using this raw template.

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    Curated by: [More Information Needed] Funded by [optional]: [More Information Needed] Shared by [optional]: [More Information Needed] Language(s) (NLP): [More Information Needed] License: [More Information Needed]

      Dataset Sources [optional]
    

    Repository: [More… See the full description on the dataset page: https://huggingface.co/datasets/systemk/codenet.

  3. h

    CodeNet-16K

    • huggingface.co
    Updated Nov 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sumuk's Archived Content (2024). CodeNet-16K [Dataset]. https://huggingface.co/datasets/sumukshashidhar-archive/CodeNet-16K
    Explore at:
    Dataset updated
    Nov 21, 2024
    Dataset authored and provided by
    Sumuk's Archived Content
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    πŸ“Š Dataset Card for πŸ† CodeNet-16K

      Dataset Summary
    

    The πŸ† CodeNet-16K dataset consists of 16,500 Python attempts from the CodeNet dataset, which have been carefully filtered and deduplicated to create a high-quality dataset for code generation tasks. The dataset includes problem descriptions, input/output descriptions, and sample test cases for each problem.

      Dataset Details
    
    
    
    
    
      Dataset Sources
    

    Repository:… See the full description on the dataset page: https://huggingface.co/datasets/sumukshashidhar-archive/CodeNet-16K.

  4. Codenetpy

    • kaggle.com
    zip
    Updated May 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alex Jercan (2023). Codenetpy [Dataset]. https://www.kaggle.com/datasets/alexjercan/codenetpy
    Explore at:
    zip(35078290 bytes)Available download formats
    Dataset updated
    May 18, 2023
    Authors
    Alex Jercan
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Source code related tasks for machine learning have become important with the large need of software production. In this dataset our main goal is to create a dataset for bug detection and repair.

    Content

    The dataset is based on the CodeNet project and contains python code submissions for online coding competitions. The data is obtained by selecting consecutive attempts of a single user that resulted in fixing a buggy submission. Thus the data is represented by code pairs and annotated by the diff and error of each changed instruction. We have already tokenized all the source code files and kept the same format as in the original dataset.

    Acknowledgements

    CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks

    Inspiration

    Our goal is to create a bug detection and repair pipeline for online coding competition problems.

    • What are the most common mistakes (input, output, solving the problem)?
    • Is there any correlation between using libraries and mistakes in function calls?
    • What type of instruction is labeled as buggy the most (function call, for loop, if statement, binary operations)?
  5. problem descriptions

    • kaggle.com
    zip
    Updated Mar 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    tiffie_1 (2024). problem descriptions [Dataset]. https://www.kaggle.com/tiffie1/problem-descriptions
    Explore at:
    zip(5109571 bytes)Available download formats
    Dataset updated
    Mar 23, 2024
    Authors
    tiffie_1
    Description

    Dataset

    This dataset was created by tiffie_1

    Contents

  6. Z

    Data from: Lost in Translation: A Study of Bugs Introduced by Large Language...

    • nde-dev.biothings.io
    • data.niaid.nih.gov
    Updated Jan 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ibrahimzada, Ali Reza (2024). Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code [Dataset]. https://nde-dev.biothings.io/resources?id=zenodo_8190051
    Explore at:
    Dataset updated
    Jan 25, 2024
    Dataset authored and provided by
    Ibrahimzada, Ali Reza
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Artifact repository for the paper Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code, accepted at ICSE 2024, Lisbon, Portugal. Authors are Rangeet Pan* Ali Reza Ibrahimzada*, Rahul Krishna, Divya Sankar, Lambert Pougeum Wassi, Michele Merler, Boris Sobolev, Raju Pavuluri, Saurabh Sinha, and Reyhaneh Jabbarvand.

    Install

    This repository contains the source code for reproducing the results in our paper. Please start by cloning this repository:

    git clone https://github.com/Intelligent-CAT-Lab/PLTranslationEmpirical

    We recommend using a virtual environment for running the scripts. Please download conda 23.11.0 from this link. You can create a virtual environment using the following command:

    conda create -n plempirical python=3.10.13

    After creating the virtual environment, you can activate it using the following command:

    conda activate plempirical

    You can run the following command to make sure that you are using the correct version of Python:

    python3 --version && pip3 --version

    Dependencies

    To install all software dependencies, please execute the following command:

    pip3 install -r requirements.txt

    As for hardware dependencies, we used 16 NVIDIA A100 GPUs with 80GBs of memory for inferencing models. The models can be inferenced on any combination of GPUs as long as the reader can properly distribute the model weights across the GPUs. We did not perform weight distribution since we had enough memory (80 GB) per GPU.

    Moreover, for compiling and testing the generated translations, we used Python 3.10, g++ 11, GCC Clang 14.0, Java 11, Go 1.20, Rust 1.73, and .Net 7.0.14 for Python, C++, C, Java, Go, Rust, and C#, respectively. Overall, we recommend using a machine with Linux OS and at least 32GB of RAM for running the scripts.

    For running scripts of alternative approaches, you need to make sure you have installed C2Rust, CxGO, and Java2C# on your machine. Please refer to their repositories for installation instructions. For Java2C#, you need to create a .csproj file like below:

    Exe
    net7.0
    enable
    enable
    

    Dataset

    We uploaded the dataset we used in our empirical study to Zenodo. The dataset is organized as follows:

    CodeNet

    AVATAR

    Evalplus

    Apache Commons-CLI

    Click

    Please download and unzip the dataset.zip file from Zenodo. After unzipping, you should see the following directory structure:

    PLTranslationEmpirical β”œβ”€β”€ dataset β”œβ”€β”€ codenet β”œβ”€β”€ avatar β”œβ”€β”€ evalplus β”œβ”€β”€ real-life-cli β”œβ”€β”€ ...

    The structure of each dataset is as follows:

    1. CodeNet & Avatar: Each directory in these datasets correspond to a source language where each include two directories Code and TestCases for code snippets and test cases, respectively. Each code snippet has an id in the filename, where the id is used as a prefix for test I/O files.

    2. Evalplus: The source language code snippets follow a similar structure as CodeNet and Avatar. However, as a one time effort, we manually created the test cases in the target Java language inside a maven project, evalplus_java. To evaluate the translations from an LLM, we recommend moving the generated Java code snippets to the src/main/java directory of the maven project and then running the command mvn clean test surefire-report:report -Dmaven.test.failure.ignore=true to compile, test, and generate reports for the translations.

    3. Real-life Projects: The real-life-cli directory represents two real-life CLI projects from Java and Python. These datasets only contain code snippets as files and no test cases. As mentioned in the paper, the authors manually evaluated the translations for these datasets.

    Scripts

    We provide bash scripts for reproducing our results in this work. First, we discuss the translation script. For doing translation with a model and dataset, first you need to create a .env file in the repository and add the following:

    OPENAI_API_KEY= LLAMA2_AUTH_TOKEN= STARCODER_AUTH_TOKEN=

    1. Translation with GPT-4: You can run the following command to translate all Python -> Java code snippets in codenet dataset with the GPT-4 while top-k sampling is k=50, top-p sampling is p=0.95, and temperature=0.7:

    bash scripts/translate.sh GPT-4 codenet Python Java 50 0.95 0.7 0

    1. Translation with CodeGeeX: Prior to running the script, you need to clone the CodeGeeX repository from here and use the instructions from their artifacts to download their model weights. After cloning it inside PLTranslationEmpirical and downloading the model weights, your directory structure should be like the following:

    PLTranslationEmpirical β”œβ”€β”€ dataset β”œβ”€β”€ codenet β”œβ”€β”€ avatar β”œβ”€β”€ evalplus β”œβ”€β”€ real-life-cli β”œβ”€β”€ CodeGeeX β”œβ”€β”€ codegeex β”œβ”€β”€ codegeex_13b.pt # this file is the model weight β”œβ”€β”€ ... β”œβ”€β”€ ...

    You can run the following command to translate all Python -> Java code snippets in codenet dataset with the CodeGeeX while top-k sampling is k=50, top-p sampling is p=0.95, and temperature=0.2 on GPU gpu_id=0:

    bash scripts/translate.sh CodeGeeX codenet Python Java 50 0.95 0.2 0

    1. For all other models (StarCoder, CodeGen, LLaMa, TB-Airoboros, TB-Vicuna), you can execute the following command to translate all Python -> Java code snippets in codenet dataset with the StarCoder|CodeGen|LLaMa|TB-Airoboros|TB-Vicuna while top-k sampling is k=50, top-p sampling is p=0.95, and temperature=0.2 on GPU gpu_id=0:

    bash scripts/translate.sh StarCoder codenet Python Java 50 0.95 0.2 0

    1. For translating and testing pairs with traditional techniques (i.e., C2Rust, CxGO, Java2C#), you can run the following commands:

    bash scripts/translate_transpiler.sh codenet C Rust c2rust fix_report bash scripts/translate_transpiler.sh codenet C Go cxgo fix_reports bash scripts/translate_transpiler.sh codenet Java C# java2c# fix_reports bash scripts/translate_transpiler.sh avatar Java C# java2c# fix_reports

    1. For compile and testing of CodeNet, AVATAR, and Evalplus (Python to Java) translations from GPT-4, and generating fix reports, you can run the following commands:

    bash scripts/test_avatar.sh Python Java GPT-4 fix_reports 1 bash scripts/test_codenet.sh Python Java GPT-4 fix_reports 1 bash scripts/test_evalplus.sh Python Java GPT-4 fix_reports 1

    1. For repairing unsuccessful translations of Java -> Python in CodeNet dataset with GPT-4, you can run the following commands:

    bash scripts/repair.sh GPT-4 codenet Python Java 50 0.95 0.7 0 1 compile bash scripts/repair.sh GPT-4 codenet Python Java 50 0.95 0.7 0 1 runtime bash scripts/repair.sh GPT-4 codenet Python Java 50 0.95 0.7 0 1 incorrect

    1. For cleaning translations of open-source LLMs (i.e., StarCoder) in codenet, you can run the following command:

    bash scripts/clean_generations.sh StarCoder codenet

    Please note that for the above commands, you can change the dataset and model name to execute the same thing for other datasets and models. Moreover, you can refer to /prompts for different vanilla and repair prompts used in our study.

    Artifacts

    Please download the artifacts.zip file from our Zenodo repository. We have organized the artifacts as follows:

    RQ1 - Translations: This directory contains the translations from all LLMs and for all datasets. We have added an excel file to show a detailed breakdown of the translation results.

    RQ2 - Manual Labeling: This directory contains an excel file which includes the manual labeling results for all translation bugs.

    RQ3 - Alternative Approaches: This directory contains the translations from all alternative approaches (i.e., C2Rust, CxGO, Java2C#). We have added an excel file to show a detailed breakdown of the translation results.

    RQ4 - Mitigating Translation Bugs: This directory contains the fix results of GPT-4, StarCoder, CodeGen, and Llama 2. We have added an excel file to show a detailed breakdown of the fix results.

    Contact

    We look forward to hearing your feedback. Please contact Rangeet Pan or Ali Reza Ibrahimzada for any questions or comments πŸ™.

  7. h

    CodeNet

    • huggingface.co
    Updated Aug 2, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Khiem Le (2017). CodeNet [Dataset]. https://huggingface.co/datasets/lhkhiem28/CodeNet
    Explore at:
    Dataset updated
    Aug 2, 2017
    Authors
    Khiem Le
    Description

    lhkhiem28/CodeNet dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    codenet-compile-errors

    • huggingface.co
    Updated Sep 18, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang Gao (2014). codenet-compile-errors [Dataset]. https://huggingface.co/datasets/criyle/codenet-compile-errors
    Explore at:
    Dataset updated
    Sep 18, 2014
    Authors
    Yang Gao
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    CodeNet Compiler Errors (Re-compiled 2026)

      Dataset Summary
    

    This dataset contains source code submissions from Project CodeNet that fail to compile. Unlike the original dataset metadata (which reflects compiler versions from 2011–2020), this dataset was re-executed in a modern Debian environment (2026) to generate up-to-date compiler error messages. It is designed for research in:

    Automated Program Repair (APR): Fixing compile-time errors. Compiler Error Explanation:… See the full description on the dataset page: https://huggingface.co/datasets/criyle/codenet-compile-errors.

  9. i

    CodeNet BizTech - An Edtech Bootstrapped Company Based Out Of Noida

    • inc42.com
    Updated Dec 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Inc42 Datalabs (2025). CodeNet BizTech - An Edtech Bootstrapped Company Based Out Of Noida [Dataset]. https://inc42.com/company/codenet-biztech/
    Explore at:
    Dataset updated
    Dec 27, 2025
    Dataset provided by
    Inc42
    Authors
    Inc42 Datalabs
    Area covered
    Noida
    Description

    Founded in 2019, CodeNet BizTech operates in the Edtech sector offering advanced digital marketing training and a range of development services including web, software, and app development. The company also provides digital marketing agency services, Android development, graphics design, content writing, and logo design. Its diverse offerings cater to the growing demand for digital skills and development solutions in the current market. CodeNet BizTech aims to equip individuals and businesses with essential tools and knowledge to succeed in the digital landscape.

  10. h

    diverse-codenet

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Didula Samaraweera, diverse-codenet [Dataset]. https://huggingface.co/datasets/didula-wso2/diverse-codenet
    Explore at:
    Authors
    Didula Samaraweera
    Description

    didula-wso2/diverse-codenet dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. i

    AS8227 - CODENET-AS

    • ipv4.dev
    Updated Mar 2, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IPv4.DEV (2026). AS8227 - CODENET-AS [Dataset]. https://ipv4.dev/asn/8227
    Explore at:
    Dataset updated
    Mar 2, 2026
    Dataset provided by
    IPv4.DEV
    Description

    Network information for Autonomous System Number 8227

  12. h

    CodeNet-24K

    • huggingface.co
    Updated Mar 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sumuk's Archived Content (2024). CodeNet-24K [Dataset]. https://huggingface.co/datasets/sumukshashidhar-archive/CodeNet-24K
    Explore at:
    Dataset updated
    Mar 29, 2024
    Dataset authored and provided by
    Sumuk's Archived Content
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    sumukshashidhar-archive/CodeNet-24K dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. r

    Bilanci e Fatturati CODENET SRL

    • reportaziende.it
    Updated Mar 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Media Asset (2025). Bilanci e Fatturati CODENET SRL [Dataset]. https://www.reportaziende.it/codenet_srl_ve_02285120305
    Explore at:
    Dataset updated
    Mar 28, 2025
    Dataset authored and provided by
    Media Asset
    License

    https://www.reportaziende.it/termini_e_condizioni_d_uso_del_serviziohttps://www.reportaziende.it/termini_e_condizioni_d_uso_del_servizio

    Variables measured
    Fatturato 2010, Fatturato 2011
    Measurement technique
    AI-Enhanced Financial Analysis
    Description

    Serie storica del fatturato e indicatori finanziari analizzati tramite intelligenza artificiale.

  14. h

    CodeNet-B

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CoQuIR, CodeNet-B [Dataset]. https://huggingface.co/datasets/CoQuIR/CodeNet-B
    Explore at:
    Authors
    CoQuIR
    Description

    CoQuIR/CodeNet-B dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. r

    Bilanci e Fatturati CODENET SOCIETA' COOPERATIVA

    • reportaziende.it
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Media Asset, Bilanci e Fatturati CODENET SOCIETA' COOPERATIVA [Dataset]. https://www.reportaziende.it/codenet_societa_cooperativa_rm_13443981009
    Explore at:
    Dataset authored and provided by
    Media Asset
    License

    https://www.reportaziende.it/termini_e_condizioni_d_uso_del_serviziohttps://www.reportaziende.it/termini_e_condizioni_d_uso_del_servizio

    Variables measured
    Fatturato 2018, Fatturato 2019, Fatturato 2020, Fatturato 2021, Fatturato 2022, Fatturato 2023
    Measurement technique
    AI-Enhanced Financial Analysis
    Description

    Serie storica del fatturato e indicatori finanziari analizzati tramite intelligenza artificiale.

  16. h

    codenet-python-AST-NIT

    • huggingface.co
    Updated Dec 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ShijiaDong (2025). codenet-python-AST-NIT [Dataset]. https://huggingface.co/datasets/ShijiaD/codenet-python-AST-NIT
    Explore at:
    Dataset updated
    Dec 9, 2025
    Authors
    ShijiaDong
    Description

    ShijiaD/codenet-python-AST-NIT dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. h

    Project_CodeNet_Python800_and_Java250

    • huggingface.co
    Updated May 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    guang yang (2023). Project_CodeNet_Python800_and_Java250 [Dataset]. https://huggingface.co/datasets/qiankunmu/Project_CodeNet_Python800_and_Java250
    Explore at:
    Dataset updated
    May 7, 2023
    Authors
    guang yang
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Introduction

    This is the dataset Project_CodeNet_Python800 and Project_CodeNet_Java250 from Project CodeNet (arxiv). We are not the authors of Project CodeNet, but we are the authors of Heterogeneous Directed Hypergraph Neural Network (HDHGN) in paper Heterogeneous Directed Hypergraph Neural Network over abstract syntax tree (AST) for Code Classification (official, arxiv). Our HDHGN model utilizes the Python800 and Java250 datasets. The original official dataset links Python800 and… See the full description on the dataset page: https://huggingface.co/datasets/qiankunmu/Project_CodeNet_Python800_and_Java250.

  18. h

    codenet_python

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ranran Haoran Zhang, codenet_python [Dataset]. https://huggingface.co/datasets/windchimeran/codenet_python
    Explore at:
    Authors
    Ranran Haoran Zhang
    Description

    This is dataset is extracted from CodeNet, python only. I merged the data into one single table, including metadata, problem description, test input output.

    small: accepted status only big: all status, including accepted

  19. h

    ds4sd-synth-code-net-small

    • huggingface.co
    Updated Aug 4, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    deep copy (2025). ds4sd-synth-code-net-small [Dataset]. https://huggingface.co/datasets/deepcopy/ds4sd-synth-code-net-small
    Explore at:
    Dataset updated
    Aug 4, 2025
    Authors
    deep copy
    Description

    deepcopy/ds4sd-synth-code-net-small dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    code_contests

    • huggingface.co
    Updated Feb 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Imandra Inc (2022). code_contests [Dataset]. https://huggingface.co/datasets/Imandra/code_contests
    Explore at:
    Dataset updated
    Feb 8, 2022
    Dataset authored and provided by
    Imandra Inc
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for CodeContests

      Dataset Summary
    

    CodeContests is a competitive programming dataset for machine-learning. This dataset was used when training AlphaCode. It consists of programming problems, from a variety of sources:

    Site URL Source

    Aizu https://judge.u-aizu.ac.jp CodeNet

    AtCoder https://atcoder.jp CodeNet

    CodeChef https://www.codechef.com description2code

    Codeforces https://codeforces.com description2code and Codeforces

    HackerEarth… See the full description on the dataset page: https://huggingface.co/datasets/Imandra/code_contests.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
IBM Research - University of Illinois Urbana Champaign Discovery Accelerator Institute (2024). codenet [Dataset]. https://huggingface.co/datasets/iidai/codenet

codenet

iidai/codenet

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 22, 2024
Dataset authored and provided by
IBM Research - University of Illinois Urbana Champaign Discovery Accelerator Institute
License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

iidai/codenet dataset hosted on Hugging Face and contributed by the HF Datasets community

Search
Clear search
Close search
Google apps
Main menu