71 datasets found
  1. h

    mbpp

    • huggingface.co
    Updated Feb 4, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Niklas Muennighoff (2022). mbpp [Dataset]. https://huggingface.co/datasets/Muennighoff/mbpp
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 4, 2022
    Authors
    Niklas Muennighoff
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The MBPP (Mostly Basic Python Problems) dataset consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry level programmers, covering programming fundamentals, standard library functionality, and so on. Each problem consists of a task description, code solution and 3 automated test cases.

  2. h

    mbpp

    • huggingface.co
    Updated Sep 15, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nathan lile (2022). mbpp [Dataset]. https://huggingface.co/datasets/nlile/mbpp
    Explore at:
    Dataset updated
    Sep 15, 2022
    Authors
    nathan lile
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Mostly Basic Python Problems (mbpp)

      Dataset Summary
    

    The benchmark consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry level programmers, covering programming fundamentals, standard library functionality, and so on. Each problem consists of a task description, code solution and 3 automated test cases. As described in the paper, a subset of the data has been hand-verified by us. Released here as part of… See the full description on the dataset page: https://huggingface.co/datasets/nlile/mbpp.

  3. t

    MBPP - Dataset - LDM

    • service.tib.eu
    Updated Jan 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). MBPP - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/mbpp
    Explore at:
    Dataset updated
    Jan 3, 2025
    Description

    The dataset used in the paper for code generation

  4. MBPP Sanitized

    • kaggle.com
    zip
    Updated Nov 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mrigank Pawagi (2023). MBPP Sanitized [Dataset]. https://www.kaggle.com/datasets/mrigankpawagi/mbpp-sanitized
    Explore at:
    zip(57766 bytes)Available download formats
    Dataset updated
    Nov 23, 2023
    Authors
    Mrigank Pawagi
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Mrigank Pawagi

    Released under CC0: Public Domain

    Contents

  5. h

    mbpp

    • huggingface.co
    Updated Jun 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CodeRAG-Bench (2024). mbpp [Dataset]. https://huggingface.co/datasets/code-rag-bench/mbpp
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 2, 2024
    Dataset authored and provided by
    CodeRAG-Bench
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    MBPP dataset annotated with ground-truth programming solutions, to enable evaluations for retrieval and retrieval-augmented code generation. Please refer to code-rag-bench for more details.

  6. h

    MBPP

    • huggingface.co
    Updated Jun 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Princeton-AI (2025). MBPP [Dataset]. https://huggingface.co/datasets/Gen-Verse/MBPP
    Explore at:
    Dataset updated
    Jun 4, 2025
    Dataset authored and provided by
    Princeton-AI
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Gen-Verse/MBPP dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    mbppplus

    • huggingface.co
    Updated Aug 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    EvalPlus (2025). mbppplus [Dataset]. https://huggingface.co/datasets/evalplus/mbppplus
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 21, 2025
    Dataset authored and provided by
    EvalPlus
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    evalplus/mbppplus dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    mbpp-new-dataset

    • huggingface.co
    Updated Sep 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    siqi zhu (2024). mbpp-new-dataset [Dataset]. https://huggingface.co/datasets/zsqzz/mbpp-new-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 10, 2024
    Authors
    siqi zhu
    Description

    zsqzz/mbpp-new-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    mbpp

    • huggingface.co
    Updated Sep 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vira (2022). mbpp [Dataset]. https://huggingface.co/datasets/jash404/mbpp
    Explore at:
    Dataset updated
    Sep 15, 2022
    Authors
    Vira
    Description

    jash404/mbpp dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. AOCG

    • figshare.com
    zip
    Updated Dec 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jia Li (2023). AOCG [Dataset]. http://doi.org/10.6084/m9.figshare.24763701.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 7, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Jia Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The replication package of AOCGThe repository is divided into two parts: datasets and the code of our AOCG method.### Requirements- python 3.8- Java 1.8.0- transformers 4.5.1- tree-sitter 0.2.2- Pytorch 1.7.1### Data PreprocessingExperimental datasets contain the API_SUM dataset, the Hearthstone dataset, and the MBPP dataset. We use tree sitter tool to automatically extract the API terms and sketches of programs.Take the MBPP dataset as an example:To extract API terms, run 'data_process/api_extract.py' and acquire the 'api_terms.jsonl' To extract sketches, run 'data_process/sketch_extract.py' and acquire the 'sketches.jsonl' Put the API terms, sketches, complete codes, and requirements into the 'final_train.jsonl' and 'final_test.jsonl'.### TrainingGiven a specific requirement, the APIer predicts API terms, and the Sketcher outputs corresponding the sketch based on the API terms and requirements. And the Coder fills the sketch to a complete program according to the API terms, sketch and requirement.export CUDA_VISIBLE_DEVICES=0python AOCG_finetune.py \--stage_1 nl_pp \--stage_2 nl_pp_ss \--stage_3 nl_ss_pp_code \--local_rank -1### InferenceThe AOCG predicts code snippets in a progressive generation manner, and write the predicted codes into 'xx.output'.export CUDA_VISIBLE_DEVICES=0python AOCG_inference.py \--stage_1 nl_pp \--stage_2 nl_pp_ss \--stage_3 nl_ss_pp_code \--local_rank -1### EvaluationAfter acquiring the generated codes, evaluate the programs by running 'evaluator/evaluation.py'.

  11. mbpp-pro

    • huggingface.co
    Updated Dec 31, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CodeEval-Pro (2024). mbpp-pro [Dataset]. https://huggingface.co/datasets/CodeEval-Pro/mbpp-pro
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 31, 2024
    Dataset provided by
    CodeEval, Inc.
    Authors
    CodeEval-Pro
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Evaluation dataset for umanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task (arxiv.org/abs/2412.21199).

  12. mbpp_last_layer

    • kaggle.com
    zip
    Updated Nov 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vũ Trọng Thanh (2024). mbpp_last_layer [Dataset]. https://www.kaggle.com/datasets/vtrngthanh01/mbpp-last-layer
    Explore at:
    zip(1418824902 bytes)Available download formats
    Dataset updated
    Nov 4, 2024
    Authors
    Vũ Trọng Thanh
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Vũ Trọng Thanh

    Released under Apache 2.0

    Contents

  13. Data_Sheet_2_The Effects of Mindfulness-Based Intervention on Shooting...

    • frontiersin.figshare.com
    pdf
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tsung-Yi Wu; Jui-Ti Nien; Garry Kuan; Chih-Han Wu; Yi-Chieh Chang; Hsueh-Chih Chen; Yu-Kai Chang (2023). Data_Sheet_2_The Effects of Mindfulness-Based Intervention on Shooting Performance and Cognitive Functions in Archers.pdf [Dataset]. http://doi.org/10.3389/fpsyg.2021.661961.s002
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Frontiers Mediahttp://www.frontiersin.org/
    Authors
    Tsung-Yi Wu; Jui-Ti Nien; Garry Kuan; Chih-Han Wu; Yi-Chieh Chang; Hsueh-Chih Chen; Yu-Kai Chang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This study investigated the effects of a mindfulness-based intervention (MBI) called mindfulness-based peak performance (MBPP) on athletic performance and cognitive functions in archers, as well as the role of psychological status and the dose-response relationship of MBPP in archery performance. Twenty-three archers completed a simulated archery competition and the Stroop task prior to and after MBPP training, which consisted of eight sessions over four weeks, while the mindfulness and rumination levels of the archers were assessed at three time points, namely, before, at the mid-point of, and after the MBPP program. The results revealed that the MBPP program significantly improved the shooting performance (p = 0.002, d = 0.27), multiple cognitive functions (ps < 0.001, d = 0.51~0.71), and mindfulness levels of the archers on the post-test, compared to the pre-test (p = 0.032, ηp2 = 0.15 for general; p = 0.004, ηp2 = 0.22 for athletic). Additionally, negative ruminations level was decreased from the pre-test to the middle-test and post-test (ps < 0.001, ηp2 = 0.43). These findings provide preliminary evidence to support the view that MBPP could serve as a promising form of training for fine motor sport performance, cognitive functions, and specific psychological status, such that it warrants further study.

  14. h

    iself-mbpp

    • huggingface.co
    Updated Mar 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ALIN LLM (2025). iself-mbpp [Dataset]. https://huggingface.co/datasets/ALIN-LLM/iself-mbpp
    Explore at:
    Dataset updated
    Mar 28, 2025
    Dataset authored and provided by
    ALIN LLM
    Description

    ALIN-LLM/iself-mbpp dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. deepseek6.7_mbpp_full

    • kaggle.com
    zip
    Updated Jul 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thanh Vu (2025). deepseek6.7_mbpp_full [Dataset]. https://www.kaggle.com/datasets/thanhtlx/deepseek6-7-mbpp-full
    Explore at:
    zip(1157459299 bytes)Available download formats
    Dataset updated
    Jul 30, 2025
    Authors
    Thanh Vu
    Description

    Dataset

    This dataset was created by Thanh Vu

    Contents

  16. h

    lilac-mbpp

    • huggingface.co
    Updated Feb 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lilac AI (2024). lilac-mbpp [Dataset]. https://huggingface.co/datasets/lilacai/lilac-mbpp
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 7, 2024
    Dataset authored and provided by
    Lilac AI
    Description

    lilac/mbpp

    This dataset is a Lilac processed dataset. Original dataset: https://huggingface.co/datasets/mbpp To download the dataset to a local directory: lilac download lilacai/lilac-mbpp

    or from python with: ll.download("lilacai/lilac-mbpp")

  17. LFCLF_mbpp_code_gemma_full_layers

    • kaggle.com
    zip
    Updated Jul 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    td (2025). LFCLF_mbpp_code_gemma_full_layers [Dataset]. https://www.kaggle.com/datasets/overvisual/lfclf-mbpp-code-gemma-full-layers/discussion
    Explore at:
    zip(1802785543 bytes)Available download formats
    Dataset updated
    Jul 30, 2025
    Authors
    td
    Description

    Dataset

    This dataset was created by td

    Contents

  18. h

    humaneval-mbpp-codegen-qa

    • huggingface.co
    Updated Apr 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oliver Stanley (2023). humaneval-mbpp-codegen-qa [Dataset]. https://huggingface.co/datasets/OllieStanley/humaneval-mbpp-codegen-qa
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 2, 2023
    Authors
    Oliver Stanley
    Description

    Dataset Card for "humaneval-mbpp-codegen-qa"

    This dataset contains prompt-reply (question-answer) pairs where the prompt is to create a Python function which satisfies the functionality described in a specified docstring. The responses are then the generated functions.

  19. h

    new-Synth-MBPP

    • huggingface.co
    Updated Feb 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zack Ankner (2025). new-Synth-MBPP [Dataset]. https://huggingface.co/datasets/ankner/new-Synth-MBPP
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 27, 2025
    Authors
    Zack Ankner
    Description

    ankner/new-Synth-MBPP dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    MBPP-Extended-3104

    • huggingface.co
    Updated May 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Usama (2024). MBPP-Extended-3104 [Dataset]. https://huggingface.co/datasets/MUsama100/MBPP-Extended-3104
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 1, 2024
    Authors
    Muhammad Usama
    Description

    MUsama100/MBPP-Extended-3104 dataset hosted on Hugging Face and contributed by the HF Datasets community

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Niklas Muennighoff (2022). mbpp [Dataset]. https://huggingface.co/datasets/Muennighoff/mbpp

mbpp

Muennighoff/mbpp

Mostly Basic Python Problems

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 4, 2022
Authors
Niklas Muennighoff
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The MBPP (Mostly Basic Python Problems) dataset consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry level programmers, covering programming fundamentals, standard library functionality, and so on. Each problem consists of a task description, code solution and 3 automated test cases.

Search
Clear search
Close search
Google apps
Main menu