71 datasets found

h
mbpp
huggingface.co
Updated Feb 4, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Niklas Muennighoff (2022). mbpp [Dataset]. https://huggingface.co/datasets/Muennighoff/mbpp
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 4, 2022
Authors
Niklas Muennighoff
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The MBPP (Mostly Basic Python Problems) dataset consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry level programmers, covering programming fundamentals, standard library functionality, and so on. Each problem consists of a task description, code solution and 3 automated test cases.
h
mbpp
huggingface.co
Updated Sep 15, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
nathan lile (2022). mbpp [Dataset]. https://huggingface.co/datasets/nlile/mbpp
Explore at:
Dataset updated
Sep 15, 2022
Authors
nathan lile
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Card for Mostly Basic Python Problems (mbpp)

Dataset Summary

The benchmark consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry level programmers, covering programming fundamentals, standard library functionality, and so on. Each problem consists of a task description, code solution and 3 automated test cases. As described in the paper, a subset of the data has been hand-verified by us. Released here as part of… See the full description on the dataset page: https://huggingface.co/datasets/nlile/mbpp.
t
MBPP - Dataset - LDM
service.tib.eu
Updated Jan 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). MBPP - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/mbpp
Explore at:
Dataset updated
Jan 3, 2025
Description
The dataset used in the paper for code generation
MBPP Sanitized
kaggle.com
zip
Updated Nov 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mrigank Pawagi (2023). MBPP Sanitized [Dataset]. https://www.kaggle.com/datasets/mrigankpawagi/mbpp-sanitized
Explore at:
zip(57766 bytes)Available download formats
Dataset updated
Nov 23, 2023
Authors
Mrigank Pawagi
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by Mrigank Pawagi

Released under CC0: Public Domain

Contents
h
mbpp
huggingface.co
Updated Jun 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CodeRAG-Bench (2024). mbpp [Dataset]. https://huggingface.co/datasets/code-rag-bench/mbpp
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 2, 2024
Dataset authored and provided by
CodeRAG-Bench
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
MBPP dataset annotated with ground-truth programming solutions, to enable evaluations for retrieval and retrieval-augmented code generation. Please refer to code-rag-bench for more details.
h
MBPP
huggingface.co
Updated Jun 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Princeton-AI (2025). MBPP [Dataset]. https://huggingface.co/datasets/Gen-Verse/MBPP
Explore at:
Dataset updated
Jun 4, 2025
Dataset authored and provided by
Princeton-AI
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Gen-Verse/MBPP dataset hosted on Hugging Face and contributed by the HF Datasets community
h
mbppplus
huggingface.co
Updated Aug 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
EvalPlus (2025). mbppplus [Dataset]. https://huggingface.co/datasets/evalplus/mbppplus
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 21, 2025
Dataset authored and provided by
EvalPlus
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
evalplus/mbppplus dataset hosted on Hugging Face and contributed by the HF Datasets community
h
mbpp-new-dataset
huggingface.co
Updated Sep 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
siqi zhu (2024). mbpp-new-dataset [Dataset]. https://huggingface.co/datasets/zsqzz/mbpp-new-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 10, 2024
Authors
siqi zhu
Description
zsqzz/mbpp-new-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
h
mbpp
huggingface.co
Updated Sep 15, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vira (2022). mbpp [Dataset]. https://huggingface.co/datasets/jash404/mbpp
Explore at:
Dataset updated
Sep 15, 2022
Authors
Vira
Description
jash404/mbpp dataset hosted on Hugging Face and contributed by the HF Datasets community
AOCG
figshare.com
zip
Updated Dec 7, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jia Li (2023). AOCG [Dataset]. http://doi.org/10.6084/m9.figshare.24763701.v1
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.24763701.v1
Dataset updated
Dec 7, 2023
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Jia Li
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The replication package of AOCGThe repository is divided into two parts: datasets and the code of our AOCG method.### Requirements- python 3.8- Java 1.8.0- transformers 4.5.1- tree-sitter 0.2.2- Pytorch 1.7.1### Data PreprocessingExperimental datasets contain the API_SUM dataset, the Hearthstone dataset, and the MBPP dataset. We use tree sitter tool to automatically extract the API terms and sketches of programs.Take the MBPP dataset as an example:To extract API terms, run 'data_process/api_extract.py' and acquire the 'api_terms.jsonl' To extract sketches, run 'data_process/sketch_extract.py' and acquire the 'sketches.jsonl' Put the API terms, sketches, complete codes, and requirements into the 'final_train.jsonl' and 'final_test.jsonl'.### TrainingGiven a specific requirement, the APIer predicts API terms, and the Sketcher outputs corresponding the sketch based on the API terms and requirements. And the Coder fills the sketch to a complete program according to the API terms, sketch and requirement.export CUDA_VISIBLE_DEVICES=0python AOCG_finetune.py \--stage_1 nl_pp \--stage_2 nl_pp_ss \--stage_3 nl_ss_pp_code \--local_rank -1### InferenceThe AOCG predicts code snippets in a progressive generation manner, and write the predicted codes into 'xx.output'.export CUDA_VISIBLE_DEVICES=0python AOCG_inference.py \--stage_1 nl_pp \--stage_2 nl_pp_ss \--stage_3 nl_ss_pp_code \--local_rank -1### EvaluationAfter acquiring the generated codes, evaluate the programs by running 'evaluator/evaluation.py'.
mbpp-pro
huggingface.co
Updated Dec 31, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CodeEval-Pro (2024). mbpp-pro [Dataset]. https://huggingface.co/datasets/CodeEval-Pro/mbpp-pro
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 31, 2024
Dataset provided by
CodeEval, Inc.
Authors
CodeEval-Pro
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Evaluation dataset for umanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task (arxiv.org/abs/2412.21199).
mbpp_last_layer
kaggle.com
zip
Updated Nov 4, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vũ Trọng Thanh (2024). mbpp_last_layer [Dataset]. https://www.kaggle.com/datasets/vtrngthanh01/mbpp-last-layer
Explore at:
zip(1418824902 bytes)Available download formats
Dataset updated
Nov 4, 2024
Authors
Vũ Trọng Thanh
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Vũ Trọng Thanh

Released under Apache 2.0

Contents
Data_Sheet_2_The Effects of Mindfulness-Based Intervention on Shooting...
frontiersin.figshare.com
pdf
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tsung-Yi Wu; Jui-Ti Nien; Garry Kuan; Chih-Han Wu; Yi-Chieh Chang; Hsueh-Chih Chen; Yu-Kai Chang (2023). Data_Sheet_2_The Effects of Mindfulness-Based Intervention on Shooting Performance and Cognitive Functions in Archers.pdf [Dataset]. http://doi.org/10.3389/fpsyg.2021.661961.s002
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.3389/fpsyg.2021.661961.s002
Dataset updated
Jun 1, 2023
Dataset provided by
Frontiers Mediahttp://www.frontiersin.org/
Authors
Tsung-Yi Wu; Jui-Ti Nien; Garry Kuan; Chih-Han Wu; Yi-Chieh Chang; Hsueh-Chih Chen; Yu-Kai Chang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This study investigated the effects of a mindfulness-based intervention (MBI) called mindfulness-based peak performance (MBPP) on athletic performance and cognitive functions in archers, as well as the role of psychological status and the dose-response relationship of MBPP in archery performance. Twenty-three archers completed a simulated archery competition and the Stroop task prior to and after MBPP training, which consisted of eight sessions over four weeks, while the mindfulness and rumination levels of the archers were assessed at three time points, namely, before, at the mid-point of, and after the MBPP program. The results revealed that the MBPP program significantly improved the shooting performance (p = 0.002, d = 0.27), multiple cognitive functions (ps < 0.001, d = 0.51~0.71), and mindfulness levels of the archers on the post-test, compared to the pre-test (p = 0.032, ηp2 = 0.15 for general; p = 0.004, ηp2 = 0.22 for athletic). Additionally, negative ruminations level was decreased from the pre-test to the middle-test and post-test (ps < 0.001, ηp2 = 0.43). These findings provide preliminary evidence to support the view that MBPP could serve as a promising form of training for fine motor sport performance, cognitive functions, and specific psychological status, such that it warrants further study.
h
iself-mbpp
huggingface.co
Updated Mar 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ALIN LLM (2025). iself-mbpp [Dataset]. https://huggingface.co/datasets/ALIN-LLM/iself-mbpp
Explore at:
Dataset updated
Mar 28, 2025
Dataset authored and provided by
ALIN LLM
Description
ALIN-LLM/iself-mbpp dataset hosted on Hugging Face and contributed by the HF Datasets community
deepseek6.7_mbpp_full
kaggle.com
zip
Updated Jul 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Thanh Vu (2025). deepseek6.7_mbpp_full [Dataset]. https://www.kaggle.com/datasets/thanhtlx/deepseek6-7-mbpp-full
Explore at:
zip(1157459299 bytes)Available download formats
Dataset updated
Jul 30, 2025
Authors
Thanh Vu
Description
Dataset

This dataset was created by Thanh Vu

Contents
h
lilac-mbpp
huggingface.co
Updated Feb 7, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lilac AI (2024). lilac-mbpp [Dataset]. https://huggingface.co/datasets/lilacai/lilac-mbpp
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 7, 2024
Dataset authored and provided by
Lilac AI
Description
lilac/mbpp

This dataset is a Lilac processed dataset. Original dataset: https://huggingface.co/datasets/mbpp To download the dataset to a local directory: lilac download lilacai/lilac-mbpp

or from python with: ll.download("lilacai/lilac-mbpp")
LFCLF_mbpp_code_gemma_full_layers
kaggle.com
zip
Updated Jul 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
td (2025). LFCLF_mbpp_code_gemma_full_layers [Dataset]. https://www.kaggle.com/datasets/overvisual/lfclf-mbpp-code-gemma-full-layers/discussion
Explore at:
zip(1802785543 bytes)Available download formats
Dataset updated
Jul 30, 2025
Authors
td
Description
Dataset

This dataset was created by td

Contents
h
humaneval-mbpp-codegen-qa
huggingface.co
Updated Apr 2, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Oliver Stanley (2023). humaneval-mbpp-codegen-qa [Dataset]. https://huggingface.co/datasets/OllieStanley/humaneval-mbpp-codegen-qa
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 2, 2023
Authors
Oliver Stanley
Description
Dataset Card for "humaneval-mbpp-codegen-qa"

This dataset contains prompt-reply (question-answer) pairs where the prompt is to create a Python function which satisfies the functionality described in a specified docstring. The responses are then the generated functions.
h
new-Synth-MBPP
huggingface.co
Updated Feb 27, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zack Ankner (2025). new-Synth-MBPP [Dataset]. https://huggingface.co/datasets/ankner/new-Synth-MBPP
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 27, 2025
Authors
Zack Ankner
Description
ankner/new-Synth-MBPP dataset hosted on Hugging Face and contributed by the HF Datasets community
h
MBPP-Extended-3104
huggingface.co
Updated May 1, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muhammad Usama (2024). MBPP-Extended-3104 [Dataset]. https://huggingface.co/datasets/MUsama100/MBPP-Extended-3104
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 1, 2024
Authors
Muhammad Usama
Description
MUsama100/MBPP-Extended-3104 dataset hosted on Hugging Face and contributed by the HF Datasets community

Facebook

Twitter

Click to copy link

Link copied

Cite

Niklas Muennighoff (2022). mbpp [Dataset]. https://huggingface.co/datasets/Muennighoff/mbpp

mbpp

Muennighoff/mbpp

Mostly Basic Python Problems

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Feb 4, 2022

Authors

Niklas Muennighoff

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The MBPP (Mostly Basic Python Problems) dataset consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry level programmers, covering programming fundamentals, standard library functionality, and so on. Each problem consists of a task description, code solution and 3 automated test cases.

Clear search

Close search

Google apps

Main menu

mbpp

mbpp

MBPP - Dataset - LDM

MBPP Sanitized

Dataset

Contents

mbpp

MBPP

mbppplus

mbpp-new-dataset

mbpp

AOCG

mbpp-pro

mbpp_last_layer

Dataset

Contents

Data_Sheet_2_The Effects of Mindfulness-Based Intervention on Shooting...

iself-mbpp

deepseek6.7_mbpp_full

Dataset

Contents

lilac-mbpp

LFCLF_mbpp_code_gemma_full_layers

Dataset

Contents

humaneval-mbpp-codegen-qa

new-Synth-MBPP

MBPP-Extended-3104

mbppSee More Versions

Muennighoff/mbpp

Mostly Basic Python Problems

mbpp