Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
ReTool-SFT-multi-turn: Cold-start Multi-turn SFT Data for Tool Use
Framework release blog: https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial/blob/main/rlhf/verl/multi-turn/verl-multiturn-rollout-Release.md
This dataset contains ReTool-like cold-start multi-turn supervised fine-tuning (SFT) data specifically designed for training language models in strategic tool use. It serves as the cold-start training data for the verl-SGLang multi-turn framework, enabling models to learn… See the full description on the dataset page: https://huggingface.co/datasets/swordfaith/ReTool-SFT-multi-turn.
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
A GSM8K dataset for multi-step Python tool calling in LLMs, formatted for the VERL SFT stage.