2 datasets found
  1. ReTool-SFT-multi-turn

    • huggingface.co
    Updated May 11, 2025
    Cite
    Xiang Long (2025). ReTool-SFT-multi-turn [Dataset]. https://huggingface.co/datasets/swordfaith/ReTool-SFT-multi-turn
    Dataset updated: May 11, 2025
    Authors: Xiang Long
    License: Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    ReTool-SFT-multi-turn: Cold-start Multi-turn SFT Data for Tool Use

    Framework release blog: https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial/blob/main/rlhf/verl/multi-turn/verl-multiturn-rollout-Release.md

    This dataset contains ReTool-like cold-start multi-turn supervised fine-tuning (SFT) data specifically designed for training language models in strategic tool use. It serves as the cold-start training data for the verl-SGLang multi-turn framework, enabling models to learn… See the full description on the dataset page: https://huggingface.co/datasets/swordfaith/ReTool-SFT-multi-turn.

  2. gsm8k-python

    • huggingface.co
    Updated Sep 18, 2025
    Cite
    siyuanzhu (2025). gsm8k-python [Dataset]. https://huggingface.co/datasets/siyuan-zhu/gsm8k-python
    Dataset updated: Sep 18, 2025
    Authors: siyuanzhu
    License: MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    A GSM8K dataset for multi-step Python tool calling in LLMs, formatted for the verl SFT stage.

