60 datasets found
  1.

    PHI-4-Hindi-Instruct-Data

    • huggingface.co
    Updated Feb 6, 2025
    Cite
    Ram Kadiyala (2025). PHI-4-Hindi-Instruct-Data [Dataset]. https://huggingface.co/datasets/1024m/PHI-4-Hindi-Instruct-Data
    Dataset updated
    Feb 6, 2025
    Authors
    Ram Kadiyala
    Description

    1024m/PHI-4-Hindi-Instruct-Data dataset hosted on Hugging Face and contributed by the HF Datasets community

  2.

    details_microsoft_phi-4_v2

    • huggingface.co
    Updated May 5, 2025
    + more versions
    Cite
    Open Arabic LLM Leaderboard (2025). details_microsoft_phi-4_v2 [Dataset]. https://huggingface.co/datasets/OALL/details_microsoft_phi-4_v2
    Dataset updated
    May 5, 2025
    Dataset authored and provided by
    Open Arabic LLM Leaderboard
    Description

    Dataset Card for Evaluation run of microsoft/phi-4

    Dataset automatically created during the evaluation run of model microsoft/phi-4. The dataset is composed of 116 configurations, each one corresponding to one of the evaluated tasks. The dataset has been created from 1 run(s). Each run can be found as a specific split in each configuration, the split being named using the timestamp of the run. The "train" split always points to the latest results. An additional configuration… See the full description on the dataset page: https://huggingface.co/datasets/OALL/details_microsoft_phi-4_v2.
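
    The split convention described above can be sketched in a few lines. This is a minimal illustration, not the leaderboard's own code: it assumes the run splits are named with timestamps that sort chronologically as plain strings, and the split names below are hypothetical.

```python
def latest_run_split(split_names):
    """Return the most recent timestamp-named split.

    Assumption: timestamp names sort chronologically as strings.
    The "train" alias is skipped because it only mirrors the latest run.
    """
    runs = [name for name in split_names if name != "train"]
    if not runs:
        raise ValueError("no timestamp-named splits found")
    return max(runs)


# Hypothetical split names for a single configuration.
splits = ["2025_05_01T09_30_00", "2025_05_05T10_00_00", "train"]
print(latest_run_split(splits))
```

    With a single run, as in this dataset, the timestamp split and "train" would hold the same results.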

  3.

    phi4-conversations

    • huggingface.co
    Updated Mar 30, 2025
    Cite
    RDSM (2025). phi4-conversations [Dataset]. https://huggingface.co/datasets/rdsm/phi4-conversations
    Dataset updated
    Mar 30, 2025
    Authors
    RDSM
    Description

    Raw responses generated by Phi4, with questions from alamios/Mistral-Small-24B-Instruct-2501-Conversations. I made it for use on the QwenPhi 0.5B Draft model, but the fine-tune did not yield much improvement. Still, I have generated the dataset, so here is the raw data; hopefully it is useful to someone.

  4.

    details_microsoft_Phi-4-mini-instruct_v2_alrage

    • huggingface.co
    Updated Jun 6, 2025
    + more versions
    Cite
    Open Arabic LLM Leaderboard (2025). details_microsoft_Phi-4-mini-instruct_v2_alrage [Dataset]. https://huggingface.co/datasets/OALL/details_microsoft_Phi-4-mini-instruct_v2_alrage
    Dataset updated
    Jun 6, 2025
    Dataset authored and provided by
    Open Arabic LLM Leaderboard
    Description

    Dataset Card for Evaluation run of microsoft/Phi-4-mini-instruct

    Dataset automatically created during the evaluation run of model microsoft/Phi-4-mini-instruct. The dataset is composed of 1 configuration, corresponding to one of the evaluated tasks. The dataset has been created from 1 run(s). Each run can be found as a specific split in each configuration, the split being named using the timestamp of the run. The "train" split always points to the latest results. An… See the full description on the dataset page: https://huggingface.co/datasets/OALL/details_microsoft_Phi-4-mini-instruct_v2_alrage.

  5.

    RAGAS-BASE-phi-4-eval2

    • huggingface.co
    Updated Apr 30, 2025
    + more versions
    Cite
    Robin Burghardt (2025). RAGAS-BASE-phi-4-eval2 [Dataset]. https://huggingface.co/datasets/R0bfried/RAGAS-BASE-phi-4-eval2
    Dataset updated
    Apr 30, 2025
    Authors
    Robin Burghardt
    Description

    R0bfried/RAGAS-BASE-phi-4-eval2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  6.

    Phi-4-reasoning-plus_eval_2693

    • huggingface.co
    Cite
    ML Foundations Development, Phi-4-reasoning-plus_eval_2693 [Dataset]. https://huggingface.co/datasets/mlfoundations-dev/Phi-4-reasoning-plus_eval_2693
    Dataset authored and provided by
    ML Foundations Development
    Description

    mlfoundations-dev/Phi-4-reasoning-plus_eval_2693 dataset hosted on Hugging Face and contributed by the HF Datasets community

  7.

    phi4-hallucination-embeddings

    • huggingface.co
    Updated Mar 23, 2025
    + more versions
    Cite
    Malthe Have Musaeus (2025). phi4-hallucination-embeddings [Dataset]. https://huggingface.co/datasets/Malthe/phi4-hallucination-embeddings
    Dataset updated
    Mar 23, 2025
    Authors
    Malthe Have Musaeus
    Description

    Malthe/phi4-hallucination-embeddings dataset hosted on Hugging Face and contributed by the HF Datasets community

  8.

    code-reasoning-phi4-template

    • huggingface.co
    Cite
    yehya, code-reasoning-phi4-template [Dataset]. https://huggingface.co/datasets/ykarout/code-reasoning-phi4-template
    Authors
    yehya
    Description

    ykarout/code-reasoning-phi4-template dataset hosted on Hugging Face and contributed by the HF Datasets community

  9.

    finetome-phi4-format

    • huggingface.co
    Cite
    yehya, finetome-phi4-format [Dataset]. https://huggingface.co/datasets/ykarout/finetome-phi4-format
    Authors
    yehya
    Description

    FineTome-Phi4-Format

    This dataset contains 15000 samples from the mlabonne/FineTome-100k dataset, formatted specifically for Phi-4 models with the system prompt set to "detailed thinking off".

      Format
    

    Each example follows the Phi chat format:

    <|im_start|>system<|im_sep|>detailed thinking off<|im_end|>
    <|im_start|>user<|im_sep|>[User content]<|im_end|>
    <|im_start|>assistant<|im_sep|>[Assistant response]<|im_end|>
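
    As a rough sketch of how a conversation could be rendered into this layout, the helper below joins one tagged turn per message. The function name, the message dictionaries, and the `turn_sep` parameter are assumptions made for illustration; the separator between turns is not recoverable from the description, so it is left configurable.

```python
def format_phi_chat(messages, turn_sep="\n"):
    """Render messages as <|im_start|>role<|im_sep|>content<|im_end|> turns.

    turn_sep is an assumption: the dataset description does not make clear
    what, if anything, separates consecutive turns.
    """
    turns = [
        f"<|im_start|>{m['role']}<|im_sep|>{m['content']}<|im_end|>"
        for m in messages
    ]
    return turn_sep.join(turns)


# Hypothetical conversation using the system prompt quoted in the description.
example = [
    {"role": "system", "content": "detailed thinking off"},
    {"role": "user", "content": "What is 2 + 2?"},
    {"role": "assistant", "content": "4"},
]
print(format_phi_chat(example))
```

    When training with a real tokenizer, its own chat template is the safer source of truth for separators and special tokens.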

      Source
    

    The original data comes from… See the full description on the dataset page: https://huggingface.co/datasets/ykarout/finetome-phi4-format.

  10.

    crs-2014-2023-housing-labeled-phi4

    • huggingface.co
    + more versions
    Cite
    Alexander Miller, crs-2014-2023-housing-labeled-phi4 [Dataset]. https://huggingface.co/datasets/alex-miller/crs-2014-2023-housing-labeled-phi4
    Explore at: Croissant, a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Alexander Miller
    Description

    alex-miller/crs-2014-2023-housing-labeled-phi4 dataset hosted on Hugging Face and contributed by the HF Datasets community

  11.

    nemotron-phi4-format-data

    • huggingface.co
    Cite
    yehya, nemotron-phi4-format-data [Dataset]. https://huggingface.co/datasets/ykarout/nemotron-phi4-format-data
    Authors
    yehya
    Description

    ykarout/nemotron-phi4-format-data dataset hosted on Hugging Face and contributed by the HF Datasets community

  12.

    nvidia-nemotron-sampled-phi4-format

    • huggingface.co
    Cite
    yehya, nvidia-nemotron-sampled-phi4-format [Dataset]. https://huggingface.co/datasets/ykarout/nvidia-nemotron-sampled-phi4-format
    Authors
    yehya
    Description

    Nemotron Phi-4 Format Dataset

    This dataset contains examples extracted from NVIDIA/Llama-Nemotron-Post-Training-Dataset, filtered and formatted for Phi-4 fine-tuning.

      Dataset Details
    

    Source: NVIDIA/Llama-Nemotron-Post-Training-Dataset
    Splits: code, math, science
    Filter: system_prompt="detailed thinking on", output length between 8000-10000 characters
    Format: Phi-4 chat template
    Size: 20000 examples
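
    The filter listed in the details can be expressed as a simple predicate. This is only a sketch under stated assumptions: the `output` field name is a guess (only `system_prompt` appears in the description), the length bounds are treated as inclusive, and the sample records are made up.

```python
def keep_example(record, min_chars=8000, max_chars=10000):
    """Return True if a record passes the filter described in the dataset card.

    Assumptions: the response text lives in an "output" field, and the
    8000-10000 character bounds are inclusive.
    """
    return (
        record.get("system_prompt") == "detailed thinking on"
        and min_chars <= len(record.get("output", "")) <= max_chars
    )


# Made-up records: only the first satisfies both conditions.
records = [
    {"system_prompt": "detailed thinking on", "output": "x" * 9000},
    {"system_prompt": "detailed thinking off", "output": "x" * 9000},
    {"system_prompt": "detailed thinking on", "output": "x" * 500},
]
kept = [r for r in records if keep_example(r)]
print(len(kept))
```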

      Format
    

    Examples follow the Phi-4 chat template:… See the full description on the dataset page: https://huggingface.co/datasets/ykarout/nvidia-nemotron-sampled-phi4-format.

  13.

    phi4-math-chat-10k

    • huggingface.co
    Cite
    yehya, phi4-math-chat-10k [Dataset]. https://huggingface.co/datasets/ykarout/phi4-math-chat-10k
    Authors
    yehya
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Phi-4 Math Chat Dataset (10k)

    This dataset contains 10,000 carefully selected math reasoning samples formatted using the Phi-4 chat template. Each entry consists of:

    A user message: the math question
    An assistant message: a final concise answer, extracted from deeper reasoning chains

      Format
    

    Each example follows the Phi-4 chat format:

    <|im_start|>user<|im_sep|>question<|im_end|>
    <|im_start|>assistant<|im_sep|>answer<|im_end|>
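
    Going the other way, a formatted example can be split back into its question and answer. The regular expression and function name below are illustrative assumptions rather than anything shipped with the dataset, and the sample string is invented.

```python
import re

# Matches one <|im_start|>role<|im_sep|>content<|im_end|> turn.
TURN_PATTERN = re.compile(
    r"<\|im_start\|>(?P<role>\w+)<\|im_sep\|>(?P<content>.*?)<\|im_end\|>",
    re.DOTALL,
)


def parse_phi_chat(text):
    """Split a formatted string into a list of {"role", "content"} dicts."""
    return [
        {"role": m.group("role"), "content": m.group("content")}
        for m in TURN_PATTERN.finditer(text)
    ]


# Invented sample in the user/assistant layout shown above.
sample = (
    "<|im_start|>user<|im_sep|>What is 3 * 4?<|im_end|> "
    "<|im_start|>assistant<|im_sep|>12<|im_end|>"
)
print(parse_phi_chat(sample))
```

    Because the pattern ignores whatever sits between turns, it works whether turns are separated by spaces, newlines, or nothing.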

      Usage
    

    This dataset is suitable… See the full description on the dataset page: https://huggingface.co/datasets/ykarout/phi4-math-chat-10k.

  14.

    BankFAQs-Phi4

    • huggingface.co
    + more versions
    Cite
    Ghulam Shabbir Khan, BankFAQs-Phi4 [Dataset]. https://huggingface.co/datasets/GhulamShabbirKhan/BankFAQs-Phi4
    Explore at: Croissant, a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Ghulam Shabbir Khan
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    GhulamShabbirKhan/BankFAQs-Phi4 dataset hosted on Hugging Face and contributed by the HF Datasets community

  15.

    numinamath-178k-phi4-bon-verified-dpo-trl-40k-old-r1-format

    • huggingface.co
    Updated Feb 5, 2025
    + more versions
    Cite
    Guangxuan Xu (2025). numinamath-178k-phi4-bon-verified-dpo-trl-40k-old-r1-format [Dataset]. https://huggingface.co/datasets/gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl-40k-old-r1-format
    Explore at: Croissant, a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 5, 2025
    Authors
    Guangxuan Xu
    Description

    gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl-40k-old-r1-format dataset hosted on Hugging Face and contributed by the HF Datasets community

  16.

    RAGAS-INSTRUCT-phi-4-eval2

    • huggingface.co
    Updated Apr 30, 2025
    + more versions
    Cite
    Robin Burghardt (2025). RAGAS-INSTRUCT-phi-4-eval2 [Dataset]. https://huggingface.co/datasets/R0bfried/RAGAS-INSTRUCT-phi-4-eval2
    Dataset updated
    Apr 30, 2025
    Authors
    Robin Burghardt
    Description

    R0bfried/RAGAS-INSTRUCT-phi-4-eval2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  17.

    FINGU-AI_Phi-4-RRStock-details

    • huggingface.co
    Updated Mar 10, 2025
    Cite
    Open LLM Leaderboard (2025). FINGU-AI_Phi-4-RRStock-details [Dataset]. https://huggingface.co/datasets/open-llm-leaderboard/FINGU-AI_Phi-4-RRStock-details
    Dataset updated
    Mar 10, 2025
    Dataset authored and provided by
    Open LLM Leaderboard
    Description

    Dataset Card for Evaluation run of FINGU-AI/Phi-4-RRStock

    Dataset automatically created during the evaluation run of model FINGU-AI/Phi-4-RRStock. The dataset is composed of 38 configuration(s), each one corresponding to one of the evaluated tasks. The dataset has been created from 1 run(s). Each run can be found as a specific split in each configuration, the split being named using the timestamp of the run. The "train" split always points to the latest results. An additional… See the full description on the dataset page: https://huggingface.co/datasets/open-llm-leaderboard/FINGU-AI_Phi-4-RRStock-details.

  18.

    Emilia-All-Ja-Phi-4-mini-instruct

    • huggingface.co
    Updated Apr 18, 2025
    Cite
    Kadir Nar (2025). Emilia-All-Ja-Phi-4-mini-instruct [Dataset]. https://huggingface.co/datasets/kadirnar/Emilia-All-Ja-Phi-4-mini-instruct
    Dataset updated
    Apr 18, 2025
    Authors
    Kadir Nar
    Description

    kadirnar/Emilia-All-Ja-Phi-4-mini-instruct dataset hosted on Hugging Face and contributed by the HF Datasets community

  19.

    Phi-4-reasoning-plus_eval_5554

    • huggingface.co
    Cite
    ML Foundations Development, Phi-4-reasoning-plus_eval_5554 [Dataset]. https://huggingface.co/datasets/mlfoundations-dev/Phi-4-reasoning-plus_eval_5554
    Dataset authored and provided by
    ML Foundations Development
    Description

    mlfoundations-dev/Phi-4-reasoning-plus_eval_5554

    Precomputed model outputs for evaluation.

      Evaluation Results

      Summary

    Metric             Accuracy
    AIME24             76.0
    AMC23              96.2
    MATH500            84.0
    MMLUPro            14.6
    JEEBench           83.5
    GPQADiamond        66.8
    LiveCodeBench      0.8
    CodeElo            2.4
    CodeForces         3.5
    HLE                7.1
    HMMT               53.0
    AIME25             68.0
    LiveCodeBenchv5    0.5

      AIME24
    

    Average Accuracy: 76.00% ± 1.23%
    Number of Runs: 10

    Run    Accuracy    Questions Solved    Total Questions

    1… See the full description on the dataset page: https://huggingface.co/datasets/mlfoundations-dev/Phi-4-reasoning-plus_eval_5554.
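
    A figure such as "76.00% ± 1.23%" over several runs can be produced by averaging per-run accuracies. The description does not say how the ± term is computed; the sketch below assumes it is the standard error of the mean, and the per-run values are hypothetical, not taken from the dataset.

```python
import math
import statistics


def mean_and_stderr(accuracies):
    """Return (mean, standard error of the mean) for per-run accuracies.

    Assumption: the spread is reported as sample standard deviation
    divided by the square root of the number of runs.
    """
    mean = statistics.mean(accuracies)
    stderr = statistics.stdev(accuracies) / math.sqrt(len(accuracies))
    return mean, stderr


# Hypothetical per-run accuracies in percent (not the dataset's real runs).
hypothetical_runs = [70.0, 80.0]
mean, stderr = mean_and_stderr(hypothetical_runs)
print(f"Average Accuracy: {mean:.2f}% ± {stderr:.2f}%")
```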

  20.

    bunnycore_Phi-4-Model-Stock-v4-details

    • huggingface.co
    Updated Mar 10, 2025
    + more versions
    Cite
    Open LLM Leaderboard (2025). bunnycore_Phi-4-Model-Stock-v4-details [Dataset]. https://huggingface.co/datasets/open-llm-leaderboard/bunnycore_Phi-4-Model-Stock-v4-details
    Dataset updated
    Mar 10, 2025
    Dataset authored and provided by
    Open LLM Leaderboard
    Description

    Dataset Card for Evaluation run of bunnycore/Phi-4-Model-Stock-v4

    Dataset automatically created during the evaluation run of model bunnycore/Phi-4-Model-Stock-v4. The dataset is composed of 38 configuration(s), each one corresponding to one of the evaluated tasks. The dataset has been created from 1 run(s). Each run can be found as a specific split in each configuration, the split being named using the timestamp of the run. The "train" split always points to the latest results.… See the full description on the dataset page: https://huggingface.co/datasets/open-llm-leaderboard/bunnycore_Phi-4-Model-Stock-v4-details.

