1 dataset found
  1. h

    LongBench-v2-Pause1

    • huggingface.co
    Updated Dec 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    James Begin (2024). LongBench-v2-Pause1 [Dataset]. https://huggingface.co/datasets/JamesBegin/LongBench-v2-Pause1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 20, 2024
    Authors
    James Begin
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

    🌐 Project Page: https://longbench2.github.io πŸ’» Github Repo: https://github.com/THUDM/LongBench πŸ“š Arxiv Paper: https://arxiv.org/abs/2412.15204 LongBench v2 is designed to assess the ability of LLMs to handle long-context problems requiring deep understanding and reasoning across real-world multitasks. LongBench v2 has the following features: (1) Length: Context length ranging from 8k to… See the full description on the dataset page: https://huggingface.co/datasets/JamesBegin/LongBench-v2-Pause1.

  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
James Begin (2024). LongBench-v2-Pause1 [Dataset]. https://huggingface.co/datasets/JamesBegin/LongBench-v2-Pause1

LongBench-v2-Pause1

JamesBegin/LongBench-v2-Pause1

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 20, 2024
Authors
James Begin
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

🌐 Project Page: https://longbench2.github.io πŸ’» Github Repo: https://github.com/THUDM/LongBench πŸ“š Arxiv Paper: https://arxiv.org/abs/2412.15204 LongBench v2 is designed to assess the ability of LLMs to handle long-context problems requiring deep understanding and reasoning across real-world multitasks. LongBench v2 has the following features: (1) Length: Context length ranging from 8k to… See the full description on the dataset page: https://huggingface.co/datasets/JamesBegin/LongBench-v2-Pause1.

Search
Clear search
Close search
Google apps
Main menu