Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
π Project Page: https://longbench2.github.io π» Github Repo: https://github.com/THUDM/LongBench π Arxiv Paper: https://arxiv.org/abs/2412.15204 LongBench v2 is designed to assess the ability of LLMs to handle long-context problems requiring deep understanding and reasoning across real-world multitasks. LongBench v2 has the following features: (1) Length: Context length ranging from 8k toβ¦ See the full description on the dataset page: https://huggingface.co/datasets/JamesBegin/LongBench-v2-Pause1.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
π Project Page: https://longbench2.github.io π» Github Repo: https://github.com/THUDM/LongBench π Arxiv Paper: https://arxiv.org/abs/2412.15204 LongBench v2 is designed to assess the ability of LLMs to handle long-context problems requiring deep understanding and reasoning across real-world multitasks. LongBench v2 has the following features: (1) Length: Context length ranging from 8k toβ¦ See the full description on the dataset page: https://huggingface.co/datasets/JamesBegin/LongBench-v2-Pause1.