9 datasets found

h
browsecomp
huggingface.co
Updated Jul 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Peng Jin (2025). browsecomp [Dataset]. https://huggingface.co/datasets/Chat-UniVi/browsecomp
Explore at:
Dataset updated
Jul 8, 2025
Authors
Peng Jin
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Chat-UniVi/browsecomp dataset hosted on Hugging Face and contributed by the HF Datasets community
BrowseCompLongContext
huggingface.co
Updated Aug 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
OpenAI (2025). BrowseCompLongContext [Dataset]. https://huggingface.co/datasets/openai/BrowseCompLongContext
Explore at:
Dataset updated
Aug 9, 2025
Dataset authored and provided by
OpenAIhttp://openai.com/
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
BrowseComp Long Context

BrowseComp Long Context is a dataset based on BrowseComp to benchmark LLM’s capability to retrieve relevant information from noisy data in its context. It converts the agentic question answering tasks from Browsecomp into long context tasks. For each of the questions in a subset of BrowseComp, a list of urls are attached. Each url will be paired with an indicator indicating whether the content of the web page is required to answer the question or is… See the full description on the dataset page: https://huggingface.co/datasets/openai/BrowseCompLongContext.
BrowseComp: A Benchmark for Browsing Agents
kaggle.com
Updated Jun 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
OpenAI (2025). BrowseComp: A Benchmark for Browsing Agents [Dataset]. https://www.kaggle.com/datasets/openai/browsecomp-a-benchmark-for-browsing-agents/data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 11, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
OpenAI
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Leaderboard Link

Relevant links: * implementation: https://www.kaggle.com/code/aminmohamedmohami/browsecomp-benchmark-starter-code * publication: https://arxiv.org/abs/2504.12516 * original repository: https://github.com/openai/simple-evals/tree/main

Abstract We present BrowseComp, a simple yet challenging benchmark for measuring the ability for agents to browse the web. BrowseComp comprises 1,266 questions that require persistently navigating the internet in search of hard-to-find, entangled information. Despite the difficulty of the questions, BrowseComp is simple and easy-to-use, as predicted answers are short and easily verifiable against reference answers. BrowseComp for browsing agents can be seen as analogous to how programming competitions are an incomplete but useful benchmark for coding agents. While BrowseComp sidesteps challenges of a true user query distribution, like generating long answers or resolving ambiguity, it measures the important core capability of exercising persistence and creativity in finding information.
h
browsecomp-plus-corpus
huggingface.co
Updated Sep 4, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tevatron (2025). browsecomp-plus-corpus [Dataset]. https://huggingface.co/datasets/Tevatron/browsecomp-plus-corpus
Explore at:
Dataset updated
Sep 4, 2025
Dataset authored and provided by
Tevatron
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
BrowseComp-Plus

Project Page | Paper | Code BrowseComp-Plus is a new benchmark for Deep-Research system, isolating the effect of the retriever and the LLM agent to enable fair, transparent comparisons of Deep-Research agents. The benchmark sources challenging, reasoning-intensive queries from OpenAI's BrowseComp. However, instead of searching the live web, BrowseComp-Plus evaluates against a fixed, curated corpus of ~100K web documents from the web. The corpus includes both… See the full description on the dataset page: https://huggingface.co/datasets/Tevatron/browsecomp-plus-corpus.
h
browsecomp-filtered
huggingface.co
Updated Sep 15, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chaohao Yuan (2025). browsecomp-filtered [Dataset]. https://huggingface.co/datasets/ychaohao/browsecomp-filtered
Explore at:
Dataset updated
Sep 15, 2025
Authors
Chaohao Yuan
Description
ychaohao/browsecomp-filtered dataset hosted on Hugging Face and contributed by the HF Datasets community
h
browse_comp
huggingface.co
Updated Apr 18, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
smolagents (2025). browse_comp [Dataset]. https://huggingface.co/datasets/smolagents/browse_comp
Explore at:
Dataset updated
Apr 18, 2025
Dataset authored and provided by
smolagents
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
smolagents/browse_comp dataset hosted on Hugging Face and contributed by the HF Datasets community
w
browsecomp.com - Historical whois Lookup
whoisdatacenter.com
csv
Updated Aug 20, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AllHeart Web Inc (2018). browsecomp.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/browsecomp.com/
Explore at:
csvAvailable download formats
Dataset updated
Aug 20, 2018
Dataset authored and provided by
AllHeart Web Inc
License
https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/
Time period covered
Mar 15, 1985 - Aug 20, 2025
Description
Explore the historical Whois records related to browsecomp.com (Domain). Get insights into ownership history and changes over time.
h
browsecomp-plus-indexes
huggingface.co
Updated Sep 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tevatron (2025). browsecomp-plus-indexes [Dataset]. https://huggingface.co/datasets/Tevatron/browsecomp-plus-indexes
Explore at:
Dataset updated
Sep 4, 2025
Dataset authored and provided by
Tevatron
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
BM25, embedding index used in BrowseComp-Plus. For downloading the index: huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="bm25/*" --local-dir ./indexes huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="qwen3-embedding-0.6b/*" --local-dir ./indexes huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="qwen3-embedding-4b/*" --local-dir ./indexes huggingface-cli download… See the full description on the dataset page: https://huggingface.co/datasets/Tevatron/browsecomp-plus-indexes.
h
MMBrowseComp
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
map, MMBrowseComp [Dataset]. https://huggingface.co/datasets/mmbrowsecomp/MMBrowseComp
Explore at:
Authors
map
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

Paper: https://arxiv.org/abs/2508.13186v1 Code: https://github.com/MMBrowseComp/MM-BrowseComp The specific evaluation methods can be found in our GitHub repository.
Not seeing a result you expected?
Learn how you can add new datasets to our index.