9 datasets found
  1. h

    browsecomp

    • huggingface.co
    Updated Jul 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peng Jin (2025). browsecomp [Dataset]. https://huggingface.co/datasets/Chat-UniVi/browsecomp
    Explore at:
    Dataset updated
    Jul 8, 2025
    Authors
    Peng Jin
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Chat-UniVi/browsecomp dataset hosted on Hugging Face and contributed by the HF Datasets community

  2. BrowseCompLongContext

    • huggingface.co
    Updated Aug 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenAI (2025). BrowseCompLongContext [Dataset]. https://huggingface.co/datasets/openai/BrowseCompLongContext
    Explore at:
    Dataset updated
    Aug 9, 2025
    Dataset authored and provided by
    OpenAIhttp://openai.com/
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    BrowseComp Long Context

    BrowseComp Long Context is a dataset based on BrowseComp to benchmark LLM’s capability to retrieve relevant information from noisy data in its context. It converts the agentic question answering tasks from Browsecomp into long context tasks. For each of the questions in a subset of BrowseComp, a list of urls are attached. Each url will be paired with an indicator indicating whether the content of the web page is required to answer the question or is… See the full description on the dataset page: https://huggingface.co/datasets/openai/BrowseCompLongContext.

  3. BrowseComp: A Benchmark for Browsing Agents

    • kaggle.com
    Updated Jun 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenAI (2025). BrowseComp: A Benchmark for Browsing Agents [Dataset]. https://www.kaggle.com/datasets/openai/browsecomp-a-benchmark-for-browsing-agents/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 11, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    OpenAI
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Leaderboard Link

    Relevant links: * implementation: https://www.kaggle.com/code/aminmohamedmohami/browsecomp-benchmark-starter-code * publication: https://arxiv.org/abs/2504.12516 * original repository: https://github.com/openai/simple-evals/tree/main

    Abstract We present BrowseComp, a simple yet challenging benchmark for measuring the ability for agents to browse the web. BrowseComp comprises 1,266 questions that require persistently navigating the internet in search of hard-to-find, entangled information. Despite the difficulty of the questions, BrowseComp is simple and easy-to-use, as predicted answers are short and easily verifiable against reference answers. BrowseComp for browsing agents can be seen as analogous to how programming competitions are an incomplete but useful benchmark for coding agents. While BrowseComp sidesteps challenges of a true user query distribution, like generating long answers or resolving ambiguity, it measures the important core capability of exercising persistence and creativity in finding information.

  4. h

    browsecomp-plus-corpus

    • huggingface.co
    Updated Sep 4, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tevatron (2025). browsecomp-plus-corpus [Dataset]. https://huggingface.co/datasets/Tevatron/browsecomp-plus-corpus
    Explore at:
    Dataset updated
    Sep 4, 2025
    Dataset authored and provided by
    Tevatron
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    BrowseComp-Plus

    Project Page | Paper | Code BrowseComp-Plus is a new benchmark for Deep-Research system, isolating the effect of the retriever and the LLM agent to enable fair, transparent comparisons of Deep-Research agents. The benchmark sources challenging, reasoning-intensive queries from OpenAI's BrowseComp. However, instead of searching the live web, BrowseComp-Plus evaluates against a fixed, curated corpus of ~100K web documents from the web. The corpus includes both… See the full description on the dataset page: https://huggingface.co/datasets/Tevatron/browsecomp-plus-corpus.

  5. h

    browsecomp-filtered

    • huggingface.co
    Updated Sep 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chaohao Yuan (2025). browsecomp-filtered [Dataset]. https://huggingface.co/datasets/ychaohao/browsecomp-filtered
    Explore at:
    Dataset updated
    Sep 15, 2025
    Authors
    Chaohao Yuan
    Description

    ychaohao/browsecomp-filtered dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    browse_comp

    • huggingface.co
    Updated Apr 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    smolagents (2025). browse_comp [Dataset]. https://huggingface.co/datasets/smolagents/browse_comp
    Explore at:
    Dataset updated
    Apr 18, 2025
    Dataset authored and provided by
    smolagents
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    smolagents/browse_comp dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. w

    browsecomp.com - Historical whois Lookup

    • whoisdatacenter.com
    csv
    Updated Aug 20, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc (2018). browsecomp.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/browsecomp.com/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Aug 20, 2018
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Aug 20, 2025
    Description

    Explore the historical Whois records related to browsecomp.com (Domain). Get insights into ownership history and changes over time.

  8. h

    browsecomp-plus-indexes

    • huggingface.co
    Updated Sep 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tevatron (2025). browsecomp-plus-indexes [Dataset]. https://huggingface.co/datasets/Tevatron/browsecomp-plus-indexes
    Explore at:
    Dataset updated
    Sep 4, 2025
    Dataset authored and provided by
    Tevatron
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    BM25, embedding index used in BrowseComp-Plus. For downloading the index: huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="bm25/*" --local-dir ./indexes huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="qwen3-embedding-0.6b/*" --local-dir ./indexes huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="qwen3-embedding-4b/*" --local-dir ./indexes huggingface-cli download… See the full description on the dataset page: https://huggingface.co/datasets/Tevatron/browsecomp-plus-indexes.

  9. h

    MMBrowseComp

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    map, MMBrowseComp [Dataset]. https://huggingface.co/datasets/mmbrowsecomp/MMBrowseComp
    Explore at:
    Authors
    map
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

    Paper: https://arxiv.org/abs/2508.13186v1 Code: https://github.com/MMBrowseComp/MM-BrowseComp The specific evaluation methods can be found in our GitHub repository.

  10. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Peng Jin (2025). browsecomp [Dataset]. https://huggingface.co/datasets/Chat-UniVi/browsecomp

browsecomp

Chat-UniVi/browsecomp

Explore at:
75 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jul 8, 2025
Authors
Peng Jin
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Chat-UniVi/browsecomp dataset hosted on Hugging Face and contributed by the HF Datasets community

Search
Clear search
Close search
Google apps
Main menu