43 datasets found
  1. Mind2Web

    • huggingface.co
    Updated Jun 12, 2023
    Cite
    OSU NLP Group (2023). Mind2Web [Dataset]. https://huggingface.co/datasets/osunlp/Mind2Web
    Dataset updated
    Jun 12, 2023
    Dataset authored and provided by
    OSU NLP Group
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset Card for Dataset Name

      Dataset Summary
    

    Mind2Web is a dataset for developing and evaluating generalist agents for the web that can follow language instructions to complete complex tasks on any website. Existing datasets for web agents either use simulated websites or only cover a limited set of websites and tasks, thus not suitable for generalist web agents. With over 2,000 open-ended tasks collected from 137 websites spanning 31 domains and crowdsourced action… See the full description on the dataset page: https://huggingface.co/datasets/osunlp/Mind2Web.

  2. Mind2Web: Generalist Agents for Web Tasks

    • kaggle.com
    zip
    Updated Dec 1, 2023
    Cite
    The Devastator (2023). Mind2Web: Generalist Agents for Web Tasks [Dataset]. https://www.kaggle.com/datasets/thedevastator/mind2web-generalist-agents-for-web-tasks
    Available download formats: zip (468820991 bytes)
    Dataset updated
    Dec 1, 2023
    Authors
    The Devastator
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Mind2Web: Generalist Agents for Web Tasks

    Language-guided Generalist Agents for Web Tasks

    By osunlp (From Huggingface) [source]

    About this dataset

    The Mind2Web dataset is a valuable resource for the development and evaluation of generalist agents that can effectively perform web tasks by comprehending and executing language instructions. This dataset supports the creation of agents capable of completing complex tasks on any website while adhering to accessibility guidelines.

    The dataset comprises various columns that provide essential information for training these generalist agents. The action_reprs column contains textual representations of the actions that can be executed by the agents on websites. These representations serve as guidance for understanding and implementing specific tasks.

    The confirmed_task column holds the language instruction for each task, i.e. the description of what the agent is asked to accomplish. It is a text field rather than a binary flag, and it is the instruction against which the recorded actions are interpreted.

    In addition, the subdomain column specifies the subdomain under which each website resides. This information helps contextualize the tasks performed within distinct web environments, enhancing versatility and adaptability.

    With these explicit features and data points present in each row of train.csv, developers can train their models more effectively using guided language instructions specific to web tasks. By leveraging this dataset, researchers can advance techniques aimed at improving web accessibility through intelligent generalist agents capable of utilizing natural language understanding to navigate an array of websites efficiently.
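As a minimal sketch of reading rows with the three columns named above, the snippet below parses CSV text with Python's standard library. The two sample rows are invented for illustration and are not taken from the dataset. It also assumes that action_reprs is serialized as a Python-style list literal, which should be checked against the real train.csv before relying on it.

```python
import ast
import csv
import io

# Hypothetical two-row stand-in for train.csv; real rows will differ.
SAMPLE_CSV = """action_reprs,confirmed_task,subdomain
"['[textbox] Search -> TYPE: pizza', '[button] Go -> CLICK']",Find a pizza place nearby,Restaurant
"['[link] Flights -> CLICK']",Open the flights page,Airlines
"""


def load_rows(text):
    """Parse CSV text into dicts, decoding action_reprs into a list of strings."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        # Assumption: action_reprs is stored as a Python-style list literal.
        row["action_reprs"] = ast.literal_eval(row["action_reprs"])
        rows.append(row)
    return rows


rows = load_rows(SAMPLE_CSV)
for row in rows:
    # One line per task: subdomain, instruction, number of recorded actions.
    print(row["subdomain"], "|", row["confirmed_task"], "|", len(row["action_reprs"]), "action(s)")
```

To try this on the real file, the in-memory string could be replaced with the contents of train.csv, provided the column names and the list encoding match the assumptions above.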

    How to use the dataset

    The Mind2Web dataset is a valuable resource for researchers and developers working on creating generalist agents capable of performing complex web tasks based on language instructions. This guide will provide you with step-by-step instructions on how to effectively use this dataset.

    • Understanding the Columns:

      • action_reprs: This column contains representations of the actions that the generalist agents can perform on a website. It provides insights into what specific actions are available for execution.
      • confirmed_task: This text column holds the language instruction for the task assigned to the generalist agent. It states what the agent is expected to accomplish on the website.
      • subdomain: The subdomain column specifies where each task is performed on a website. It helps to categorize and group tasks based on their respective subdomains.
    • Familiarize Yourself with the Dataset Structure:

      • Take some time to explore and understand how data is organized within this dataset.
      • Identify potential patterns or relationships between different columns, such as how action_reprs corresponds with confirmed_task and subdomain.
      • Look for any missing values or inconsistencies in data, which might require preprocessing before using it in your research or development projects.
    • Extraction and Cleaning of Data:

      • Based on your specific research goals, identify relevant subsets of data from this dataset that align with your objectives. For example, if you are interested in studying tasks related to e-commerce websites, focus on those entries within a particular subdomain(s).
      • Perform any necessary data cleaning steps, such as removing duplicates, handling missing values, or correcting erroneous entries. Ensuring high-quality data will lead to more reliable results during analysis.
    • Task Analysis and Model Development:

      • Task Understanding: Understand each task's requirements by analyzing its corresponding language instructions (confirmed_task column) and identify the relevant actions that need to be performed on the website (action_reprs column).
      • Model Development: Utilize machine learning or natural language processing techniques to develop models capable of interpreting and executing language instructions. Train these models using the Mind2Web dataset by providing both the instructions and corresponding actions.

    • Evaluating Model Performance:

      • Use a separate validation or test set (not included in the dataset) to evaluate your model's performance. This step is crucial for determining how well your developed model can complete new, unseen tasks accurately.
      • Measure key performance metrics like accuracy,
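The subsetting, cleaning, and evaluation steps in the list above could be sketched roughly as follows. The helper names `clean_rows` and `step_accuracy`, the sample rows, and the exact-match scoring rule are all assumptions made for illustration. In particular, the score is a simplified stand-in and not the set of metrics defined by the Mind2Web authors.

```python
def clean_rows(rows, subdomains):
    """Keep rows in the chosen subdomains, dropping incomplete and duplicate entries."""
    seen = set()
    kept = []
    for row in rows:
        task = (row.get("confirmed_task") or "").strip()
        actions = row.get("action_reprs") or []
        if not task or not actions:
            continue  # missing instruction or empty action list
        if row.get("subdomain") not in subdomains:
            continue  # outside the subset of interest
        key = (task, tuple(actions))
        if key in seen:
            continue  # exact duplicate of an earlier row
        seen.add(key)
        kept.append(row)
    return kept


def step_accuracy(predicted, reference):
    """Fraction of reference steps matched exactly, position by position."""
    if not reference:
        return 0.0
    hits = sum(p == r for p, r in zip(predicted, reference))
    return hits / len(reference)


# Hypothetical rows, invented for this sketch.
pizza = {
    "confirmed_task": "Find a pizza place nearby",
    "subdomain": "Restaurant",
    "action_reprs": ["[textbox] Search -> TYPE: pizza", "[button] Go -> CLICK"],
}
rows = [
    pizza,
    dict(pizza),  # duplicate entry
    {"confirmed_task": "", "subdomain": "Restaurant", "action_reprs": ["[button] Go -> CLICK"]},
    {"confirmed_task": "Open the flights page", "subdomain": "Airlines",
     "action_reprs": ["[link] Flights -> CLICK"]},
]

subset = clean_rows(rows, {"Restaurant"})
predicted = ["[textbox] Search -> TYPE: pizza", "[button] Cancel -> CLICK"]
score = step_accuracy(predicted, subset[0]["action_reprs"])
print(len(subset), score)
```

Scoring against a held-out split rather than the training rows keeps the number meaningful for unseen tasks, which is the point the evaluation step makes.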

    Research Ideas

    • Training and evaluating generalist agents: The dataset can be used to train and evaluate generalist agents, which are capab...
  3. Online-Mind2Web

    • huggingface.co
    Updated Apr 9, 2025
    Cite
    OSU NLP Group (2025). Online-Mind2Web [Dataset]. https://huggingface.co/datasets/osunlp/Online-Mind2Web
    Dataset updated
    Apr 9, 2025
    Dataset authored and provided by
    OSU NLP Group
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Blog | Paper | Code | Leaderboard

      Online-Mind2Web
    

    Online-Mind2Web is the online version of Mind2Web, a more diverse and user-centric dataset that includes 300 high-quality tasks from 136 popular websites across various domains. The dataset covers a diverse set of user tasks, such as clothing, food, housing, and transportation, to evaluate web agents' performance in a real-world online environment.

      News
    

    [11/03/2025] We’ve updated 36 tasks that are no longer… See the full description on the dataset page: https://huggingface.co/datasets/osunlp/Online-Mind2Web.

  4. Mind2Web-2

    • huggingface.co
    Updated Jun 27, 2025
    Cite
    OSU NLP Group (2025). Mind2Web-2 [Dataset]. https://huggingface.co/datasets/osunlp/Mind2Web-2
    Dataset updated
    Jun 27, 2025
    Dataset authored and provided by
    OSU NLP Group
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Mind2Web 2

    Mind2Web 2 is an evaluation framework for agentic search capabilities, featuring Agent-as-a-Judge methodology for comprehensive assessment of web automation agents.

    Mind2Web 2 features realistic and diverse long-horizon web search tasks and a novel Agent-as-a-Judge framework to evaluate complex, time-varying, and citation-backed answers.

      🔗 Links
    

    🏠 Homepage 🏆 Leaderboard 📖 Paper 💻 Code

      🔄 Changelog
    

    Oct 23, 2025: Updated several tasks… See the full description on the dataset page: https://huggingface.co/datasets/osunlp/Mind2Web-2.

  5. minibench-mm-mind2web

    • huggingface.co
    Updated Sep 29, 2025
    + more versions
    Cite
    WPRM (2025). minibench-mm-mind2web [Dataset]. https://huggingface.co/datasets/WPRM/minibench-mm-mind2web
    Dataset updated
    Sep 29, 2025
    Dataset authored and provided by
    WPRM
    Description

    WPRM/minibench-mm-mind2web dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. Online-Mind2Web-Test

    • huggingface.co
    Updated Nov 5, 2025
    + more versions
    Cite
    Genteki Zhang (2025). Online-Mind2Web-Test [Dataset]. https://huggingface.co/datasets/Genteki/Online-Mind2Web-Test
    Dataset updated
    Nov 5, 2025
    Authors
    Genteki Zhang
    Description

    Genteki/Online-Mind2Web-Test dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. Online-Mind2Web

    • huggingface.co
    Updated Oct 9, 2025
    + more versions
    Cite
    hud (2025). Online-Mind2Web [Dataset]. https://huggingface.co/datasets/hud-evals/Online-Mind2Web
    Dataset updated
    Oct 9, 2025
    Dataset authored and provided by
    hud
    Description

    hud-evals/Online-Mind2Web dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. izaz-mind2web-dataset

    • huggingface.co
    Updated Mar 13, 2024
    Cite
    AHMAD (2024). izaz-mind2web-dataset [Dataset]. https://huggingface.co/datasets/Izazk/izaz-mind2web-dataset
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Mar 13, 2024
    Authors
    AHMAD
    Description

    Dataset Card for Dataset Name

    This dataset card aims to be a base template for new datasets. It has been generated using this raw template.

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    Curated by: [More Information Needed]
    Funded by [optional]: [More Information Needed]
    Shared by [optional]: [More Information Needed]
    Language(s) (NLP): [More Information Needed]
    License: [More Information Needed]

      Dataset Sources [optional]
    

    Repository: [More… See the full description on the dataset page: https://huggingface.co/datasets/Izazk/izaz-mind2web-dataset.

  9. web-agent-mind2web

    • huggingface.co
    Updated May 31, 2025
    + more versions
    Cite
    Isaiah (2025). web-agent-mind2web [Dataset]. https://huggingface.co/datasets/isaiahbjork/web-agent-mind2web
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    May 31, 2025
    Authors
    Isaiah
    Description

    isaiahbjork/web-agent-mind2web dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. Multimodal-Mind2Web-HTML-WM-messages-test

    • huggingface.co
    Updated Nov 20, 2024
    + more versions
    Cite
    Language & AGI Lab (2024). Multimodal-Mind2Web-HTML-WM-messages-test [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Multimodal-Mind2Web-HTML-WM-messages-test
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Nov 20, 2024
    Dataset authored and provided by
    Language & AGI Lab
    Description

    LangAGI-Lab/Multimodal-Mind2Web-HTML-WM-messages-test dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. mind2web-subset-human

    • huggingface.co
    Updated Nov 26, 2025
    Cite
    Joan (2025). mind2web-subset-human [Dataset]. https://huggingface.co/datasets/josancamon/mind2web-subset-human
    Dataset updated
    Nov 26, 2025
    Authors
    Joan
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Mind2Web Subset - Human Demonstrations

    A collection of human-demonstrated web navigation tasks with detailed interaction traces. This dataset captures real browser interactions including clicks, typing, scrolling, DOM states, screenshots, and HTTP requests for web agent training and evaluation.

      Overview
    

    This dataset contains tasks performed by humans in real web environments, capturing:

    Golden trajectories: Step-by-step sequences of actions (clicks, typing, navigation)… See the full description on the dataset page: https://huggingface.co/datasets/josancamon/mind2web-subset-human.

  12. minibench-multimodal-mind2web

    • huggingface.co
    Updated Apr 10, 2025
    Cite
    Hyungjoo Chae (2025). minibench-multimodal-mind2web [Dataset]. https://huggingface.co/datasets/hyungjoochae/minibench-multimodal-mind2web
    Dataset updated
    Apr 10, 2025
    Authors
    Hyungjoo Chae
    Description

    hyungjoochae/minibench-multimodal-mind2web dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. Online-Mind2Web-Tiny-Test

    • huggingface.co
    Updated Nov 5, 2025
    + more versions
    Cite
    Genteki Zhang (2025). Online-Mind2Web-Tiny-Test [Dataset]. https://huggingface.co/datasets/Genteki/Online-Mind2Web-Tiny-Test
    Dataset updated
    Nov 5, 2025
    Authors
    Genteki Zhang
    Description

    Genteki/Online-Mind2Web-Tiny-Test dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. Multimodal-Mind2Web-HTML-WM-messages-filter-35000

    • huggingface.co
    Updated Nov 20, 2024
    Cite
    Language & AGI Lab (2024). Multimodal-Mind2Web-HTML-WM-messages-filter-35000 [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Multimodal-Mind2Web-HTML-WM-messages-filter-35000
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Nov 20, 2024
    Dataset authored and provided by
    Language & AGI Lab
    Description

    LangAGI-Lab/Multimodal-Mind2Web-HTML-WM-messages-filter-35000 dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. Mind2Web-HTML-cleaned-lite-with-desc_w_tao_value_rationale

    • huggingface.co
    Updated Sep 28, 2024
    + more versions
    Cite
    Language & AGI Lab (2024). Mind2Web-HTML-cleaned-lite-with-desc_w_tao_value_rationale [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Mind2Web-HTML-cleaned-lite-with-desc_w_tao_value_rationale
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Sep 28, 2024
    Dataset authored and provided by
    Language & AGI Lab
    Description

    LangAGI-Lab/Mind2Web-HTML-cleaned-lite-with-desc_w_tao_value_rationale dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. Sequence-of-action-prediction-mind2web

    • huggingface.co
    Updated Jun 15, 2024
    Cite
    AHMAD (2024). Sequence-of-action-prediction-mind2web [Dataset]. https://huggingface.co/datasets/Izazk/Sequence-of-action-prediction-mind2web
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Jun 15, 2024
    Authors
    AHMAD
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Izazk/Sequence-of-action-prediction-mind2web dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. Mind2Web-cleaned-lite-value-model-w-cot-formatted-test

    • huggingface.co
    Updated Sep 19, 2024
    + more versions
    Cite
    Language & AGI Lab (2024). Mind2Web-cleaned-lite-value-model-w-cot-formatted-test [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Mind2Web-cleaned-lite-value-model-w-cot-formatted-test
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Sep 19, 2024
    Dataset authored and provided by
    Language & AGI Lab
    Description

    Dataset Card for "Mind2Web-cleaned-lite-value-model-w-cot-formatted-test"

    More Information needed

  18. mind2web-mcq-dataset

    • huggingface.co
    Cite
    Advait Gupta, mind2web-mcq-dataset [Dataset]. https://huggingface.co/datasets/advaitgupta/mind2web-mcq-dataset
    Authors
    Advait Gupta
    Description

    advaitgupta/mind2web-mcq-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. Mind2Web_train_llava

    • huggingface.co
    Updated Oct 19, 2024
    + more versions
    Cite
    NeuLab @ LTI/CMU (2024). Mind2Web_train_llava [Dataset]. https://huggingface.co/datasets/neulab/Mind2Web_train_llava
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Oct 19, 2024
    Dataset authored and provided by
    NeuLab @ LTI/CMU
    License

    https://choosealicense.com/licenses/odc-by/

    Description

    Mind2Web training set for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding

    🌐 Homepage | 🐍 GitHub | 📖 arXiv

      Introduction
    

    We introduce MultiUI, a dataset containing 7.3 million samples from 1 million websites, covering diverse multimodal tasks and UI layouts. Models trained on MultiUI not only excel in web UI tasks, achieving up to a 48% improvement on VisualWebBench and a 19.1% boost in action accuracy on a web agent dataset Mind2Web, but also… See the full description on the dataset page: https://huggingface.co/datasets/neulab/Mind2Web_train_llava.

  20. Mind2Web-cleaned-lite-value-model

    • huggingface.co
    Updated Apr 16, 2023
    + more versions
    Cite
    Language & AGI Lab (2023). Mind2Web-cleaned-lite-value-model [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Mind2Web-cleaned-lite-value-model
    Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
    Dataset updated
    Apr 16, 2023
    Dataset authored and provided by
    Language & AGI Lab
    Description

    Dataset Card for "Mind2Web-cleaned-lite-value-model"

    More Information needed
