43 datasets found

Mind2Web
huggingface.co
Updated Jun 12, 2023
Cite
OSU NLP Group (2023). Mind2Web [Dataset]. https://huggingface.co/datasets/osunlp/Mind2Web
Dataset updated
Jun 12, 2023
Dataset authored and provided by
OSU NLP Group
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Card for Dataset Name

Dataset Summary

Mind2Web is a dataset for developing and evaluating generalist agents for the web that can follow language instructions to complete complex tasks on any website. Existing datasets for web agents either use simulated websites or only cover a limited set of websites and tasks, thus not suitable for generalist web agents. With over 2,000 open-ended tasks collected from 137 websites spanning 31 domains and crowdsourced action… See the full description on the dataset page: https://huggingface.co/datasets/osunlp/Mind2Web.
Mind2Web: Generalist Agents for Web Tasks
kaggle.com
zip
Updated Dec 1, 2023
Cite
The Devastator (2023). Mind2Web: Generalist Agents for Web Tasks [Dataset]. https://www.kaggle.com/datasets/thedevastator/mind2web-generalist-agents-for-web-tasks
Available download formats: zip (468820991 bytes)
Dataset updated
Dec 1, 2023
Authors
The Devastator
License
https://creativecommons.org/publicdomain/zero/1.0/
Description
Mind2Web: Generalist Agents for Web Tasks

Language-guided Generalist Agents for Web Tasks

By osunlp (From Huggingface) [source]

About this dataset

The Mind2Web dataset is a valuable resource for the development and evaluation of generalist agents that can effectively perform web tasks by comprehending and executing language instructions. This dataset supports the creation of agents capable of completing complex tasks on any website while adhering to accessibility guidelines.

The dataset comprises several columns that provide the information needed to train these agents. The action_reprs column contains textual representations of the actions performed on a website to complete a task. These representations record, step by step, which element was acted on and how.

The confirmed_task column holds the natural-language description of the task that the agent is asked to complete. It is the instruction that the recorded actions carry out.

The subdomain column specifies the subdomain to which each website belongs. This information places each task in its web environment and lets tasks be grouped by type of site.

With these features present in each row of train.csv, developers can train their models on language instructions specific to web tasks. By using this dataset, researchers can advance techniques aimed at improving web accessibility through generalist agents that use natural language understanding to navigate a wide range of websites.

How to use the dataset

The Mind2Web dataset is a valuable resource for researchers and developers working on creating generalist agents capable of performing complex web tasks based on language instructions. This guide will provide you with step-by-step instructions on how to effectively use this dataset.

Understanding the Columns:

action_reprs: This column contains textual representations of the actions performed on a website to complete a task. It shows which steps make up each recorded task.

confirmed_task: This text column contains the natural-language instruction for the task. It tells the agent what has to be accomplished on the website.

subdomain: The subdomain column specifies the subdomain of the website on which each task is performed. It helps to categorize and group tasks based on their respective subdomains.
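As a minimal sketch of how a single action_reprs entry might be read, the snippet below splits one action string into its target element, visible label, operation, and optional value. The `[element]  label -> OPERATION: value` layout and the sample string are assumptions made for illustration, and `Action` and `parse_action_repr` are hypothetical helper names; check the layout against the actual contents of train.csv before relying on it.

```python
import re
from typing import NamedTuple, Optional


class Action(NamedTuple):
    element: str          # kind of element, e.g. "textbox"
    label: str            # visible text of the element
    operation: str        # e.g. "CLICK", "TYPE", "SELECT"
    value: Optional[str]  # typed or selected value, if any


# Assumed layout: "[element]  label -> OPERATION: value" (the value part is optional).
_ACTION_RE = re.compile(
    r"^\[(?P<element>[^\]]+)\]\s*(?P<label>.*?)\s*->\s*"
    r"(?P<operation>[A-Z]+)(?::\s*(?P<value>.*))?$"
)


def parse_action_repr(text: str) -> Action:
    """Parse one action string; raise ValueError if it does not fit the assumed layout."""
    match = _ACTION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized action representation: {text!r}")
    return Action(match["element"], match["label"], match["operation"], match["value"])


# Hypothetical sample string, not taken from the dataset.
print(parse_action_repr("[textbox]  Where? -> TYPE: Brooklyn"))
```

Raising on strings that do not match keeps unexpected formats visible instead of silently producing empty fields.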

Familiarize Yourself with the Dataset Structure:

Take some time to explore and understand how data is organized within this dataset.

Identify potential patterns or relationships between different columns, such as how action_reprs corresponds with confirmed_task and subdomain.

Look for any missing values or inconsistencies in data, which might require preprocessing before using it in your research or development projects.
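The check for missing values described above could be sketched as follows, using only the standard library. The three-row CSV excerpt is invented for illustration and stands in for train.csv; the column names come from the description, while `count_missing` is a hypothetical helper.

```python
import csv
import io
from collections import Counter

# Hypothetical three-row excerpt standing in for train.csv.
SAMPLE_CSV = """subdomain,confirmed_task,action_reprs
Airlines,Find a one-way flight to Boston,"['[button]  Search -> CLICK']"
Restaurant,,"['[textbox]  Where? -> TYPE: Brooklyn']"
Airlines,Check flight status,
"""


def count_missing(rows, columns):
    """Count empty or whitespace-only cells per column."""
    missing = Counter({column: 0 for column in columns})
    for row in rows:
        for column in columns:
            if not (row.get(column) or "").strip():
                missing[column] += 1
    return dict(missing)


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
print(count_missing(rows, ["subdomain", "confirmed_task", "action_reprs"]))
```

To run the same check on the real file, replace `io.StringIO(SAMPLE_CSV)` with an open file handle for train.csv.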

Extraction and Cleaning of Data:

Based on your specific research goals, identify relevant subsets of data from this dataset that align with your objectives. For example, if you are interested in studying tasks related to e-commerce websites, focus on those entries within a particular subdomain(s).

Perform any necessary data cleaning steps, such as removing duplicates, handling missing values, or correcting erroneous entries. Ensuring high-quality data will lead to more reliable results during analysis.
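The extraction and cleaning steps above can be sketched as one pass over the rows: keep the subdomains of interest, drop rows without a task, and skip repeated tasks. The sample rows and the `select_tasks` helper are hypothetical; only the column names come from the description.

```python
def select_tasks(rows, subdomains):
    """Keep rows from the given subdomains, dropping incomplete rows and repeated tasks."""
    wanted = {name.lower() for name in subdomains}
    seen = set()
    selected = []
    for row in rows:
        task = (row.get("confirmed_task") or "").strip()
        subdomain = (row.get("subdomain") or "").strip()
        if not task or subdomain.lower() not in wanted:
            continue  # missing task text, or a subdomain outside the study
        key = (subdomain.lower(), task.lower())
        if key in seen:
            continue  # same task already kept for this subdomain
        seen.add(key)
        selected.append(row)
    return selected


# Hypothetical rows for illustration only.
rows = [
    {"subdomain": "Airlines", "confirmed_task": "Check flight status"},
    {"subdomain": "airlines", "confirmed_task": "Check flight status"},   # duplicate
    {"subdomain": "Restaurant", "confirmed_task": "Book a table for two"},
    {"subdomain": "Airlines", "confirmed_task": ""},                      # missing task
]
print(len(select_tasks(rows, ["Airlines"])))
```

Comparing case-insensitively is a design choice made here so that differently capitalized labels count as the same subdomain; drop the `.lower()` calls if the labels in the file are already consistent.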

Task Analysis and Model Development:

i) Task Understanding: Understand each task's requirements by analyzing its corresponding language instructions (confirmed_task column) and identify the relevant actions that need to be performed on the website (action_reprs column).

ii) Model Development: Utilize machine learning or natural language processing techniques to develop models capable of interpreting and executing language instructions. Train these models using the Mind2Web dataset by providing both the instructions and corresponding actions.
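One way to pair instructions with actions for training is to turn each task into step-wise examples: the prompt holds the instruction plus the actions taken so far, and the target is the next action. The sketch below assumes that action_reprs is stored in the CSV as a stringified Python list, which may not match the actual file; the prompt wording, the sample row, and the `build_examples` helper are hypothetical. It also ignores the page content that a real web agent would need to see.

```python
import ast


def build_examples(task, action_reprs):
    """Turn one task into (prompt, target) pairs, one pair per action step."""
    if isinstance(action_reprs, str):
        # Assumption: the CSV cell holds a stringified Python list of action strings.
        actions = ast.literal_eval(action_reprs)
    else:
        actions = list(action_reprs)
    examples = []
    for step, target in enumerate(actions):
        history = "; ".join(actions[:step]) or "none"
        prompt = f"Task: {task}\nPrevious actions: {history}\nNext action:"
        examples.append((prompt, target))
    return examples


# Hypothetical row for illustration only.
row = {
    "confirmed_task": "Find a restaurant in Brooklyn",
    "action_reprs": "['[textbox]  Where? -> TYPE: Brooklyn', '[button]  Search -> CLICK']",
}
for prompt, target in build_examples(row["confirmed_task"], row["action_reprs"]):
    print(prompt, target)
```

`ast.literal_eval` is used instead of `eval` so that cell contents are parsed as data and never executed.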

Evaluating Model Performance:

Use a separate validation or test set (not included in the dataset) to evaluate your model's performance. This step is crucial for determining how well your developed model can complete new, unseen tasks accurately.

Measure key performance metrics like accuracy,
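Since the description notes that no separate test set is included, one option is to hold out part of the rows yourself and score predictions against the recorded actions. The sketch below is one such approach, not an official evaluation protocol: `split_rows`, `step_accuracy`, the 20% hold-out fraction, and exact string matching are all assumptions.

```python
import random


def split_rows(rows, test_fraction=0.2, seed=0):
    """Shuffle rows reproducibly and hold out a fraction of them for testing."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def step_accuracy(predicted, reference):
    """Fraction of reference steps whose predicted action matches exactly."""
    if not reference:
        return 0.0
    hits = sum(p == r for p, r in zip(predicted, reference))
    return hits / len(reference)


train, test = split_rows(range(10))
print(len(train), len(test))
print(step_accuracy(["a", "b"], ["a", "c"]))
```

Dividing by the number of reference steps means that a prediction shorter than the reference is penalized for the steps it omits.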

Research Ideas

Training and evaluating generalist agents: The dataset can be used to train and evaluate generalist agents, which are capab...
Online-Mind2Web
huggingface.co
Updated Apr 9, 2025
Cite
OSU NLP Group (2025). Online-Mind2Web [Dataset]. https://huggingface.co/datasets/osunlp/Online-Mind2Web
Dataset updated
Apr 9, 2025
Dataset authored and provided by
OSU NLP Group
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Blog | Paper | Code | Leaderboard

Online-Mind2Web

Online-Mind2Web is the online version of Mind2Web, a more diverse and user-centric dataset that includes 300 high-quality tasks from 136 popular websites across various domains. The dataset covers a diverse set of user tasks, such as clothing, food, housing, and transportation, to evaluate web agents' performance in a real-world online environment.

News

[11/03/2025] We’ve updated 36 tasks that are no longer… See the full description on the dataset page: https://huggingface.co/datasets/osunlp/Online-Mind2Web.
Mind2Web-2
huggingface.co
Updated Jun 27, 2025
Cite
OSU NLP Group (2025). Mind2Web-2 [Dataset]. https://huggingface.co/datasets/osunlp/Mind2Web-2
Dataset updated
Jun 27, 2025
Dataset authored and provided by
OSU NLP Group
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Mind2Web 2

Mind2Web 2 is an evaluation framework for agentic search capabilities, featuring Agent-as-a-Judge methodology for comprehensive assessment of web automation agents.

Mind2Web 2 features realistic and diverse long-horizon web search tasks and a novel Agent-as-a-Judge framework to evaluate complex, time-varying, and citation-backed answers.

🔗 Links

🏠 Homepage 🏆 Leaderboard 📖 Paper 💻 Code

🔄 Changelog

Oct 23, 2025: Updated several tasks… See the full description on the dataset page: https://huggingface.co/datasets/osunlp/Mind2Web-2.
minibench-mm-mind2web
huggingface.co
Updated Sep 29, 2025
+ more versions
Cite
WPRM (2025). minibench-mm-mind2web [Dataset]. https://huggingface.co/datasets/WPRM/minibench-mm-mind2web
Dataset updated
Sep 29, 2025
Dataset authored and provided by
WPRM
Description
WPRM/minibench-mm-mind2web dataset hosted on Hugging Face and contributed by the HF Datasets community
Online-Mind2Web-Test
huggingface.co
Updated Nov 5, 2025
+ more versions
Cite
Genteki Zhang (2025). Online-Mind2Web-Test [Dataset]. https://huggingface.co/datasets/Genteki/Online-Mind2Web-Test
Dataset updated
Nov 5, 2025
Authors
Genteki Zhang
Description
Genteki/Online-Mind2Web-Test dataset hosted on Hugging Face and contributed by the HF Datasets community
Online-Mind2Web
huggingface.co
Updated Oct 9, 2025
+ more versions
Cite
hud (2025). Online-Mind2Web [Dataset]. https://huggingface.co/datasets/hud-evals/Online-Mind2Web
Dataset updated
Oct 9, 2025
Dataset authored and provided by
hud
Description
hud-evals/Online-Mind2Web dataset hosted on Hugging Face and contributed by the HF Datasets community
izaz-mind2web-dataset
huggingface.co
Updated Mar 13, 2024
Cite
AHMAD (2024). izaz-mind2web-dataset [Dataset]. https://huggingface.co/datasets/Izazk/izaz-mind2web-dataset
Dataset updated
Mar 13, 2024
Authors
AHMAD
Description
Dataset Card for Dataset Name

This dataset card aims to be a base template for new datasets. It has been generated using this raw template.

Dataset Details Dataset Description

Curated by: [More Information Needed] Funded by [optional]: [More Information Needed] Shared by [optional]: [More Information Needed] Language(s) (NLP): [More Information Needed] License: [More Information Needed]

Dataset Sources [optional]

Repository: [More… See the full description on the dataset page: https://huggingface.co/datasets/Izazk/izaz-mind2web-dataset.
web-agent-mind2web
huggingface.co
Updated May 31, 2025
+ more versions
Cite
Isaiah (2025). web-agent-mind2web [Dataset]. https://huggingface.co/datasets/isaiahbjork/web-agent-mind2web
Dataset updated
May 31, 2025
Authors
Isaiah
Description
isaiahbjork/web-agent-mind2web dataset hosted on Hugging Face and contributed by the HF Datasets community
Multimodal-Mind2Web-HTML-WM-messages-test
huggingface.co
Updated Nov 20, 2024
+ more versions
Cite
Language & AGI Lab (2024). Multimodal-Mind2Web-HTML-WM-messages-test [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Multimodal-Mind2Web-HTML-WM-messages-test
Dataset updated
Nov 20, 2024
Dataset authored and provided by
Language & AGI Lab
Description
LangAGI-Lab/Multimodal-Mind2Web-HTML-WM-messages-test dataset hosted on Hugging Face and contributed by the HF Datasets community
mind2web-subset-human
huggingface.co
Updated Nov 26, 2025
Cite
Joan (2025). mind2web-subset-human [Dataset]. https://huggingface.co/datasets/josancamon/mind2web-subset-human
Dataset updated
Nov 26, 2025
Authors
Joan
License
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Description
Mind2Web Subset - Human Demonstrations

A collection of human-demonstrated web navigation tasks with detailed interaction traces. This dataset captures real browser interactions including clicks, typing, scrolling, DOM states, screenshots, and HTTP requests for web agent training and evaluation.

Overview

This dataset contains tasks performed by humans in real web environments, capturing:

Golden trajectories: Step-by-step sequences of actions (clicks, typing, navigation)… See the full description on the dataset page: https://huggingface.co/datasets/josancamon/mind2web-subset-human.
minibench-multimodal-mind2web
huggingface.co
Updated Apr 10, 2025
Cite
Hyungjoo Chae (2025). minibench-multimodal-mind2web [Dataset]. https://huggingface.co/datasets/hyungjoochae/minibench-multimodal-mind2web
Dataset updated
Apr 10, 2025
Authors
Hyungjoo Chae
Description
hyungjoochae/minibench-multimodal-mind2web dataset hosted on Hugging Face and contributed by the HF Datasets community
Online-Mind2Web-Tiny-Test
huggingface.co
Updated Nov 5, 2025
+ more versions
Cite
Genteki Zhang (2025). Online-Mind2Web-Tiny-Test [Dataset]. https://huggingface.co/datasets/Genteki/Online-Mind2Web-Tiny-Test
Dataset updated
Nov 5, 2025
Authors
Genteki Zhang
Description
Genteki/Online-Mind2Web-Tiny-Test dataset hosted on Hugging Face and contributed by the HF Datasets community
Multimodal-Mind2Web-HTML-WM-messages-filter-35000
huggingface.co
Updated Nov 20, 2024
Cite
Language & AGI Lab (2024). Multimodal-Mind2Web-HTML-WM-messages-filter-35000 [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Multimodal-Mind2Web-HTML-WM-messages-filter-35000
Dataset updated
Nov 20, 2024
Dataset authored and provided by
Language & AGI Lab
Description
LangAGI-Lab/Multimodal-Mind2Web-HTML-WM-messages-filter-35000 dataset hosted on Hugging Face and contributed by the HF Datasets community
Mind2Web-HTML-cleaned-lite-with-desc_w_tao_value_rationale
huggingface.co
Updated Sep 28, 2024
+ more versions
Cite
Language & AGI Lab (2024). Mind2Web-HTML-cleaned-lite-with-desc_w_tao_value_rationale [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Mind2Web-HTML-cleaned-lite-with-desc_w_tao_value_rationale
Dataset updated
Sep 28, 2024
Dataset authored and provided by
Language & AGI Lab
Description
LangAGI-Lab/Mind2Web-HTML-cleaned-lite-with-desc_w_tao_value_rationale dataset hosted on Hugging Face and contributed by the HF Datasets community
Sequence-of-action-prediction-mind2web
huggingface.co
Updated Jun 15, 2024
Cite
AHMAD (2024). Sequence-of-action-prediction-mind2web [Dataset]. https://huggingface.co/datasets/Izazk/Sequence-of-action-prediction-mind2web
Dataset updated
Jun 15, 2024
Authors
AHMAD
License
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Description
Izazk/Sequence-of-action-prediction-mind2web dataset hosted on Hugging Face and contributed by the HF Datasets community
Mind2Web-cleaned-lite-value-model-w-cot-formatted-test
huggingface.co
Updated Sep 19, 2024
+ more versions
Cite
Language & AGI Lab (2024). Mind2Web-cleaned-lite-value-model-w-cot-formatted-test [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Mind2Web-cleaned-lite-value-model-w-cot-formatted-test
Dataset updated
Sep 19, 2024
Dataset authored and provided by
Language & AGI Lab
Description
Dataset Card for "Mind2Web-cleaned-lite-value-model-w-cot-formatted-test"

More Information needed
mind2web-mcq-dataset
huggingface.co
Cite
Advait Gupta, mind2web-mcq-dataset [Dataset]. https://huggingface.co/datasets/advaitgupta/mind2web-mcq-dataset
Authors
Advait Gupta
Description
advaitgupta/mind2web-mcq-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
Mind2Web_train_llava
huggingface.co
Updated Oct 19, 2024
+ more versions
Cite
NeuLab @ LTI/CMU (2024). Mind2Web_train_llava [Dataset]. https://huggingface.co/datasets/neulab/Mind2Web_train_llava
Dataset updated
Oct 19, 2024
Dataset authored and provided by
NeuLab @ LTI/CMU
License
https://choosealicense.com/licenses/odc-by/
Description
Mind2Web training set for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding

🌐 Homepage | 🐍 GitHub | 📖 arXiv

Introduction

We introduce MultiUI, a dataset containing 7.3 million samples from 1 million websites, covering diverse multimodal tasks and UI layouts. Models trained on MultiUI not only excel in web UI tasks (achieving up to a 48% improvement on VisualWebBench and a 19.1% boost in action accuracy on the web agent dataset Mind2Web) but also… See the full description on the dataset page: https://huggingface.co/datasets/neulab/Mind2Web_train_llava.
Mind2Web-cleaned-lite-value-model
huggingface.co
Updated Apr 16, 2023
+ more versions
Cite
Language & AGI Lab (2023). Mind2Web-cleaned-lite-value-model [Dataset]. https://huggingface.co/datasets/LangAGI-Lab/Mind2Web-cleaned-lite-value-model
Dataset updated
Apr 16, 2023
Dataset authored and provided by
Language & AGI Lab
Description
Dataset Card for "Mind2Web-cleaned-lite-value-model"

More Information needed
