Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
OS-Copilot/ScreenSpot-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community
Dataset Card for ScreenSpot-V2
This is a FiftyOne dataset with 1272 samples.
Installation
If you haven't already, install FiftyOne: pip install -U fiftyone
Usage
import fiftyone as fo
from fiftyone.utils.huggingface import load_from_hub
dataset = load_from_hub("Voxel51/ScreenSpot-v2")
session = fo.launch_app(dataset)
Dataset Details… See the full description on the dataset page: https://huggingface.co/datasets/Voxel51/ScreenSpot-v2.
ScreenSpot-v2-variaints
The ScreenSpot dataset with 4 types of instructions:
instruction: the original instruction from ScreenSpot.
action: clarifies the action to take.
description: describes the target UI element.
negative: an operation that cannot be done in the screenshot.
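The four instruction types can be pictured as fields on a single record. The following is only a minimal sketch: the class name `InstructionVariants`, the helper `positive_prompts`, and the example strings are hypothetical and are not taken from the dataset or its repository; only the four field names come from the description above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class InstructionVariants:
    """Hypothetical container for the four instruction types of one sample."""

    instruction: str  # original instruction from ScreenSpot
    action: str       # clarifies the action to take
    description: str  # describes the target UI element
    negative: str     # an operation that cannot be done in the screenshot

    def positive_prompts(self) -> List[str]:
        """Return the three variants that refer to an element that exists."""
        return [self.instruction, self.action, self.description]


# Made-up example values, for illustration only.
sample = InstructionVariants(
    instruction="close the tab",
    action="click the close button on the current tab",
    description="a small x icon at the right edge of the tab",
    negative="open the printer settings",
)

for prompt in sample.positive_prompts():
    print(prompt)
```

Keeping the negative variant separate makes it easy to evaluate it differently, since it has no target element to locate.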
Compatible code can be found in the GitHub repo above.
mikaelnias/screenspot-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community
lmms-lab/ScreenSpot-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Dataset Card for ScreenSpot
GUI Grounding Benchmark: ScreenSpot. Created by researchers at Nanjing University and Shanghai AI Laboratory for evaluating large multimodal models (LMMs) on GUI grounding tasks on screens, given a text-based instruction.
Dataset Details
Dataset Description
ScreenSpot is an evaluation benchmark for GUI grounding, comprising over 1200 instructions from iOS, Android, macOS, Windows and Web environments, along with annotated element types… See the full description on the dataset page: https://huggingface.co/datasets/rootsautomation/ScreenSpot.
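One common way to score GUI grounding is to count a prediction as correct when the predicted click point falls inside the annotated element's bounding box. The sketch below illustrates that idea only. It assumes boxes are given as (left, top, right, bottom) in pixels, which may not match the dataset's actual annotation format, and the function names are hypothetical.

```python
from typing import Sequence, Tuple

Box = Tuple[float, float, float, float]  # assumed (left, top, right, bottom)
Point = Tuple[float, float]              # assumed (x, y) click position


def point_in_box(point: Point, box: Box) -> bool:
    """Return True if the point lies inside the box, edges included."""
    x, y = point
    left, top, right, bottom = box
    return left <= x <= right and top <= y <= bottom


def grounding_accuracy(predictions: Sequence[Point], boxes: Sequence[Box]) -> float:
    """Fraction of predicted points that land inside their target box."""
    pairs = list(zip(predictions, boxes))
    if not pairs:
        return 0.0
    hits = sum(point_in_box(p, b) for p, b in pairs)
    return hits / len(pairs)


# Made-up predictions and boxes, for illustration only.
preds = [(5.0, 5.0), (20.0, 20.0)]
targets = [(0.0, 0.0, 10.0, 10.0), (0.0, 0.0, 10.0, 10.0)]
print(grounding_accuracy(preds, targets))
```

If the annotations store boxes as (x, y, width, height) instead, convert them before calling `point_in_box`.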
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use
Note: Data viewer is not configured because the large size of the images will trigger loading errors.
Data Details
| Category | Abbr. | Application | Edition & Version | OS | Icons | Texts |
|---|---|---|---|---|---|---|
| Development and Programming | VSC | Visual Studio Code | 1.95 | macOS | 22 | 33 |
| | PyC | PyCharm | 2023.3 | macOS | 38 | 40 |
| | AS | Android Studio | 2022.2 | macOS | 44 | 36 |

Qrs Quartus II 13.0… See the full description on the dataset page: https://huggingface.co/datasets/likaixin/ScreenSpot-Pro.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Screenshot is a dataset for object detection tasks. It contains Expandbutton and BackButton annotations for 1,635 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
jdubkim/ScreenSpot-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community
andersonbcdefg/screenspot-v2-images dataset hosted on Hugging Face and contributed by the HF Datasets community
A screenshot image taken on March 20, 2025, possibly related to the term 'lsSGdk'.
pbcong/screenspot-v2-test dataset hosted on Hugging Face and contributed by the HF Datasets community
andersonbcdefg/screenspot-v2-annots-desktop dataset hosted on Hugging Face and contributed by the HF Datasets community
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Screenshot of ClinMiner ontology browser.
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
PixelWeb: The First Web GUI Dataset with Pixel-Wise Labels
https://arxiv.org/abs/2504.16419
Dataset Description
PixelWeb-1K: 1,000 webpages with raw data (screenshot, element images (PNG), DOM data) and annotated data (mask, contour, bbox).
PixelWeb-10K: 10,000 webpages with raw data (screenshot, element images (PNG), DOM data) and annotated data (mask, contour, bbox).
PixelWeb-100K: 100,000 webpages with raw data (screenshot, element images (PNG), DOM data) and annotated… See the full description on the dataset page: https://huggingface.co/datasets/cyberalchemist/PixelWeb.