Facebook
Twittermichelledai/openvla-data dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterSichang0621/openvla-oft-model dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterExploration/Simpler-Collections-OpenVLA dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterSichang0621/openvla-oft-libero-goal dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterSichang0621/openvla-oft-libero-10 dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created using LeRobot.
Dataset Structure
meta/info.json: { "codebase_version": "v2.1", "robot_type": "franka", "total_episodes": 30, "total_frames": 34891, "total_tasks": 1, "total_videos": 60, "total_chunks": 1, "chunks_size": 1000, "fps": 30, "splits": { "train": "0:30" }, "data_path": "data/chunk-{episode_chunk:03d}/episode_{episode_index:06d}.parquet", "video_path":… See the full description on the dataset page: https://huggingface.co/datasets/zjushining/franka-record-red-block-0730-openvla.
Facebook
TwitterDon't Blind Your VLA: Aligning Visual Representations for OOD Generalization
This repository contains the openvla_1k-dataset, which is the training dataset used in the paper "Don't Blind Your VLA: Aligning Visual Representations for OOD Generalization". The dataset consists of 1.4k episodes collected with Octo-Small and a motion planner, used to warm up pretrained OpenVLA and fine-tune Vision-Language-Action (VLA) models. It is crucial for methods like Visual Representation… See the full description on the dataset page: https://huggingface.co/datasets/tttonyalpha/openvla_1k-dataset.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created using LeRobot.
Dataset Structure
meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 50, "total_frames": 2524, "total_tasks": 1, "chunks_size": 1000, "fps": 10, "splits": { "train": "0:50" }, "data_path": "data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet", "video_path": "videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4", "features": {… See the full description on the dataset page: https://huggingface.co/datasets/yananchen/robosuite_lift.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Maniskill Sub-Dataset in RLDS Format used in RPD
This repo contains the maniskill subset in RLDS format used to train Octo and OpenVLA in the Refined Policy Distillation paper which distilled these models using RL. Checkout octo-base-1.5-finetuned-maniskill and openvla-7b-finetuned-maniskill which have been fine-tuned with that dataset.
Citation
If you find RPD useful for your work, please consider citing it: @inproceedings{juelg2025refinedpolicydistillationvla… See the full description on the dataset page: https://huggingface.co/datasets/Juelg/RPD-maniskill.
Facebook
TwitterTREAD Extracted Data Annotations
Project page: https://akuramshin.github.io/tread This repository contains data annotations extracted using TREAD from three different datasets.
Datasets
We used TREAD to extract data annotations from three datasets:
libero_90 and libero_10: Modified versions of the LIBERO datasets used in the OpenVLA fine-tuning experiments. bridge: A version of BridgeData V2.
File Naming Convention
For all files, the keys are in the form… See the full description on the dataset page: https://huggingface.co/datasets/real-lab/tread_annotations.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
Twittermichelledai/openvla-data dataset hosted on Hugging Face and contributed by the HF Datasets community