Crowdsourced egocentric video for robotics imitation learning.
RoboX is building a diverse first-person video dataset for training the next generation of robots. Contributors around the world record short clips through the RoboX mobile app, capturing how humans grasp, move, navigate, and interact with everyday environments. Each clip flows through an automated annotation pipeline that produces hand keypoints, object tracks, action segments, sensor data, and spatial context, ready for imitation learning research.
| Dataset | Description | Clips |
|---|---|---|
| RoboX-EgoGrasp-v0.1 | Single grasp actions on everyday objects | 10 sample / 1,800+ full |
| RoboX-Egocentric-Collection-v0.2 | Combined collection across four campaigns: EgoGrasp, EgoDaily, EgoScene, EgoNav | 7,342 episodes |
Sample clips are published openly on Hugging Face. Full dataset access is available on request for research and commercial robotics teams.
Visit robox.to to request access or learn more.
All RoboX datasets are released under CC-BY-NC-4.0 for research and non-commercial use. For commercial licensing, contact the RoboX team via robox.to.