We are now open-sourcing the AxisDataCleaning pipeline.
Github repo:
Browser teleoperation is one of the most scalable paths for robot data generation. Raw human input, however, is not yet model-ready:
▪️ Idle pauses
▪️ Micro-jitters
▪️ Low & Variable frame rates
Raw web data alone is not enough for reliable policy training.
Here is how our backend turns noisy human demonstrations into usable trajectories for downstream policy training. 🧵👇