From gesture to intent, our latest work shows how advanced keypoint detection (body + hands) can unlock powerful, real-time action recognition.
We’re pushing the boundaries with both data-driven models and zero-shot approaches, scaling from a handful of core actions to a richer set of human behaviors without always needing new training data.
This is a glimpse into more adaptive, intelligent systems that understand people naturally and is vital for mass robot adoption.