๊ฐ€์ž… ํ›„ ์ดˆ๋Œ€ ๋งํฌ๋ฅผ ๊ณต์œ ํ•˜๋ฉด ๋™์˜์ƒ ์žฌ์ƒ ๋ฐ ์ดˆ๋Œ€ ๋ณด์ƒ์„ ๋ฐ›์„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Ruohan Zhang
@RuohanZhang76
Incoming Assistant Professor @NorthwesternCS, Postdoc @StanfordSVL; robot, brain, art; soccer, cooking, dance
๊ฐ€์ž… September 2021
1.1K ํŒ”๋กœ์ž‰ ์ค‘    3.1K ํŒฌ
Excited to introduce StereoPolicy, led by @EvansXuHan. ๐Ÿ“ท๐Ÿ“ท๐Ÿค–StereoPolicy is an effective way to add geometric cues to modern robot policy models while keeping the strengths of pretrained 2D encoders. โ‰๏ธWhy stereo for robot manipulation? Monocular RGB often lacks the depth cues needed for precise manipulation, while RGB-D and point clouds can be noisy or brittle, especially on reflective and transparent objects in real-world deployment. Instead of explicitly reconstructing disparity, depth, or point clouds, StereoPolicy directly fuses synchronized left/right RGB views to learn implicit stereo cues, avoiding extra reconstruction latency that can make real-time manipulation difficult. Project Page:
๋” ๋ณด๊ธฐ