๐Thrilled to release VLM^3! Most 3D vision papers nowadays still spend months/years designing complex archs/losses/augmentations for different tasks. Are they necessary? VLM^3 shows that most designs that you think are important for 3D vision are [not] important at all!