Injecting Language into the 3D World - Part II
Part II moves from spatial reasoning to embodied intelligence. We examine how large language models conditioned on 3D scene representations transition from passive understanding to active decision-making. The discussion focuses on 3D task planning, navigation, object manipulation, and safety constraints.







