Contributions
- We propose a cross-embodiment system (supporting wheeled robots, quadrupeds, and humanoids) that incrementally builds a multi-layer representation of the environment, comprising room, viewpoint, and object layers, enabling the VLM to make more informed decisions during object navigation; a sketch of this representation follows the list.
- We design an efficient two-stage navigation policy on top of this representation, combining high-level planning guided by the VLM's reasoning with VLM-assisted low-level exploration; see the second sketch after the list.
- Beyond standard object navigation, our system enables conditional object navigation, such as navigation conditioned on object attributes or spatial relations, through queries over its multi-layer representation.
- We conduct extensive real-world evaluations, including three long-range tests spanning an entire building floor and 75 unit tests in multi-room environments (51 on wheeled robots, 18 on quadrupeds, and 6 on humanoids).
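
The bullets above describe a room/viewpoint/object hierarchy; the following is a minimal illustrative sketch of such a structure in Python. All class names, fields, and the `find` query helper are hypothetical placeholders invented here, not the system's actual API; they only show how a conditional lookup (e.g. filtering by an attribute) could be answered from the layered map.

```python
# Hypothetical sketch of the multi-layer map; names are illustrative only.
from dataclasses import dataclass, field

@dataclass
class ObjectNode:
    label: str                       # e.g. "mug"
    attributes: set[str]             # e.g. {"red", "ceramic"}
    position: tuple[float, float]    # 2D map coordinates

@dataclass
class ViewpointNode:
    pose: tuple[float, float, float]              # x, y, yaw of a reachable viewpoint
    objects: list[ObjectNode] = field(default_factory=list)

@dataclass
class RoomNode:
    name: str                                     # e.g. "kitchen"
    viewpoints: list[ViewpointNode] = field(default_factory=list)

@dataclass
class MultiLayerMap:
    rooms: list[RoomNode] = field(default_factory=list)

    def find(self, label: str, attribute: str | None = None):
        """Conditional lookup: yield (room, viewpoint, object) matches,
        optionally filtered by an attribute such as a color."""
        for room in self.rooms:
            for vp in room.viewpoints:
                for obj in vp.objects:
                    if obj.label == label and (attribute is None or attribute in obj.attributes):
                        yield room, vp, obj
```

For example, `list(scene_map.find("mug", "red"))` would return every (room, viewpoint, object) triple containing a red mug, which is the kind of query conditional object navigation needs.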
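
Similarly, one plausible reading of the two-stage policy is outlined below. The `vlm` and `robot` interfaces (`select_best`, `suggest_frontier`, `move_to`, and so on) are assumed placeholders, not the paper's actual components; the sketch only conveys the division of labor between VLM-guided high-level planning and VLM-assisted low-level exploration.

```python
# Hypothetical outline of the two-stage policy; `vlm` and `robot` are
# assumed interfaces invented for illustration.
def navigate(goal_query: str, scene: MultiLayerMap, vlm, robot) -> None:
    while not robot.reached(goal_query):
        matches = list(scene.find(goal_query))
        if matches:
            # High-level planning: the VLM reasons over room/viewpoint/object
            # candidates from the map and picks the most promising one.
            _, viewpoint, _ = vlm.select_best(matches, goal_query)
            robot.move_to(viewpoint.pose)
        else:
            # Low-level exploration with the VLM's assistance: the VLM
            # suggests which unexplored frontier is most likely to reveal
            # the queried object.
            frontier = vlm.suggest_frontier(scene, goal_query, robot.frontiers())
            robot.move_to(frontier)
        # The map's layers are extended incrementally as the robot moves.
        scene = robot.update_map(scene)
```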
[Figure: panels labeled "Wheeled Robot", "Quadruped", and "Humanoid".]