Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind

Jan 31, 2026ยท
Lance Ying
,
Xinyi Li
,
Shivam Aarya
,
Yizirui Fang
,
Yifan Yin
,
Jason Xinyu Liu
,
Stefanie Tellex
,
Joshua B. Tenenbaum
,
Tianmin Shu
ยท 1 min read
Type
Publication
IEEE International Conference on Robotics and Automation (ICRA 2026)

Formerly listed as SIFToM, this paper models spoken instruction following as goal inference under uncertainty. For applied scientist review, this ICRA 2026 paper is most relevant as embodied AI evidence: it connects language, perception, Bayesian inverse planning, and behavior evaluation in simulated and real-world robot task settings.