<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Bayesian Inference | Yizirui Fang</title><link>http://yiziruifang.com/tags/bayesian-inference/</link><atom:link href="http://yiziruifang.com/tags/bayesian-inference/index.xml" rel="self" type="application/rss+xml"/><description>Bayesian Inference</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Sat, 31 Jan 2026 00:00:00 +0000</lastBuildDate><image><url>http://yiziruifang.com/media/icon_hu7729264130191091259.png</url><title>Bayesian Inference</title><link>http://yiziruifang.com/tags/bayesian-inference/</link></image><item><title>Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind</title><link>http://yiziruifang.com/project/siftom/</link><pubDate>Sat, 31 Jan 2026 00:00:00 +0000</pubDate><guid>http://yiziruifang.com/project/siftom/</guid><description>&lt;h2 id="hiring-manager-view">Hiring-manager view&lt;/h2>
&lt;p>This project, formerly listed as SIFToM, connects language, perception, and decision-making in embodied AI. It is a useful signal for applied scientist roles that require modeling user intent, reasoning under uncertainty, and evaluating behavior in simulated and real-world settings.&lt;/p>
&lt;h2 id="scientific-problem">Scientific problem&lt;/h2>
&lt;p>Robots need to infer intended goals from spoken instructions even when the acoustic signal is noisy or ambiguous. The challenge is to model both what the speaker said and what a human listener likely perceived.&lt;/p>
&lt;h2 id="method">Method&lt;/h2>
&lt;ul>
&lt;li>Contributed to a theory-of-mind formulation for pragmatic spoken instruction following.&lt;/li>
&lt;li>Used vision-language modeling and model-based mental inference to infer robot goals from speech-related evidence and task context.&lt;/li>
&lt;li>Evaluated the approach on simulated and real-world embodied task data.&lt;/li>
&lt;/ul>
&lt;h2 id="evaluation-signal">Evaluation signal&lt;/h2>
&lt;p>The central evaluation question is whether the model improves goal inference and instruction-following robustness when speech, perception, and intent are uncertain.&lt;/p></description></item></channel></rss>