<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Uncertainty Quantification | Yizirui Fang</title><link>http://yiziruifang.com/tags/uncertainty-quantification/</link><atom:link href="http://yiziruifang.com/tags/uncertainty-quantification/index.xml" rel="self" type="application/rss+xml"/><description>Uncertainty Quantification</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Sun, 01 Feb 2026 00:00:00 +0000</lastBuildDate><image><url>http://yiziruifang.com/media/icon_hu7729264130191091259.png</url><title>Uncertainty Quantification</title><link>http://yiziruifang.com/tags/uncertainty-quantification/</link></image><item><title>Learning to Defer with an Uncertain Rejector via Conformal Prediction</title><link>http://yiziruifang.com/project/uncertain-defer/</link><pubDate>Sun, 01 Feb 2026 00:00:00 +0000</pubDate><guid>http://yiziruifang.com/project/uncertain-defer/</guid><description>&lt;h2 id="abstract-level-summary">Abstract-level summary&lt;/h2>
&lt;p>Learning to defer routes each input to either a machine learning model or a human expert. This paper studies a failure mode in that routing layer: the rejector can itself be misspecified, poorly calibrated, or brittle under shift. We apply conformal prediction to the rejector so it can express uncertainty through deferral sets instead of returning only a hard defer-or-predict decision.&lt;/p>
&lt;p>The resulting system can take safer fallback actions when the rejector is uncertain, including abstaining, checking consensus between the model and expert, preferring the model when the human route is uncertain and cost matters, or preferring the human under distribution shift.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="flex justify-center ">
&lt;div class="w-100" >&lt;img alt="Paper abstract page" srcset="
/project/uncertain-defer/abstract-page_hu7756762487034974405.webp 400w,
/project/uncertain-defer/abstract-page_hu16459656346013309870.webp 760w,
/project/uncertain-defer/abstract-page_hu16268632555376845453.webp 1200w"
src="http://yiziruifang.com/project/uncertain-defer/abstract-page_hu7756762487034974405.webp"
width="587"
height="760"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;h2 id="core-idea">Core idea&lt;/h2>
&lt;p>The standard learning-to-defer workflow depends on a rejector that chooses between the model and the expert. Instead of treating that rejector decision as certain, the paper constructs conformal deferral sets over whether the expert is expected to be correct. A singleton set supports an ordinary routing decision; an uncertain set unlocks safer workflows.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="flex justify-center ">
&lt;div class="w-100" >&lt;img alt="Deferral workflows" srcset="
/project/uncertain-defer/figure-1-workflows_hu17319325364120452081.webp 400w,
/project/uncertain-defer/figure-1-workflows_hu13112766801284617044.webp 760w,
/project/uncertain-defer/figure-1-workflows_hu11458883716961407415.webp 1200w"
src="http://yiziruifang.com/project/uncertain-defer/figure-1-workflows_hu17319325364120452081.webp"
width="483"
height="150"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;h2 id="method">Method&lt;/h2>
&lt;ul>
&lt;li>Formulated uncertainty quantification for the rejector in learning-to-defer systems.&lt;/li>
&lt;li>Applied split conformal prediction to construct deferral sets with coverage behavior on expert correctness.&lt;/li>
&lt;li>Evaluated both one-vs-all and asymmetric-softmax rejector parameterizations.&lt;/li>
&lt;li>Tested abstention, consensus prediction, human-preferred routing, and model-preferred routing workflows.&lt;/li>
&lt;li>Ran experiments across CIFAR-10, HAM10000, and Hate Speech settings, including distribution-shift stress tests.&lt;/li>
&lt;/ul>
&lt;h2 id="main-tables">Main tables&lt;/h2>
&lt;p>The first table shows that conformal rejectors can achieve the target coverage level while keeping deferral sets compact across image and text classification tasks.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="flex justify-center ">
&lt;div class="w-100" >&lt;img alt="Coverage and efficiency table" srcset="
/project/uncertain-defer/table-1-coverage-efficiency_hu12242869307950842943.webp 400w,
/project/uncertain-defer/table-1-coverage-efficiency_hu7003174801164777875.webp 760w,
/project/uncertain-defer/table-1-coverage-efficiency_hu2788259301618695362.webp 1200w"
src="http://yiziruifang.com/project/uncertain-defer/table-1-coverage-efficiency_hu12242869307950842943.webp"
width="430"
height="170"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;p>The second table compares abstention and consensus workflows. The key tradeoff is safety versus availability: abstention improves reliability by withholding uncertain decisions, while consensus asks both the model and expert when routing is ambiguous.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="flex justify-center ">
&lt;div class="w-100" >&lt;img alt="Abstention and consensus table" srcset="
/project/uncertain-defer/table-2-abstention-consensus_hu5317323060698938542.webp 400w,
/project/uncertain-defer/table-2-abstention-consensus_hu12547259816853294873.webp 760w,
/project/uncertain-defer/table-2-abstention-consensus_hu3452052353993668122.webp 1200w"
src="http://yiziruifang.com/project/uncertain-defer/table-2-abstention-consensus_hu5317323060698938542.webp"
width="445"
height="205"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;h2 id="distribution-shift">Distribution shift&lt;/h2>
&lt;p>Under covariate shift, the conformal workflows expose increasing rejector uncertainty through higher deferral or abstention behavior. This is useful because the model can avoid confidently routing examples when the deferral decision is unreliable.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="flex justify-center ">
&lt;div class="w-100" >&lt;img alt="OOD deferral behavior" srcset="
/project/uncertain-defer/figure-2-ood-shift_hu303299267271004511.webp 400w,
/project/uncertain-defer/figure-2-ood-shift_hu3042856808072502516.webp 760w,
/project/uncertain-defer/figure-2-ood-shift_hu14763257674263760524.webp 1200w"
src="http://yiziruifang.com/project/uncertain-defer/figure-2-ood-shift_hu303299267271004511.webp"
width="500"
height="420"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;h2 id="accuracy-coverage-tradeoff">Accuracy-coverage tradeoff&lt;/h2>
&lt;p>The final comparison plots non-abstention accuracy against how often the system defers. The useful region is where the method improves safety or accuracy without pushing nearly all examples to the human expert.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="flex justify-center ">
&lt;div class="w-100" >&lt;img alt="Accuracy coverage comparison" srcset="
/project/uncertain-defer/figure-3-accuracy-coverage_hu11687330343579666114.webp 400w,
/project/uncertain-defer/figure-3-accuracy-coverage_hu10195823359800968283.webp 760w,
/project/uncertain-defer/figure-3-accuracy-coverage_hu14506204719640219977.webp 1200w"
src="http://yiziruifang.com/project/uncertain-defer/figure-3-accuracy-coverage_hu11687330343579666114.webp"
width="500"
height="595"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;h2 id="why-it-matters">Why it matters&lt;/h2>
&lt;p>This project is a human-in-the-loop ML signal: it turns the human/model routing decision into an uncertainty-aware component with measurable coverage, calibration, and robustness properties. For applied scientist review, the strongest evidence is the connection between a practical system failure mode, a distribution-free uncertainty method, and experiments that evaluate behavior under realistic shift.&lt;/p></description></item><item><title>Investigating Data Usage for Inductive Conformal Predictors</title><link>http://yiziruifang.com/project/conformal-predictors/</link><pubDate>Sat, 01 Jun 2024 00:00:00 +0000</pubDate><guid>http://yiziruifang.com/project/conformal-predictors/</guid><description>&lt;h2 id="hiring-manager-view">Hiring-manager view&lt;/h2>
&lt;p>This paper is a direct uncertainty-quantification signal: it studies the data and calibration choices behind conformal prediction rather than treating uncertainty estimates as a black box.&lt;/p>
&lt;h2 id="scientific-problem">Scientific problem&lt;/h2>
&lt;p>Inductive conformal predictors rely on data splits and calibration sets to produce uncertainty-aware prediction sets. The practical question is how data usage decisions affect validity, efficiency, and downstream model behavior.&lt;/p>
&lt;h2 id="method">Method&lt;/h2>
&lt;ul>
&lt;li>Investigated how data allocation choices influence inductive conformal prediction.&lt;/li>
&lt;li>Focused on calibration behavior, data efficiency, and prediction-set quality.&lt;/li>
&lt;li>Connected empirical model behavior to uncertainty guarantees relevant to safety-sensitive ML systems.&lt;/li>
&lt;/ul>
&lt;h2 id="evaluation-signal">Evaluation signal&lt;/h2>
&lt;p>The evaluation centers on how calibration and data usage choices change uncertainty quality and reliability, especially when model outputs must support downstream decisions.&lt;/p></description></item></channel></rss>