<p>When CI goes red, panic wastes cycles. Apply a triage routine.</p>
<h2 id="1-categorize-failure-type"><a href="#1-categorize-failure-type">1. Categorize Failure Type</a></h2>
<ul>
<li>Lint / Type</li>
<li>Unit test</li>
<li>Integration / e2e</li>
<li>Build / packaging</li>
<li>Flaky (non-deterministic)</li>
</ul>
<h2 id="2-reproduce-locally"><a href="#2-reproduce-locally">2. Reproduce Locally</a></h2>
<p>Run the exact script from logs. Match Node + dependency versions.</p>
<h2 id="3-minimize-noise"><a href="#3-minimize-noise">3. Minimize Noise</a></h2>
<p>Re-run failing test file in isolation. Use <code>--runInBand</code> if race suspected.</p>
<h2 id="4-check-recent-commits"><a href="#4-check-recent-commits">4. Check Recent Commits</a></h2>
<p><code>git log -n 5 --oneline</code> — look for risky changes (infra, deps, env).</p>
<h2 id="5-inspect-caches"><a href="#5-inspect-caches">5. Inspect Caches</a></h2>
<p>Out-of-date caches cause weirdness. Bust them: clear node_modules, build artifacts.</p>
<h2 id="6-for-flakes"><a href="#6-for-flakes">6. For Flakes</a></h2>
<ul>
<li>Add retries temporarily.</li>
<li>Increase timeouts only after profiling.</li>
<li>Capture artifacts (screenshots, logs) for failing runs.</li>
</ul>
<h2 id="7-root-cause-template"><a href="#7-root-cause-template">7. Root Cause Template</a></h2>
<pre class="shiki-wrapper"><div class="code-body"><code data-raw="U3ltcHRvbToKVHJpZ2dlciBjb21taXQ6ClJvb3QgY2F1c2U6CkZpeDoKU2FmZWd1YXJkICh0ZXN0IC8gbGludCAvIG1vbml0b3IpOgo="><span class="line"><span>Symptom:</span></span>
<span class="line"><span>Trigger commit:</span></span>
<span class="line"><span>Root cause:</span></span>
<span class="line"><span>Fix:</span></span>
<span class="line"><span>Safeguard (test / lint / monitor):</span></span>
<span class="line"><span></span></span></code></div><div class="code-actions"><button type="button" class="code-copy-btn inline-flex h-8 w-8 items-center justify-center rounded-md bg-background/70 border shadow-sm transition text-muted-foreground hover:text-foreground hover:bg-background focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-accent-500 dark:focus-visible:ring-accent-400 active:scale-95" title="Copy code" aria-label="Copy code"><span class="sr-only">Copy code</span><span class="icon icon-copy"><svg xmlns="http://www.w3.org/2000/svg" width="16" height="16" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><rect x="8" y="8" width="12" height="12" rx="2" ry="2"/><path d="M16 8V6a2 2 0 0 0-2-2H6a2 2 0 0 0-2 2v8a2 2 0 0 0 2 2h2"/></svg></span><span class="icon icon-check hidden"><svg xmlns="http://www.w3.org/2000/svg" width="16" height="16" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round"><path d="M20 6 9 17l-5-5"/></svg></span></button></div></pre>
<h2 id="8-add-a-guard"><a href="#8-add-a-guard">8. Add a Guard</a></h2>
<p>After fix, write a test to fail if regression recurs.</p>
<h2 id="9-communicate"><a href="#9-communicate">9. Communicate</a></h2>
<p>Post concise summary in team channel with impact and mitigation.</p>
<h2 id="10-continuous-improvement"><a href="#10-continuous-improvement">10. Continuous Improvement</a></h2>
<p>Track categories of failures monthly -> prioritize investments.</p>
<p>Stable CI accelerates shipping.</p>

CI Failure Triage: A Repeatable Playbook