API Latency Debugging: Step-by-Step
Thu Aug 14 2025
Laggy endpoints hurt UX. Diagnose precisely.
1. Measure Baseline
Collect:
- P50 / P95 / P99 latency
- Error rate
- Request rate
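A minimal nearest-rank percentile sketch for a window of collected durations; the sample values and names below are illustrative, and in practice you would pull these numbers from your metrics backend:

    // Nearest-rank percentile over a window of request durations (ms).
    function percentile(durations: number[], p: number): number {
      const sorted = [...durations].sort((a, b) => a - b);
      const idx = Math.ceil((p / 100) * sorted.length) - 1;
      return sorted[Math.min(sorted.length - 1, Math.max(0, idx))];
    }

    const samples = [42, 55, 61, 80, 95, 120, 310, 1200]; // example window
    console.log({
      p50: percentile(samples, 50),
      p95: percentile(samples, 95),
      p99: percentile(samples, 99),
    });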
2. Break Down Timing
Enable server timing or instrumentation:
DB=40ms | cache=5ms | render=60ms | network=20ms
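One way to surface that breakdown is the standard Server-Timing response header. Here is a sketch using Express; getUserFromDb and renderPage are hypothetical stand-ins for your own handler code:

    import express from "express";

    declare function getUserFromDb(id: string): Promise<unknown>;
    declare function renderPage(user: unknown): string;

    const app = express();

    app.get("/profile", async (req, res) => {
      const t0 = performance.now();
      const user = await getUserFromDb(req.query.id as string);
      const tDb = performance.now();
      const html = renderPage(user);
      const tRender = performance.now();

      // Browsers show these phases in DevTools under the request's timing tab.
      res.setHeader(
        "Server-Timing",
        `db;dur=${(tDb - t0).toFixed(1)}, render;dur=${(tRender - tDb).toFixed(1)}`
      );
      res.send(html);
    });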
3. Check the Critical Path
Replace sequential awaits that could run in parallel with Promise.all.
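A before/after sketch of that change; fetchUser, fetchOrders, and fetchPrefs are hypothetical, independent fetchers:

    declare function fetchUser(id: string): Promise<unknown>;
    declare function fetchOrders(id: string): Promise<unknown>;
    declare function fetchPrefs(id: string): Promise<unknown>;

    // Before: each await waits for the previous one, so the latencies add up.
    async function loadSequential(id: string) {
      const user = await fetchUser(id);
      const orders = await fetchOrders(id);
      const prefs = await fetchPrefs(id);
      return { user, orders, prefs };
    }

    // After: independent calls start together; total time is roughly the slowest call.
    async function loadParallel(id: string) {
      const [user, orders, prefs] = await Promise.all([
        fetchUser(id),
        fetchOrders(id),
        fetchPrefs(id),
      ]);
      return { user, orders, prefs };
    }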
4. Database Profiling
Log queries slower than 100 ms. Run EXPLAIN on the outliers.
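A sketch of a slow-query logger wrapped around a hypothetical db.query(); the 100 ms threshold matches the rule above:

    const SLOW_QUERY_MS = 100;

    async function timedQuery<T>(
      db: { query: (sql: string, params?: unknown[]) => Promise<T> },
      sql: string,
      params?: unknown[]
    ): Promise<T> {
      const start = performance.now();
      try {
        return await db.query(sql, params);
      } finally {
        const elapsed = performance.now() - start;
        if (elapsed > SLOW_QUERY_MS) {
          // Candidate for EXPLAIN: log the statement and how long it took.
          console.warn(`slow query (${elapsed.toFixed(1)} ms): ${sql}`);
        }
      }
    }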
5. Cache Strategy
- Layer 1: in-memory (hot keys)
- Layer 2: Redis for data shared across instances
- Invalidate on write, or let entries expire via TTL
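A sketch of that two-layer lookup, assuming the ioredis client (any Redis client works) and an illustrative 60-second TTL:

    import Redis from "ioredis";

    const redis = new Redis();
    const local = new Map<string, { value: string; expires: number }>();
    const TTL_SECONDS = 60;

    // Layer 1: process-local hot keys. Layer 2: shared Redis. Fall back to the source.
    async function getCached(key: string, load: () => Promise<string>): Promise<string> {
      const hot = local.get(key);
      if (hot && hot.expires > Date.now()) return hot.value;

      const shared = await redis.get(key);
      if (shared !== null) {
        local.set(key, { value: shared, expires: Date.now() + TTL_SECONDS * 1000 });
        return shared;
      }

      const fresh = await load();
      await redis.set(key, fresh, "EX", TTL_SECONDS); // TTL-based expiry
      local.set(key, { value: fresh, expires: Date.now() + TTL_SECONDS * 1000 });
      return fresh;
    }

    // On write: invalidate both layers.
    async function invalidate(key: string): Promise<void> {
      local.delete(key);
      await redis.del(key);
    }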
6. N+1 Detection
Look for many small similar queries. Batch or prefetch.
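The pattern to spot and its batched replacement, with a hypothetical db.query() and Order shape:

    declare const db: { query: (sql: string, params?: unknown[]) => Promise<any[]> };
    type Order = { customerId: number; customer?: unknown };

    // N+1 pattern: one extra query per order. This is what shows up as many small, similar queries.
    async function attachCustomersNPlusOne(orders: Order[]) {
      for (const order of orders) {
        const rows = await db.query("SELECT * FROM customers WHERE id = $1", [order.customerId]);
        order.customer = rows[0];
      }
    }

    // Batched: one query for all customers, joined in memory.
    async function attachCustomersBatched(orders: Order[]) {
      const ids = orders.map((o) => o.customerId);
      const rows = await db.query("SELECT * FROM customers WHERE id = ANY($1)", [ids]);
      const byId = new Map(rows.map((r) => [r.id, r]));
      for (const order of orders) order.customer = byId.get(order.customerId);
    }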
7. External Calls
Wrap in timeout + circuit breaker. Measure each provider’s latency separately.
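A minimal sketch of a per-provider timeout plus a failure-count circuit breaker (Node 18+ fetch and AbortSignal.timeout; the URL, 2-second cap, and threshold are illustrative). A production breaker would also half-open after a cooldown:

    let consecutiveFailures = 0;
    const BREAKER_THRESHOLD = 5;

    async function callProvider(providerUrl: string): Promise<unknown> {
      if (consecutiveFailures >= BREAKER_THRESHOLD) {
        throw new Error("circuit open: skipping provider call");
      }
      const start = performance.now();
      try {
        const res = await fetch(providerUrl, { signal: AbortSignal.timeout(2000) }); // 2 s cap
        consecutiveFailures = 0;
        return await res.json();
      } catch (err) {
        consecutiveFailures++;
        throw err;
      } finally {
        // Per-provider latency, measured separately from your own handler time.
        console.log(`provider latency: ${(performance.now() - start).toFixed(1)} ms`);
      }
    }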
8. Payload Size
Check response size in bytes. Remove unused JSON fields and compress selectively.
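One possible setup with Express and the compression middleware package, compressing only responses above roughly 1 KB so tiny payloads skip the CPU cost:

    import express from "express";
    import compression from "compression";

    const app = express();
    app.use(compression({ threshold: 1024 })); // compress only larger responses

    app.get("/report", (_req, res) => {
      // Send only the fields the client actually uses, not the full internal object.
      res.json({ id: 1, total: 42.5 });
    });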
9. Re-Test
Compare new latency histograms. Document improvements.
10. Guardrails
Add SLO alert (e.g., P95 > 300ms for 5m) to catch regressions.
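A sketch of that P95-over-5-minutes check in application code; in practice this usually lives in your monitoring system rather than the service itself, and the values mirror the example above:

    const SLO_P95_MS = 300;
    const WINDOW_MS = 5 * 60 * 1000;

    // Returns false when the rolling P95 breaches the SLO, i.e. when an alert should fire.
    function sloHealthy(recent: { at: number; ms: number }[]): boolean {
      const cutoff = Date.now() - WINDOW_MS;
      const window = recent.filter((d) => d.at >= cutoff).map((d) => d.ms);
      if (window.length === 0) return true;
      const sorted = window.sort((a, b) => a - b);
      const p95 = sorted[Math.max(0, Math.ceil(0.95 * sorted.length) - 1)];
      return p95 <= SLO_P95_MS;
    }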
Consistent latency builds trust.