Mar 31, 2026
prefill, decode, TTFT

Prefill vs. Decode: Why Long Context Makes Your AI Slow Before It Even Starts Talking

Before the AI generates its first word, it must process EVERY token in the context. Here's why time-to-first-token (TTFT) increases with context length.
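
To make the claim concrete, here is a minimal back-of-the-envelope sketch in Python. It assumes prefill runs at a fixed number of tokens per second, which is a simplification: the function name `estimate_ttft_seconds` and the rate `ASSUMED_PREFILL_RATE` are hypothetical, not measurements from any real model or serving stack. In standard transformers, attention cost grows faster than linearly with context length, so real numbers will likely scale worse than this.

```python
def estimate_ttft_seconds(context_tokens: int, prefill_tokens_per_second: float) -> float:
    """Rough time-to-first-token estimate under a constant-throughput assumption.

    The model must process every context token (prefill) before it can emit
    the first output token, so the wait grows with context length.
    """
    if context_tokens < 0:
        raise ValueError("context_tokens must be non-negative")
    if prefill_tokens_per_second <= 0:
        raise ValueError("prefill_tokens_per_second must be positive")
    return context_tokens / prefill_tokens_per_second


# Hypothetical prefill throughput, chosen only for illustration.
ASSUMED_PREFILL_RATE = 5_000.0  # tokens per second

if __name__ == "__main__":
    for n in (1_000, 10_000, 100_000):
        ttft = estimate_ttft_seconds(n, ASSUMED_PREFILL_RATE)
        print(f"{n:>7,} context tokens -> ~{ttft:.1f} s before the first output token")
```

Even under this optimistic linear assumption, a 100x longer context means a 100x longer wait before the first word appears.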