Mar 31, 2026 softmax attention dilution context rotSoftmax Attention and the Dilution Problem: The Math Behind Context RotAs context grows, softmax normalizes attention weights so each relevant token gets less attention. This mathematical property is why AI accuracy drops with length.