KV Cache Size Calculator Model family Model Tokens per sequence Sequences KV precision Indexer precision Include draft KV cache Include linear-attention state Total cache size -- = -- GB -- -- = -- Source: --