KVCache Estimator Implementation

Name: kvcache-estimator-impl
Rating: 65
Author: lycheenice

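When to use

• Read references/formulas-and-edge-cases.md to settle on a single formula and a consistent set of units.
• Implement config parsing for num_layers, hidden_size, num_attention_heads, and num_key_value_heads.
• Implement the estimator's main function, returning total, per-token, and per-layer figures.
• Add capacity inversion functions: given a GPU memory budget, solve for the max batch size or max sequence length.
• Output recommended actions: the estimated savings from GQA, quantization, and offload.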
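
The steps above could be sketched roughly as follows. This is a minimal illustration, not the contents of references/formulas-and-edge-cases.md: it assumes the commonly used formula of 2 (K and V) × num_layers × num_key_value_heads × head_dim × bits / 8 bytes per token, with head_dim = hidden_size / num_attention_heads. All function names, the `bits` parameter, and the returned dictionary keys are hypothetical. Offload savings are left out because they depend on details the reference file would need to supply.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelConfig:
    num_layers: int
    hidden_size: int
    num_attention_heads: int
    num_key_value_heads: int

    @property
    def head_dim(self) -> int:
        # Assumption: head_dim = hidden_size / num_attention_heads.
        return self.hidden_size // self.num_attention_heads


def parse_config(raw: dict) -> ModelConfig:
    """Build a ModelConfig from a plain dict, validating divisibility."""
    heads = raw["num_attention_heads"]
    # Assumption: a missing num_key_value_heads means plain multi-head attention.
    kv_heads = raw.get("num_key_value_heads") or heads
    if raw["hidden_size"] % heads != 0:
        raise ValueError("hidden_size must be divisible by num_attention_heads")
    if heads % kv_heads != 0:
        raise ValueError("num_attention_heads must be divisible by num_key_value_heads")
    return ModelConfig(raw["num_layers"], raw["hidden_size"], heads, kv_heads)


def per_token_bytes(cfg: ModelConfig, bits: int = 16) -> int:
    """Bytes cached for one token of one sequence, across all layers."""
    # The leading 2 accounts for one K and one V tensor per layer.
    return 2 * cfg.num_layers * cfg.num_key_value_heads * cfg.head_dim * bits // 8


def estimate(cfg: ModelConfig, batch: int, seq_len: int, bits: int = 16) -> dict:
    """Return total, per-token, and per-layer cache sizes in bytes."""
    per_token = per_token_bytes(cfg, bits)
    total = per_token * batch * seq_len
    return {
        "total_bytes": total,
        "per_token_bytes": per_token,
        "per_layer_bytes": total // cfg.num_layers,
    }


def max_seq_len(cfg: ModelConfig, budget_bytes: int, batch: int, bits: int = 16) -> int:
    """Longest sequence that fits the budget at a fixed batch size."""
    return budget_bytes // (per_token_bytes(cfg, bits) * batch)


def max_batch(cfg: ModelConfig, budget_bytes: int, seq_len: int, bits: int = 16) -> int:
    """Largest batch that fits the budget at a fixed sequence length."""
    return budget_bytes // (per_token_bytes(cfg, bits) * seq_len)


def reduction_estimates(cfg: ModelConfig, target_kv_heads: int,
                        target_bits: int, base_bits: int = 16) -> dict:
    """Fraction of cache memory saved by each action, applied on its own."""
    return {
        "gqa": 1 - target_kv_heads / cfg.num_key_value_heads,
        "quantization": 1 - target_bits / base_bits,
    }
```

Sizes are kept as integer byte counts so that the inversion functions can use floor division directly. The budget passed to `max_seq_len` and `max_batch` is assumed to be the memory left for the cache after weights and activations are accounted for.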