Running on CPU Upgrade Featured 2.75k The Smol Training Playbook ๐ 2.75k The secrets to building world-class LLMs
view reply Hello, the KV cache memory requirements in FP16 of 405B, 0.984 GB 15.38 GB 123.05 GB, these three values look like from FP32, could you double-check it? And how to get the KV cache memory when I get a new LLM?