view post Post 3915 I have just released a new blogpost about kv caching and its role in inference speedup ๐๐ https://huggingface.co/blog/not-lain/kv-caching/some takeaways : See translation 4 replies ยท ๐ฅ 8 8 ๐ค 3 3 + Reply