TurboQuant+ is an implementation of extreme KV cache compression for LLMs using PolarQuant and...

Tokens: 77,272
Snippets: 1,061
Trust Score: 9.1
License: Apache-2.0
Update: 2 months ago