RotorQuant (scrya-com/rotorquant) | Context7

InstallDocsPricing

More...

RotorQuant

https://github.com/scrya-com/rotorquant

RotorQuant is a KV cache compression technique for large language models that uses block-diagonal...

Tokens:39,789

Snippets:184

Trust Score:5.7

Update:2 months ago

Show doc for...

Tokens: