Nunchaku is a high-performance inference engine optimized for low-bit neural networks that achieves...

Tokens: 37,279
Snippets: 321
Trust Score: 7.6
License: Apache-2.0
Update: 2 months ago