Optimum Quanto is a PyTorch quantization backend that enables efficient model compression through...

Tokens:5,587
Snippets:93
Trust Score:9.6
License:Apache-2.0
Update:2 weeks ago
Tokens:
Raw