PowerInfer is a CPU/GPU hybrid LLM inference engine that leverages activation locality for fast...

Tokens:401,818
Snippets:3,981
Trust Score:5.4
License:MIT
Update:2 months ago
Tokens:
Raw