vLLM-MLX brings native Apple Silicon GPU acceleration to vLLM with support for multimodal inference...

Tokens:54,499
Snippets:616
Trust Score:9.4
License:Apache-2.0
Update:1 month ago
Tokens:
Raw