llama.cpp is a C/C++ implementation that enables efficient LLM inference with minimal setup on a...

Tokens:145,276
Snippets:968
Trust Score:8.8
License:MIT
Update:1 month ago
Tokens:
Raw