prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters

Tokens:126,962
Snippets:772
Trust Score:9.6
Update:1 year ago
Tokens:
Raw