Mooncake is a KVCache-centric disaggregated architecture for LLM serving that separates prefill and...

Tokens:278,394
Snippets:2,895
Trust Score:6.5
License:Apache-2.0
Update:6 days ago
Tokens:
Raw