FlashKDA is a high-performance CUDA kernel implementation of Kimi Delta Attention built on CUTLASS...

Tokens: 27,260
Snippets: 254
Trust Score: 8.4
License: MIT
Update: 2 weeks ago