FlashQLA is a high-performance linear attention kernel library built on TileLang that achieves 2-3×...

Tokens:10,917
Snippets:77
Trust Score:9.1
License:MIT
Update:2 months ago
Tokens:
Raw