LayerSkip enables early exit inference and self-speculative decoding for large language models to...

Tokens:5,585
Snippets:78
Trust Score:9.5
Update:1 month ago
Tokens:
Raw