A PyTorch implementation of Soft Mixture of Experts that enables efficient sparse routing for...

Tokens:25,167
Snippets:402
Trust Score:9.9
License:MIT
Update:1 month ago
Tokens:
Raw