A PyTorch implementation of Block Recurrent Transformer that enables long-range memory up to 60k...

Tokens:3,902
Snippets:25
Trust Score:9.9
License:MIT
Update:2 months ago
Tokens:
Raw