trlX is a distributed training framework for fine-tuning large language models with reinforcement...

Tokens:10,592
Snippets:78
Trust Score:7.3
License:MIT
Update:1 year ago
Tokens:
Raw