Maximal Update Parametrization (μP) and Hyperparameter Transfer (μTransfer)
This project provides a tool that enables maximal update parametrization (μP) in PyTorch models,...
Tokens:10,314
Snippets:99
Trust Score:10
License:MIT
Update:2 days ago
Tokens: