Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

License: Apache-2.0