ColossalAI/colossalai/amp/naive_amp/grad_scaler
HELSON 077a5cdde4 [zero] fix gradient clipping in hybrid parallelism (#2521) 2 years ago
__init__.py
base_grad_scaler.py [NFC] polish amp.naive_amp.grad_scaler code style 2 years ago
constant_grad_scaler.py [doc] improved docstring in the amp module (#857) 3 years ago
dynamic_grad_scaler.py [zero] fix gradient clipping in hybrid parallelism (#2521) 2 years ago