ColossalAI/colossalai/amp/naive_amp
HELSON 077a5cdde4 [zero] fix gradient clipping in hybrid parallelism (#2521) 2 years ago
grad_scaler [zero] fix gradient clipping in hybrid parallelism (#2521) 2 years ago
__init__.py
_fp16_optimizer.py [setup] support pre-build and jit-build of cuda kernels (#2374) 2 years ago
_utils.py
naive_amp.py [amp] add gradient clipping for unit tests (#2283) 2 years ago