HELSON | 077a5cdde4 | [zero] fix gradient clipping in hybrid parallelism (#2521) | 2023-01-29 15:09:57 +08:00
    * [zero] fix gradient clipping in hybrid parallelism
    * [testing] change model name to avoid pytest warning
    * [hotfix] fix unit testing
Ziheng Qin | 3041014089 | [NFC] polish colossalai/amp/naive_amp/grad_scaler/dynamic_grad_scaler.py code style (#2299) | 2023-01-04 15:09:57 +08:00
    Co-authored-by: henryqin1997 <henryqin1997@gamil.com>
zbian | 1559a09fb7 | [NFC] polish amp.naive_amp.grad_scaler code style | 2022-11-09 13:38:15 +08:00
Frank Lee | 9fdebadd69 | [doc] improved docstring in the amp module (#857) | 2022-04-25 13:42:17 +08:00
HELSON | 4c4388c46e | [hotfix] fix memory leak in zero (#781) | 2022-04-18 13:57:03 +08:00
Frank Lee | e79ea44247 | [fp16] refactored fp16 optimizer (#392) | 2022-03-15 10:05:38 +08:00
Frank Lee | 3d5d64bd10 | refactored grad scaler (#338) | 2022-03-11 15:50:28 +08:00