ColossalAI/colossalai/nn/optimizer
HELSON a9b8300d54
[zero] improve adaptability for not-shard parameters (#708)
* adapt post grad hooks for not-shard parameters
* adapt optimizer for not-shard parameters
* offload gradients for not-replicated parameters
2022-04-11 13:38:51 +08:00
..
__init__.py fix bugs in CPU adam (#633) 2022-04-02 17:04:05 +08:00
colossalai_optimizer.py
cpu_adam.py [zero] improve adaptability for not-shard parameters (#708) 2022-04-11 13:38:51 +08:00
fused_adam.py polish optimizer docstring (#619) 2022-04-01 16:27:03 +08:00
fused_lamb.py polish optimizer docstring (#619) 2022-04-01 16:27:03 +08:00
fused_sgd.py polish optimizer docstring (#619) 2022-04-01 16:27:03 +08:00
hybrid_adam.py fix bugs in CPU adam (#633) 2022-04-02 17:04:05 +08:00
lamb.py
lars.py
utils.py fix bugs in CPU adam (#633) 2022-04-02 17:04:05 +08:00