ColossalAI/colossalai/zero
HELSON a1ce02d740
[zero] test gradient accumulation (#1964)
* [zero] fix memory leak for zero2

* [zero] test gradient accumulation

* [zero] remove grad clip test
2022-11-29 13:00:30 +08:00
init_ctx       [moe] fix MoE bugs (#1628)                                                 2022-09-22 13:56:30 +08:00
shard_utils    [gemini] add GeminiMemoryManger (#832)                                     2022-04-24 13:08:48 +08:00
sharded_model  [Gemini] polish memstats collector (#1962)                                 2022-11-16 15:45:57 +08:00
sharded_optim  [zero] test gradient accumulation (#1964)                                  2022-11-29 13:00:30 +08:00
sharded_param  [NFC] polish colossalai/zero/sharded_param/__init__.py code style (#1717)  2022-10-19 12:20:51 +08:00
utils          [Gemini] ZeROHookV2 -> GeminiZeROHook (#1972)                              2022-11-17 14:43:49 +08:00
__init__.py    [Gemini] add GeminiAdamOptimizer (#1960)                                   2022-11-16 14:44:28 +08:00