mirror of https://github.com/hpcaitech/ColossalAI
c6ab96983a
* refactor low level zero * fix zero2 and support cpu offload * avg gradient and modify unit test * refactor grad store, support layer drop * refactor bucket store, support grad accumulation * fix and update unit test of zero and ddp * compatible with tp, ga and unit test * fix memory leak and polish * add zero layer drop unittest * polish code * fix import err in unit test * support different comm dtype, modify docstring style * polish code * test padding and fix * fix unit test of low level zero * fix pad recording in bucket store * support some models * polish |

| File |
|---|
| test_dp_plugin_base.py |
| test_gemini_plugin.py |
| test_low_level_zero_plugin.py |
| test_torch_ddp_plugin.py |
| test_torch_fsdp_plugin.py |