ColossalAI

History

Baizhou Zhang 21ba89cab6 [gemini] support gradient accumulation (#4869 ) * add test * fix no_sync bug in low level zero plugin * fix test * add argument for grad accum * add grad accum in backward hook for gemini * finish implementation, rewrite tests * fix test * skip stuck model in low level zero test * update doc * optimize communication & fix gradient checkpoint * modify doc * cleaning codes * update cpu adam fp16 case		2023-10-17 14:07:21 +08:00
..
mixed_precision	[misc] update pre-commit and run all files (#4752 )	2023-09-19 14:20:26 +08:00
plugin	[gemini] support gradient accumulation (#4869 )	2023-10-17 14:07:21 +08:00
__init__.py	[booster] implemented the torch ddd + resnet example (#3232 )	2023-03-27 10:24:14 +08:00
accelerator.py	[misc] update pre-commit and run all files (#4752 )	2023-09-19 14:20:26 +08:00
booster.py	[lazy] support from_pretrained (#4801 )	2023-09-26 11:04:11 +08:00