mirror of https://github.com/hpcaitech/ColossalAI
Latest commit:

* add pipeline shared module wrapper and update load batch
* added model parallel process group for amp and clip grad (#86)
* update amp and clip with model parallel process group
* remove pipeline_prev/next group (#88)
* micro batch offload
* optimize pipeline gpu memory usage
* pipeline can receive tensor shape (#93)
* fix grad accumulation step counter
* rename classes and functions

Co-authored-by: Frank Lee <somerlee.9@gmail.com>
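For context, the grad-accumulation fix noted above targets a standard training pattern: summing gradients over several micro-batches before one optimizer step. A minimal sketch in plain PyTorch follows; `train_with_accumulation`, `accum_steps`, and the loader are hypothetical stand-ins for illustration, not ColossalAI's actual API.

```python
import torch

def train_with_accumulation(model, loader, optimizer, accum_steps=4):
    """Minimal gradient-accumulation loop in plain PyTorch (illustrative,
    not ColossalAI's implementation).

    Gradients from `accum_steps` micro-batches are summed before a single
    optimizer step, emulating a larger effective batch size.
    """
    model.train()
    optimizer.zero_grad()
    for step, (inputs, targets) in enumerate(loader):
        loss = torch.nn.functional.cross_entropy(model(inputs), targets)
        # Scale the loss so the accumulated gradient matches one large batch.
        (loss / accum_steps).backward()
        if (step + 1) % accum_steps == 0:
            optimizer.step()
            optimizer.zero_grad()
```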
Directory listing:

* `data_sampler/`
* `gradient_accumulation/`
* `multi_tensor_apply/`
* `__init__.py`
* `activation_checkpoint.py`
* `checkpointing.py`
* `common.py`
* `cuda.py`
* `memory.py`
* `timer.py`
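Among the files listed, `activation_checkpoint.py` relates to activation checkpointing: trading extra compute for memory by recomputing a block's intermediate activations during the backward pass instead of storing them. A conceptual sketch using vanilla PyTorch's `torch.utils.checkpoint` (an assumption for illustration only, not ColossalAI's wrapper):

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

class CheckpointedMLP(nn.Module):
    """Two-layer MLP whose hidden activations are recomputed on backward.

    Sketch of the activation-checkpointing idea via plain PyTorch; the
    class name and sizes are hypothetical.
    """

    def __init__(self, dim=1024):
        super().__init__()
        self.block = nn.Sequential(
            nn.Linear(dim, dim), nn.GELU(), nn.Linear(dim, dim)
        )

    def forward(self, x):
        # checkpoint() discards the block's intermediate activations after
        # the forward pass and recomputes them during backward.
        return checkpoint(self.block, x, use_reentrant=False)

x = torch.randn(8, 1024, requires_grad=True)
CheckpointedMLP()(x).sum().backward()
```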