ColossalAI/colossalai/utils
ver217 96780e6ee4
Optimize pipeline schedule (#94)
* add pipeline shared module wrapper and update load batch

* added model parallel process group for amp and clip grad (#86)

* added model parallel process group for amp and clip grad

* update amp and clip with model parallel process group

* remove pipeline_prev/next group (#88)

* micro batch offload

* optimize pipeline gpu memory usage

* pipeline can receive tensor shape (#93)

* optimize pipeline gpu memory usage

* fix grad accumulation step counter

* rename classes and functions

Co-authored-by: Frank Lee <somerlee.9@gmail.com>
2021-12-30 15:56:46 +08:00
data_sampler update examples and sphnix docs for the new api (#63) 2021-12-13 22:07:01 +08:00
gradient_accumulation Optimize pipeline schedule (#94) 2021-12-30 15:56:46 +08:00
multi_tensor_apply update examples and sphnix docs for the new api (#63) 2021-12-13 22:07:01 +08:00
__init__.py Hotfix/Colossalai layers (#92) 2021-12-29 23:32:10 +08:00
activation_checkpoint.py Migrated project 2021-10-28 18:21:23 +02:00
checkpointing.py Support TP-compatible Torch AMP and Update trainer API (#27) 2021-11-18 19:45:06 +08:00
common.py Optimize pipeline schedule (#94) 2021-12-30 15:56:46 +08:00
cuda.py Migrated project 2021-10-28 18:21:23 +02:00
memory.py Layer integration (#83) 2021-12-27 15:04:32 +08:00
timer.py update examples and sphnix docs for the new api (#63) 2021-12-13 22:07:01 +08:00