mirror of https://github.com/hpcaitech/ColossalAI
![]() * add pipeline shared module wrapper and update load batch * added model parallel process group for amp and clip grad (#86) * added model parallel process group for amp and clip grad * update amp and clip with model parallel process group * remove pipeline_prev/next group (#88) * micro batch offload * optimize pipeline gpu memory usage * pipeline can receive tensor shape (#93) * optimize pipeline gpu memory usage * fix grad accumulation step counter * rename classes and functions Co-authored-by: Frank Lee <somerlee.9@gmail.com> |
||
---|---|---|
.. | ||
test_comm | ||
test_config | ||
test_context | ||
test_data | ||
test_data_pipeline_tensor_parallel | ||
test_engine/test_engine | ||
test_layers | ||
test_trainer | ||
test_utils | ||
test_zero_data_parallel | ||
test_zero_tensor_parallel |