mirror of https://github.com/hpcaitech/ColossalAI
00525f7772
* Use self.[distribute_layers|get_stage_index] to exploit custom layer distribution * Change static methods for t5 layer distribution to member functions * Change static methods for whisper layer distribution to member functions * Replace whisper policy usage with self one * Fix test case to use non-static layer distribution methods * fix: fix typo --------- Co-authored-by: Wenhao Chen <cwher@outlook.com> |
||
---|---|---|
.. | ||
test_hybrid_parallel_grad_clip_norm | ||
test_layer | ||
test_model | ||
__init__.py | ||
test_flash_attention.py | ||
test_shard_utils.py | ||
test_with_torch_ddp.py |