mirror of https://github.com/hpcaitech/ColossalAI
7a3dfd0c64
* cherry-pick flash attention 2 * [shardformer] update shardformer to use flash attention 2, fix |
||
---|---|---|
test_checkpoint | ||
test_activation_checkpointing.py | ||
test_colo_checkpoint.py | ||
test_commons.py | ||
test_flash_attention.py | ||
test_memory.py | ||
test_norm_gradient_clipping.py | ||
test_zero_gradient_clippling.py |