mirror of https://github.com/hpcaitech/ColossalAI
7a3dfd0c64
* cherry-pick flash attention 2 * [shardformer] update shardformer to use flash attention 2 [shardformer] update shardformer to use flash attention 2, fix |
||
---|---|---|
csrc | ||
mha | ||
__init__.py | ||
layer_norm.py | ||
multihead_attention.py | ||
scaled_softmax.py |