mirror of https://github.com/hpcaitech/ColossalAI
7a3dfd0c64
* cherry-pick flash attention 2 cherry-pick flash attention 2 * [shardformer] update shardformer to use flash attention 2 [shardformer] update shardformer to use flash attention 2, fix [shardformer] update shardformer to use flash attention 2, fix [shardformer] update shardformer to use flash attention 2, fix |
||
---|---|---|
.. | ||
cuda_native | ||
jit | ||
triton | ||
__init__.py | ||
op_builder |