mirror of https://github.com/hpcaitech/ColossalAI
![]() * add kv copy triton kernel during decoding stage * add pytest and fix kernel * fix test utilities * revise kernel config * add benchmark for kvcache copy |
||
---|---|---|
.. | ||
attention.py |