mirror of https://github.com/hpcaitech/ColossalAI
![]() * refactor kvcache manager and rotary_embedding and kvcache_memcpy operator * refactor decode_kv_cache_memcpy * enable alibi in pagedattention |
||
---|---|---|
.. | ||
__init__.py | ||
inference.cpp | ||
inference_ops_cuda.py |