mirror of https://github.com/hpcaitech/ColossalAI
![]() * add cuda KVCache kernel * annotation benchmark_kvcache_copy * add use cuda * fix import path * move benchmark scripts to example/ * rm benchmark codes in test_kv_cache_memcpy.py * rm redundancy codes * rm redundancy codes * pr was modified according to the review |
||
---|---|---|
.. | ||
arm | ||
cuda | ||
__init__.py | ||
scaled_softmax.py |