ColossalAI

yuehuayingxueluo 600881a8ea [Inference]Add CUDA KVCache Kernel (#5406)	2024-02-28 14:36:50 +08:00
* add cuda KVCache kernel
* annotation benchmark_kvcache_copy
* add use cuda
* fix import path
* move benchmark scripts to example/
* rm benchmark codes in test_kv_cache_memcpy.py
* rm redundancy codes
* rm redundancy codes
* pr was modified according to the review
jit	[npu] change device to accelerator api (#5239)	2024-01-09 10:20:05 +08:00
triton	Optimized the execution interval time between cuda kernels caused by view and memcopy (#5390)	2024-02-21 13:23:57 +08:00
__init__.py	[feat] refactored extension module (#5298)	2024-01-25 17:01:48 +08:00
extensions	[feat] refactored extension module (#5298)	2024-01-25 17:01:48 +08:00
kernel_loader.py	[Inference]Add CUDA KVCache Kernel (#5406)	2024-02-28 14:36:50 +08:00