mirror of https://github.com/hpcaitech/ColossalAI
![]() * refactor kvcache manager and rotary_embedding and kvcache_memcpy operator * refactor decode_kv_cache_memcpy * enable alibi in pagedattention |
||
---|---|---|
.. | ||
benchmark_ops | ||
benchmark_llama.py | ||
benchmark_llama3.py | ||
llama_generation.py | ||
run_benchmark.sh |