mirror of https://github.com/hpcaitech/ColossalAI
![]() * adapted to rotary_embedding * adapted to nopad rms norm * fix bugs in benchmark * fix flash_decoding.py |
||
---|---|---|
.. | ||
benchmark_llama.py | ||
build_smoothquant_weight.py | ||
run_benchmark.sh | ||
run_llama_inference.py |