mirror of https://github.com/hpcaitech/ColossalAI
* update flash-context-attention * adding kernels * fix * reset * add build script * add building process * add llama2 example * add colossal-llama2 test * clean * fall back test setting * fix test file * clean * clean * clean --------- Co-authored-by: cuiqing.li <lixx336@gmail.com> |
||
---|---|---|
serving | ||
_utils.py | ||
bench_bloom.py | ||
bench_chatglm2.py | ||
bench_llama.py | ||
colossal_llama2_demo.py | ||
gptq_bloom.py | ||
gptq_llama.py | ||
smoothquant_llama.py |