mirror of https://github.com/hpcaitech/ColossalAI
55cc7f3df7
* clean requirements * modify example inference struct * add test ci scripts * mark test_infer as submodule * rm deprecated cls & deps * import of HAS_FLASH_ATTN * prune inference tests to be run * prune triton kernel tests * increment pytest timeout mins * revert import path in openmoe |
||
---|---|---|
.. | ||
benchmark_llama.py | ||
benchmark_llama3.py | ||
llama_generation.py | ||
run_benchmark.sh | ||
test_ci.sh |