You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/inference/modeling/models
Yuanheng Zhao 8754abae24
[Fix] Fix & Update Inference Tests (compatibility w/ main)
7 months ago
..
__init__.py fix bugs in request_handler 11 months ago
glide_llama.py [Inference/SpecDec] Support GLIDE Drafter Model (#5455) 8 months ago
nopadding_baichuan.py [inference]Add alibi to flash attn function (#5678) 7 months ago
nopadding_llama.py [Fix] Fix & Update Inference Tests (compatibility w/ main) 7 months ago