ColossalAI/colossalai/inference/modeling/models
Yuanheng Zhao 8754abae24 [Fix] Fix & Update Inference Tests (compatibility w/ main) 2024-05-05 16:28:56 +00:00
__init__.py fix bugs in request_handler 2024-01-11 13:39:56 +00:00
glide_llama.py [Inference/SpecDec] Support GLIDE Drafter Model (#5455) 2024-04-10 11:07:52 +08:00
nopadding_baichuan.py [inference]Add alibi to flash attn function (#5678) 2024-04-30 19:35:05 +08:00
nopadding_llama.py [Fix] Fix & Update Inference Tests (compatibility w/ main) 2024-05-05 16:28:56 +00:00