ColossalAI/colossalai/inference/modeling/models
char-1ee 04386d9eff Refactor modeling by adding attention backend
Signed-off-by: char-1ee <xingjianli59@gmail.com>
2024-06-07 08:33:47 +00:00
..
__init__.py fix bugs in request_handler 2024-01-11 13:39:56 +00:00
glide_llama.py [Inference/SpecDec] Support GLIDE Drafter Model (#5455) 2024-04-10 11:07:52 +08:00
nopadding_baichuan.py Refactor modeling by adding attention backend 2024-06-07 08:33:47 +00:00
nopadding_llama.py Refactor modeling by adding attention backend 2024-06-07 08:33:47 +00:00