ColossalAI/colossalai/inference/core
yuehuayingxueluo f79963199c
[inference]Add alibi to flash attn function (#5678)
7 months ago
__init__.py [doc] updated inference readme (#5343) 10 months ago
engine.py [inference]Add alibi to flash attn function (#5678) 7 months ago
plugin.py [Feat]Tensor Model Parallel Support For Inference (#5563) 7 months ago
request_handler.py [Fix/Inference] Fix GQA Triton and Support Llama3 (#5624) 7 months ago