You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/inference/core
yuehuayingxueluo e8f0642f28
[Inference]Add Nopadding Llama Modeling (#5327)
10 months ago
..
engine.py [Inference]Add Nopadding Llama Modeling (#5327) 10 months ago
request_handler.py [inference]Optimize the usage of the mid tensors space in flash attn (#5304) 10 months ago