ColossalAI/colossalai/inference/modeling
yuehuayingxueluo e8f0642f28
[Inference]Add Nopadding Llama Modeling (#5327)
* add nopadding llama modeling

* add nopadding_llama.py

* rm unused codes

* fix bugs in test_xine_copy.py

* fix code style
2024-01-30 10:31:46 +08:00
..
layers [Kernel/Fix] Revise flash attention triton kernel API and add benchmark (#5301) 2024-01-23 17:16:02 +08:00
models [Inference]Add Nopadding Llama Modeling (#5327) 2024-01-30 10:31:46 +08:00
policy [Inference]Add Nopadding Llama Modeling (#5327) 2024-01-30 10:31:46 +08:00