ColossalAI/colossalai/inference/modeling
Yuanheng f8598e3ec5 [Fix] Llama Modeling Control with Spec-Dec (#5580)
- fix ref before asgmt
- fall back to use triton kernels when using spec-dec
2024-04-10 18:19:44 +08:00
layers [Inference]Fused the gate and up proj in mlp,and optimized the autograd process. (#5365) 2024-02-06 19:38:25 +08:00
models [Fix] Llama Modeling Control with Spec-Dec (#5580) 2024-04-10 18:19:44 +08:00
policy [Inference/SpecDec] Support GLIDE Drafter Model (#5455) 2024-04-10 11:07:52 +08:00
__init__.py [doc] updated inference readme (#5343) 2024-02-02 14:31:10 +08:00