ColossalAI/colossalai/inference/modeling
Runyu Lu aabc9fb6aa [feat] add use_cuda_kernel option 2024-03-19 13:24:25 +08:00
..
layers [Inference]Fused the gate and up proj in mlp,and optimized the autograd process. (#5365) 2024-02-06 19:38:25 +08:00
models [feat] add use_cuda_kernel option 2024-03-19 13:24:25 +08:00
policy feat rmsnorm cuda kernel and add unittest, benchmark script (#5417) 2024-03-08 16:21:12 +08:00
__init__.py [doc] updated inference readme (#5343) 2024-02-02 14:31:10 +08:00