4 Commits (10a19e22c63aa9963a889874b63c47ccd0e6db42)

Author            SHA1        Message                                                                               Date
Runyu Lu          18d67d0e8e  [Feat]Inference RPC Server Support (#5705)                                            6 months ago
Runyu Lu          e37ee2fb65  [Feat]Tensor Model Parallel Support For Inference (#5563)                             7 months ago
yuehuayingxueluo  f366a5ea1f  [Inference/kernel]Add Fused Rotary Embedding and KVCache Memcopy CUDA Kernel (#5418)  9 months ago
yuehuayingxueluo  cea9c86e45  add utils.py                                                                          10 months ago