Commit Graph

3 Commits (de4bf3dedf2c7cb7ba6c3044745bab3c3ef6352d)

Author SHA1 Message Date
Runyu Lu e37ee2fb65
[Feat]Tensor Model Parallel Support For Inference (#5563)
7 months ago
yuehuayingxueluo f366a5ea1f
[Inference/kernel]Add Fused Rotary Embedding and KVCache Memcopy CUDA Kernel (#5418)
9 months ago
yuehuayingxueluo cea9c86e45 add utils.py
10 months ago