Commit Graph

5 Commits (58ad76d4665032bbe548d066116d1c572ce98979)

Author SHA1 Message Date
傅剑寒 279300dc5f
[Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613)
7 months ago
Yuanheng ed5ebd1735 [Fix] resolve conflicts of merging main
8 months ago
Hongxin Liu 19e1a5cf16
[shardformer] update colo attention to support custom mask (#5510)
8 months ago
yuehuayingxueluo 600881a8ea
[Inference]Add CUDA KVCache Kernel (#5406)
9 months ago
Frank Lee 7cfed5f076
[feat] refactored extension module (#5298)
10 months ago