7 Commits (3568df498ab9ab2241ba2968de614bdc070ccbc9)

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| flybird11111 | a1e39f4c0d | [install]fix setup (#5786) | 6 months ago |
| Charles Coulombe | c46e09715c | Allow building cuda extension without a device. (#5535) | 6 months ago |
| 傅剑寒 | 279300dc5f | [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613) | 7 months ago |
| Hongxin Liu | 19e1a5cf16 | [shardformer] update colo attention to support custom mask (#5510) | 8 months ago |
| yuehuayingxueluo | 600881a8ea | [Inference]Add CUDA KVCache Kernel (#5406) | 9 months ago |
| Hongxin Liu | ffffc32dc7 | [checkpointio] fix gemini and hybrid parallel optim checkpoint (#5347) | 10 months ago |
| digger yu | 6a3086a505 | fix typo under extensions/ (#5330) | 10 months ago |
| Frank Lee | 7cfed5f076 | [feat] refactored extension module (#5298) | 10 months ago |