Commit Graph

7 Commits (feature/inference-refactor)

Author SHA1 Message Date
Bin Jia 81b8f5e76a
[Inference Refactor] Merge chatglm2 with pp and tp (#5023)
1 year ago
Bin Jia 48d0a58d10
add support for bloom (#5008)
1 year ago
Bin Jia b6696beb04
[Pipeline Inference] Merge pp with tp (#4993)
1 year ago
Bin Jia 1db6727678
[Pipeline inference] Combine kvcache with pipeline inference (#4938)
1 year ago
github-actions[bot] 486d06a2d5
[format] applied code formatting on changed files in pull request 4820 (#4886)
1 year ago
Bin Jia 08a9f76b2f
[Pipeline Inference] Sync pipeline inference branch to main (#4820)
1 year ago
Cuiqing Li bce0f16702
[Feature] The first PR to Add TP inference engine, kv-cache manager and related kernels for our inference system (#4577)
1 year ago