13 Commits (810cafb2f987cac2bbe99ef491455921f197f315)

Author SHA1 Message Date
flybird11111 295dd2d9fe [zerobubble] rebase main (#6075)
2 months ago
hxwang 74b03de3f9 [moe] remove ops
4 months ago
hxwang 803878b2fd [moe] full test for deepseek and mixtral (pp + sp to fix)
4 months ago
hxwang 3e2b6132b7 [moe] clean legacy code
4 months ago
botbw dc583aa576 [moe] implement tp
4 months ago
botbw 9b9b76bdcd [moe] add mixtral dp grad scaling when not all experts are activated
4 months ago
botbw b5bfeb2efd [moe] implement transit between non moe tp and ep
4 months ago
hxwang 46c069b0db [zero] solve hang
4 months ago
Hongxin Liu da39d21b71 [moe] support mixtral (#5309)
10 months ago
Frank Lee 7cfed5f076 [feat] refactored extension module (#5298)
10 months ago
Wenhao Chen 3c08f17348 [hotfix]: modify create_ep_hierarchical_group and add test (#5032)
1 year ago
Wenhao Chen 724441279b [moe]: fix ep/tp tests, add hierarchical all2all (#4982)
1 year ago
Xuanlei Zhao dc003c304c [moe] merge moe into main (#4978)
1 year ago