Commit Graph

10 Commits (feature/zerobubble)

Author SHA1 Message Date
flybird11111 295dd2d9fe
[zerobubble] rebase main (#6075)
2 months ago
Haze188 887d2d579b
[misc] Bypass the huggingface bug to solve the mask mismatch problem (#5991)
4 months ago
hxwang cb01c0d5ce [moe] refactor mesh assignment
4 months ago
haze188 034020bd04 [misc] remove debug/print code
4 months ago
haze188 b2952a5982 [moe] deepseek moe sp support
4 months ago
hxwang 70c9924d0d [chore] solve moe ckpt test failure and some other arg pass failure
4 months ago
hxwang 803878b2fd [moe] full test for deepseek and mixtral (pp + sp to fix)
4 months ago
hxwang 877d94bb8c [moe] init moe plugin comm setting with sp
4 months ago
hxwang 74eccac0db [moe] test deepseek
4 months ago
Haze188 3420921101
[shardformer] DeepseekMoE support (#5871)
5 months ago