3 Commits (main)

Author SHA1 Message Date
flybird11111 0c10afd372
[FP8] rebase main (#5963) 4 months ago
hxwang cb01c0d5ce [moe] refactor mesh assignment 4 months ago
hxwang 70c9924d0d [chore] solve moe ckpt test failure and some other arg pass failure 4 months ago
hxwang 3e2b6132b7 [moe] clean legacy code 4 months ago
hxwang 102b784a10 [chore] arg pass & remove drop token 4 months ago
Haze188 416580b314
[MoE/ZeRO] Moe refactor with zero refactor (#5821) 5 months ago
Hongxin Liu 7f8b16635b
[misc] refactor launch API and tensor constructor (#5666) 7 months ago
Hongxin Liu da39d21b71 [moe] support mixtral (#5309) 10 months ago