Commit Graph

7 Commits (fba04e857b57abc54ba4864cbfb3af0461e2c5e7)

Author SHA1 Message Date
Wenhao Chen e614aa34f3 [shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508)
8 months ago
Insu Jang 00525f7772 [shardformer] fix pipeline forward error if custom layer distribution is used (#5189)
8 months ago
Hongxin Liu 65e5d6baa5 [moe] fix mixtral optim checkpoint (#5344)
10 months ago
Hongxin Liu 956b561b54 [moe] fix mixtral forward default value (#5329)
10 months ago
Hongxin Liu b60be18dcc [moe] fix mixtral checkpoint io (#5314)
10 months ago
Hongxin Liu da39d21b71 [moe] support mixtral (#5309)
10 months ago
Xuanlei Zhao 7d8e0338a4 [moe] init mixtral impl
10 months ago