ColossalAI/colossalai/moe
Hongxin Liu da39d21b71 [moe] support mixtral (#5309) 10 months ago
__init__.py [moe] init mixtral impl 10 months ago
_operation.py [moe] support mixtral (#5309) 10 months ago
checkpoint.py [moe] init mixtral impl 10 months ago
experts.py [moe] init mixtral impl 10 months ago
layers.py [moe] init mixtral impl 10 months ago
load_balance.py
loss.py
manager.py fix some typo (#5307) 10 months ago
routers.py [moe] update capacity computing (#5253) 10 months ago
utils.py [moe] init mixtral impl 10 months ago