Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Wenhao Chen bb0a668fee
[hotfix] set return_outputs=False in examples and polish code (#5404)
8 months ago
..
moe_utils.py [moe] init mixtral impl 10 months ago
test_grad_handler.py [npu] change device to accelerator api (#5239) 11 months ago
test_kernel.py [npu] change device to accelerator api (#5239) 11 months ago
test_moe_checkpoint.py [hotfix] set return_outputs=False in examples and polish code (#5404) 8 months ago
test_moe_ep_tp.py [npu] change device to accelerator api (#5239) 11 months ago
test_moe_group.py [npu] change device to accelerator api (#5239) 11 months ago
test_moe_hybrid_zero.py [moe] support optimizer checkpoint (#5015) 1 year ago
test_moe_load_balance.py [moe] support optimizer checkpoint (#5015) 1 year ago
test_moe_router.py [moe] fix tests 10 months ago
test_moe_zero_fwd_bwd.py [moe] init mixtral impl 10 months ago
test_moe_zero_optim.py [moe] init mixtral impl 10 months ago