Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Frank Lee efef43b53c
Merge pull request #5372 from hpcaitech/exp/mixtral
10 months ago
..
__init__.py [misc] update pre-commit and run all files (#4752) 1 year ago
checkpoint_io_base.py [moe] support mixtral (#5309) 10 months ago
general_checkpoint_io.py [shardformer] fix master param sync for hybrid plugin/rewrite unwrapping logic (#4758) 1 year ago
hybrid_parallel_checkpoint_io.py [llama] polish training script and fix optim ckpt (#5368) 10 months ago
index_file.py [misc] update pre-commit and run all files (#4752) 1 year ago
utils.py [shardformer] Fix serialization error with Tensor Parallel state saving (#5018) 1 year ago