Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Yuanheng f8598e3ec5 [Fix] Llama Modeling Control with Spec-Dec (#5580) 8 months ago
..
_C
_analyzer [hotfix] quick fixes to make legacy tutorials runnable (#5559) 8 months ago
accelerator [hotfix] fix typo change MoECheckpintIO to MoECheckpointIO (#5335) 9 months ago
amp
auto_parallel [hotfix] Fix wrong import in meta_registry (#5392) 9 months ago
autochunk
booster [devops] remove post commit ci (#5566) 8 months ago
checkpoint_io [devops] remove post commit ci (#5566) 8 months ago
cli [devops] fix extention building (#5427) 9 months ago
cluster [devops] remove post commit ci (#5566) 8 months ago
context
device
fx
inference [Fix] Llama Modeling Control with Spec-Dec (#5580) 8 months ago
interface
kernel [Inference/SpecDec] Add Speculative Decoding Implementation (#5423) 8 months ago
lazy
legacy [Fix] resolve conflicts of merging main 8 months ago
logging
moe [hotfix] fix typo change MoECheckpintIO to MoECheckpointIO (#5335) 9 months ago
nn [hotfix] quick fixes to make legacy tutorials runnable (#5559) 8 months ago
pipeline [shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508) 8 months ago
shardformer [Fix] resolve conflicts of merging main 8 months ago
tensor [devops] remove post commit ci (#5566) 8 months ago
testing [shardformer] update colo attention to support custom mask (#5510) 8 months ago
utils Merge pull request #5310 from hpcaitech/feature/npu 10 months ago
zero [devops] remove post commit ci (#5566) 8 months ago
__init__.py [devops] remove post commit ci (#5566) 8 months ago
initialize.py