Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Yuanheng Zhao 04863a9b14
[example] Update Llama Inference example (#5629)
7 months ago
..
_C
_analyzer [hotfix] quick fixes to make legacy tutorials runnable (#5559) 8 months ago
accelerator [hotfix] fix typo change MoECheckpintIO to MoECheckpointIO (#5335) 9 months ago
amp [npu] change device to accelerator api (#5239) 11 months ago
auto_parallel [hotfix] Fix wrong import in meta_registry (#5392) 9 months ago
autochunk
booster [devops] remove post commit ci (#5566) 8 months ago
checkpoint_io [devops] remove post commit ci (#5566) 8 months ago
cli [devops] fix extention building (#5427) 9 months ago
cluster [devops] remove post commit ci (#5566) 8 months ago
context
device
fx
inference [example] Update Llama Inference example (#5629) 7 months ago
interface
kernel [Fix/Inference] Fix GQA Triton and Support Llama3 (#5624) 7 months ago
lazy
legacy [Fix] resolve conflicts of merging main 8 months ago
logging
moe [hotfix] fix typo change MoECheckpintIO to MoECheckpointIO (#5335) 9 months ago
nn [hotfix] quick fixes to make legacy tutorials runnable (#5559) 8 months ago
pipeline [shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508) 8 months ago
shardformer [Fix] resolve conflicts of merging main 8 months ago
tensor [devops] remove post commit ci (#5566) 8 months ago
testing [shardformer] update colo attention to support custom mask (#5510) 8 months ago
utils Merge pull request #5310 from hpcaitech/feature/npu 10 months ago
zero [devops] remove post commit ci (#5566) 8 months ago
__init__.py [devops] remove post commit ci (#5566) 8 months ago
initialize.py [npu] change device to accelerator api (#5239) 11 months ago