You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai
Yuanheng Zhao 5a9b05f7b2
[Inference/SpecDec] Add Basic Drafter Model Container (#5405)
8 months ago
..
_C
_analyzer [hotfix] quick fixes to make legacy tutorials runnable (#5559) 8 months ago
accelerator [hotfix] fix typo change MoECheckpintIO to MoECheckpointIO (#5335) 9 months ago
amp [npu] change device to accelerator api (#5239) 11 months ago
auto_parallel [hotfix] Fix wrong import in meta_registry (#5392) 9 months ago
autochunk
booster [devops] remove post commit ci (#5566) 8 months ago
checkpoint_io [devops] remove post commit ci (#5566) 8 months ago
cli [devops] fix extention building (#5427) 9 months ago
cluster [devops] remove post commit ci (#5566) 8 months ago
context [moe] merge moe into main (#4978) 1 year ago
device [npu] add npu support for hybrid plugin and llama (#5090) 1 year ago
fx
inference [Inference/SpecDec] Add Basic Drafter Model Container (#5405) 8 months ago
interface
kernel [Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401) 8 months ago
lazy
legacy [Fix] resolve conflicts of merging main 8 months ago
logging
moe [hotfix] fix typo change MoECheckpintIO to MoECheckpointIO (#5335) 9 months ago
nn [hotfix] quick fixes to make legacy tutorials runnable (#5559) 8 months ago
pipeline [shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508) 8 months ago
shardformer [Fix] resolve conflicts of merging main 8 months ago
tensor [devops] remove post commit ci (#5566) 8 months ago
testing [shardformer] update colo attention to support custom mask (#5510) 8 months ago
utils Merge pull request #5310 from hpcaitech/feature/npu 10 months ago
zero [devops] remove post commit ci (#5566) 8 months ago
__init__.py [devops] remove post commit ci (#5566) 8 months ago
initialize.py [npu] change device to accelerator api (#5239) 11 months ago