Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Baizhou Zhang 822c3d4d66
[checkpointio] sharded optimizer checkpoint for DDP plugin (#4002)
1 year ago
..
_C [setup] support pre-build and jit-build of cuda kernels (#2374) 2 years ago
_analyzer [example] add train resnet/vit with booster example (#3694) 2 years ago
amp [bf16] add bf16 support (#3882) 1 year ago
auto_parallel [nfc] fix typo colossalai/ applications/ (#3831) 2 years ago
autochunk fix typo colossalai/auto_parallel autochunk fx/passes etc. (#3808) 2 years ago
booster [checkpointio] sharded optimizer checkpoint for DDP plugin (#4002) 1 year ago
builder [NFC] polish colossalai/builder/__init__.py code style (#1560) 2 years ago
checkpoint_io [checkpointio] sharded optimizer checkpoint for DDP plugin (#4002) 1 year ago
cli Modify torch version requirement to adapt torch 2.0 (#3896) 1 year ago
cluster fix typo colossalai/auto_parallel autochunk fx/passes etc. (#3808) 2 years ago
communication [CI] fix some spelling errors (#3707) 2 years ago
context [CI] fix some spelling errors (#3707) 2 years ago
device Revert "[sync] sync feature/shardformer with develop" 1 year ago
engine fix typo colossalai/auto_parallel autochunk fx/passes etc. (#3808) 2 years ago
fx [nfc] fix typo colossalai/cli fx kernel (#3847) 1 year ago
interface [booster] implemented the torch ddd + resnet example (#3232) 2 years ago
kernel [bf16] add bf16 support (#3882) 1 year ago
lazy Revert "[sync] sync feature/shardformer with develop" 1 year ago
logging [logger] hotfix, missing _FORMAT (#2231) 2 years ago
nn [nfc]fix typo colossalai/pipeline tensor nn (#3899) 1 year ago
pipeline [nfc]fix typo colossalai/pipeline tensor nn (#3899) 1 year ago
registry Remove duplication registry (#1078) 2 years ago
tensor Revert "[sync] sync feature/shardformer with develop" 1 year ago
testing [NFC] fix typo with colossalai/auto_parallel/tensor_shard (#3742) 2 years ago
trainer fix typo with colossalai/trainer utils zero (#3908) 1 year ago
utils fix typo with colossalai/trainer utils zero (#3908) 1 year ago
zero [gemini] fixed the gemini checkpoint io (#3934) 1 year ago
__init__.py [setup] supported conda-installed torch (#2048) 2 years ago
constants.py updated tp layers 2 years ago
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2 years ago
global_variables.py [NFC] polish colossalai/global_variables.py code style (#3259) 2 years ago
initialize.py [nfc] fix typo colossalai/zero (#3923) 1 year ago