You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai
Frank Lee ae1b58cd16
[tensor] added linear implementation for the new sharding spec (#1416)
2 years ago
..
amp [doc] update rst and docstring (#1351) 2 years ago
builder [NFC] polish colossalai/builder/builder.py code style (#1265) 2 years ago
cli
communication [communication] add p2p_v2.py to support communication with List[Any] (#1407) 2 years ago
context [doc] update rst and docstring (#1351) 2 years ago
device [device] add DeviceMesh class to support logical device layout (#1394) 2 years ago
engine [hotfix] fix PipelineSharedModuleGradientHandler (#1314) 2 years ago
fx [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174. (#1446) 2 years ago
gemini [zero] add chunk_managerV2 for all-gather chunk (#1441) 2 years ago
kernel [hotfix] fix CPUAdam kernel nullptr (#1410) 2 years ago
logging
nn [tensor] added linear implementation for the new sharding spec (#1416) 2 years ago
pipeline [pipeline]add customized policy (#1139) 2 years ago
registry Remove duplication registry (#1078) 3 years ago
tensor [tensor] added linear implementation for the new sharding spec (#1416) 2 years ago
testing [test] skip tests when not enough GPUs are detected (#1090) 3 years ago
trainer fix issue #1080 (#1071) 3 years ago
utils [utils] Impl clip_grad_norm for ColoTensor and ZeroOptimizer (#1442) 2 years ago
zero [utils] Impl clip_grad_norm for ColoTensor and ZeroOptimizer (#1442) 2 years ago
__init__.py [NFC] polish colossalai/__init__.py code style (#1285) 2 years ago
constants.py fix typo in constants (#1027) 3 years ago
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2 years ago
global_variables.py
initialize.py [hotfix] remove potiential circle import (#1307) 2 years ago