You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai
YuliangLiu0306 e27645376d
[hotfix]different overflow status lead to communication stuck. (#1175)
2 years ago
..
amp [hotfix]different overflow status lead to communication stuck. (#1175) 2 years ago
builder [pipeline] refactor the pipeline module (#1087) 3 years ago
cli [hotfix] fix some bugs caused by size mismatch. (#1011) 3 years ago
communication [hotfix]different overflow status lead to communication stuck. (#1175) 2 years ago
context [usability] improved error messages in the context module (#856) 3 years ago
engine [hotfix]fix some bugs caused by refactored schedule. (#1148) 2 years ago
fx [fx]add autoparallel passes (#1121) 2 years ago
gemini [gemini] refactor gemini mgr (#1151) 2 years ago
kernel [optim] refactor fused sgd (#1134) 2 years ago
logging [doc] improved docstring in the logging module (#861) 3 years ago
nn [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2 years ago
pipeline [pipeline]add customized policy (#1139) 2 years ago
registry Remove duplication registry (#1078) 3 years ago
tensor [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2 years ago
testing [test] skip tests when not enough GPUs are detected (#1090) 3 years ago
trainer fix issue #1080 (#1071) 3 years ago
utils [hotfix]different overflow status lead to communication stuck. (#1175) 2 years ago
zero [zero] sharded optim supports loading local state dict (#1170) 2 years ago
__init__.py [NFC] polish __init__.py code style (#965) 3 years ago
constants.py fix typo in constants (#1027) 3 years ago
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2 years ago
global_variables.py
initialize.py [ddp] supported customized torch ddp configuration (#1123) 2 years ago