ColossalAI/colossalai
Frank Lee 11973d892d
[fx] added torchvision model tracing testing (#1216)
2 years ago
amp [hotfix]different overflow status lead to communication stuck. (#1175) 2 years ago
builder [pipeline] refactor the pipeline module (#1087) 3 years ago
cli [hotfix] fix some bugs caused by size mismatch. (#1011) 3 years ago
communication [hotfix]fixed p2p process send stuck (#1181) 2 years ago
context
engine [hotfix]fix some bugs caused by refactored schedule. (#1148) 2 years ago
fx [fx] added torchvision model tracing testing (#1216) 2 years ago
gemini make AutoPlacementPolicy configurable (#1191) 2 years ago
kernel [optim] refactor fused sgd (#1134) 2 years ago
logging
nn [refactor] move process group from _DistSpec to ColoTensor. (#1203) 2 years ago
pipeline [pipeline]add customized policy (#1139) 2 years ago
registry Remove duplication registry (#1078) 3 years ago
tensor [refactor] move process group from _DistSpec to ColoTensor. (#1203) 2 years ago
testing [test] skip tests when not enough GPUs are detected (#1090) 3 years ago
trainer fix issue #1080 (#1071) 3 years ago
utils [checkpoint] make unitest faster (#1217) 2 years ago
zero warmup ratio configration (#1192) 2 years ago
__init__.py [NFC] polish __init__.py code style (#965) 3 years ago
constants.py fix typo in constants (#1027) 3 years ago
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2 years ago
global_variables.py
initialize.py [ddp] supported customized torch ddp configuration (#1123) 2 years ago