You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai
Frank Lee f7878f465c
[fx] supported model tracing for huggingface bert (#1201)
2 years ago
..
amp [hotfix]different overflow status lead to communication stuck. (#1175) 2 years ago
builder [pipeline] refactor the pipeline module (#1087) 3 years ago
cli [hotfix] fix some bugs caused by size mismatch. (#1011) 3 years ago
communication [hotfix]fixed p2p process send stuck (#1181) 2 years ago
context [usability] improved error messages in the context module (#856) 3 years ago
engine [hotfix]fix some bugs caused by refactored schedule. (#1148) 2 years ago
fx [fx] supported model tracing for huggingface bert (#1201) 2 years ago
gemini make AutoPlacementPolicy configurable (#1191) 2 years ago
kernel [optim] refactor fused sgd (#1134) 2 years ago
logging [doc] improved docstring in the logging module (#861) 3 years ago
nn [refactor] remove gpc dependency in colotensor's _ops (#1189) 2 years ago
pipeline [pipeline]add customized policy (#1139) 2 years ago
registry Remove duplication registry (#1078) 3 years ago
tensor [refactor] remove gpc dependency in colotensor's _ops (#1189) 2 years ago
testing [test] skip tests when not enough GPUs are detected (#1090) 3 years ago
trainer fix issue #1080 (#1071) 3 years ago
utils [context]support arbitary module materialization. (#1193) 2 years ago
zero warmup ratio configration (#1192) 2 years ago
__init__.py [NFC] polish __init__.py code style (#965) 3 years ago
constants.py fix typo in constants (#1027) 3 years ago
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2 years ago
global_variables.py [MOE] add unitest for MOE experts layout, gradient handler and kernel (#469) 3 years ago
initialize.py [ddp] supported customized torch ddp configuration (#1123) 2 years ago