ColossalAI/colossalai
binmakeswell 5f41463a76
add optimizer README for tutorials (#1707)
2 years ago
amp [doc] update rst and docstring (#1351) 2 years ago
auto_parallel [autoparallel] refactored the autoparallel module for organization (#1706) 2 years ago
builder [NFC] polish colossalai/builder/__init__.py code style (#1560) 2 years ago
cli [hotfix] fix some bugs caused by size mismatch. (#1011) 3 years ago
communication [communication] add p2p_v2.py to support communication with List[Any] (#1407) 2 years ago
context [moe] initialize MoE groups by ProcessGroup (#1640) 2 years ago
device [tensor]add 1D device mesh (#1492) 2 years ago
engine [engin/schedule] use p2p_v2 to recontruct pipeline_schedule (#1408) 2 years ago
fx [autoparallel] adapt runtime passes (#1703) 2 years ago
gemini [feature] A new ZeRO implementation (#1644) 2 years ago
kernel [hotfix] fix CPUAdam kernel nullptr (#1410) 2 years ago
logging [doc] improved docstring in the logging module (#861) 3 years ago
nn add optimizer README for tutorials (#1707) 2 years ago
pipeline [pipeline/fix-bug] num_microbatches support any integrate | stable chimera | launch tool for rpc pp framework (#1684) 2 years ago
registry Remove duplication registry (#1078) 3 years ago
tensor [autoparallel] added sharding spec conversion for linear handler (#1687) 2 years ago
testing [unittest] added doc for the pytest wrapper (#1704) 2 years ago
trainer [NFC] polish ./colossalai/trainer/hooks/_lr_scheduler_hook.py code style (#1576) 2 years ago
utils [pipeline/rank_recorder] fix bug when process data before backward | add a tool for multiple ranks debug (#1681) 2 years ago
zero [feature] A new ZeRO implementation (#1644) 2 years ago
__init__.py update version to 0.1.10 (#1676) 2 years ago
_meta_registrations.py [fx/profiler] tuned the calculation of memory estimation (#1619) 2 years ago
constants.py fix typo in constants (#1027) 3 years ago
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2 years ago
global_variables.py [MOE] add unitest for MOE experts layout, gradient handler and kernel (#469) 3 years ago
initialize.py [hotfix] remove potiential circle import (#1307) 2 years ago