ColossalAI/colossalai
HELSON e7d3afc9cc
[optimizer] add div_scale for optimizers (#2117)
* [optimizer] add div_scale for optimizers

* [zero] use div_scale in zero optimizer

* fix testing error
2022-12-12 17:58:57 +08:00
..
_C [optimizer] add div_scale for optimizers (#2117) 2022-12-12 17:58:57 +08:00
amp [kernel] move all symlinks of kernel to `colossalai._C` (#1971) 2022-11-17 13:42:33 +08:00
auto_parallel [autoparallel] add sum handler (#2101) 2022-12-08 17:02:54 +08:00
builder
cli [cli] updated installation cheheck with more inforamtion (#2050) 2022-11-30 17:53:55 +08:00
communication
context updated tp layers 2022-11-02 12:19:38 +08:00
device [device] update flatten device mesh usage (#2079) 2022-12-05 16:16:07 +08:00
engine
fx [autoparallel] support linear function bias addition (#2104) 2022-12-09 10:31:36 +08:00
gemini [NFC] update chunk manager API (#2119) 2022-12-12 16:57:22 +08:00
kernel [optimizer] add div_scale for optimizers (#2117) 2022-12-12 17:58:57 +08:00
logging fixed logger 2022-11-15 16:00:07 +08:00
nn [optimizer] add div_scale for optimizers (#2117) 2022-12-12 17:58:57 +08:00
pipeline [PP Middleware] Add bwd and step for PP middleware (#2111) 2022-12-12 12:40:03 +08:00
registry
tensor [NFC] polish comments for Chunk class (#2116) 2022-12-12 15:39:31 +08:00
testing [zero] test gradient accumulation (#1964) 2022-11-29 13:00:30 +08:00
trainer [polish] remove useless file _mem_tracer_hook.py (#1963) 2022-11-16 15:55:10 +08:00
utils [hotfix] fix a type in ColoInitContext (#2106) 2022-12-09 11:44:39 +08:00
zero [NFC] polish comments for Chunk class (#2116) 2022-12-12 15:39:31 +08:00
__init__.py [setup] supported conda-installed torch (#2048) 2022-11-30 16:45:15 +08:00
constants.py updated tp layers 2022-11-02 12:19:38 +08:00
core.py
global_variables.py updated tp layers 2022-11-02 12:19:38 +08:00
initialize.py