ColossalAI/colossalai
Ziyue Jiang 05023ecfee
[Tensor] TP Linear 1D row (#843)
2022-04-24 13:43:12 +08:00
..
amp [hotfix] fix memory leak in zero (#781) 2022-04-18 13:57:03 +08:00
builder modefied the pp build for ckpt adaptation (#803) 2022-04-24 12:23:16 +08:00
cli [CLI] refactored the launch CLI and fixed bugs in multi-node launching (#844) 2022-04-24 13:26:26 +08:00
communication
context [compatibility] used backward-compatible API for global process group (#758) 2022-04-14 17:20:35 +08:00
engine [refactor] moving grad acc logic to engine (#804) 2022-04-19 14:03:21 +08:00
gemini [gemini] add GeminiMemoryManger (#832) 2022-04-24 13:08:48 +08:00
kernel Revert "[zero] add ZeroTensorShardStrategy (#793)" (#806) 2022-04-19 14:40:02 +08:00
logging
nn [gemini] add GeminiMemoryManger (#832) 2022-04-24 13:08:48 +08:00
registry [dependency] removed torchvision (#833) 2022-04-22 15:24:35 +08:00
tensor [Tensor] TP Linear 1D row (#843) 2022-04-24 13:43:12 +08:00
testing [test] added a decorator for address already in use error with backward compatibility (#760) 2022-04-14 16:48:44 +08:00
trainer [log] local throughput metrics (#811) 2022-04-20 10:05:39 +08:00
utils [pipelinable]use pipelinable context to initialize non-pipeline model (#816) 2022-04-24 13:03:12 +08:00
zero [gemini] add GeminiMemoryManger (#832) 2022-04-24 13:08:48 +08:00
__init__.py
constants.py
core.py
global_variables.py
initialize.py modefied the pp build for ckpt adaptation (#803) 2022-04-24 12:23:16 +08:00