.. |
_C
|
[kernel] move all symlinks of kernel to `colossalai._C` (#1971)
|
2022-11-17 13:42:33 +08:00 |
amp
|
[kernel] move all symlinks of kernel to `colossalai._C` (#1971)
|
2022-11-17 13:42:33 +08:00 |
auto_parallel
|
[autoparallel] add sum handler (#2101)
|
2022-12-08 17:02:54 +08:00 |
builder
|
[NFC] polish colossalai/builder/__init__.py code style (#1560)
|
2022-09-08 22:11:04 +08:00 |
cli
|
[cli] updated installation cheheck with more inforamtion (#2050)
|
2022-11-30 17:53:55 +08:00 |
communication
|
[communication] add p2p_v2.py to support communication with List[Any] (#1407)
|
2022-08-09 11:40:04 +08:00 |
context
|
updated tp layers
|
2022-11-02 12:19:38 +08:00 |
device
|
[device] update flatten device mesh usage (#2079)
|
2022-12-05 16:16:07 +08:00 |
engine
|
[engin/schedule] use p2p_v2 to recontruct pipeline_schedule (#1408)
|
2022-08-12 11:33:26 +08:00 |
fx
|
[autoparallel] support linear function bias addition (#2104)
|
2022-12-09 10:31:36 +08:00 |
gemini
|
[Gemini] chunk init use OrderedParamGenerator (#2110)
|
2022-12-11 21:41:13 +08:00 |
kernel
|
[kernel] move all symlinks of kernel to `colossalai._C` (#1971)
|
2022-11-17 13:42:33 +08:00 |
logging
|
fixed logger
|
2022-11-15 16:00:07 +08:00 |
nn
|
[zero] add L2 gradient clipping for ZeRO (#2112)
|
2022-12-09 18:09:17 +08:00 |
pipeline
|
[PP Middleware] Add bwd and step for PP middleware (#2111)
|
2022-12-12 12:40:03 +08:00 |
registry
|
Remove duplication registry (#1078)
|
2022-06-08 07:47:24 +08:00 |
tensor
|
[Gemini] ParamOpHook -> ColoParamOpHook (#2080)
|
2022-12-05 17:11:06 +08:00 |
testing
|
[zero] test gradient accumulation (#1964)
|
2022-11-29 13:00:30 +08:00 |
trainer
|
[polish] remove useless file _mem_tracer_hook.py (#1963)
|
2022-11-16 15:55:10 +08:00 |
utils
|
[hotfix] fix a type in ColoInitContext (#2106)
|
2022-12-09 11:44:39 +08:00 |
zero
|
[Gemini] use MemStats to store the tracing data. Seperate it from Collector. (#2084)
|
2022-12-06 16:43:06 +08:00 |
__init__.py
|
[setup] supported conda-installed torch (#2048)
|
2022-11-30 16:45:15 +08:00 |
constants.py
|
updated tp layers
|
2022-11-02 12:19:38 +08:00 |
core.py
|
[Tensor] distributed view supports inter-process hybrid parallel (#1169)
|
2022-06-27 09:45:26 +08:00 |
global_variables.py
|
updated tp layers
|
2022-11-02 12:19:38 +08:00 |
initialize.py
|
[hotfix] remove potiential circle import (#1307)
|
2022-07-14 13:44:26 +08:00 |