.. |
_C
|
[optimizer] add div_scale for optimizers (#2117)
|
2022-12-12 17:58:57 +08:00 |
amp
|
[amp] add gradient clipping for unit tests (#2283)
|
2023-01-04 11:59:56 +08:00 |
auto_parallel
|
[NFC] polish colossalai/auto_parallel/tensor_shard/node_handler/getitem_handler.py code style (#2289)
|
2023-01-04 15:09:57 +08:00 |
builder
|
[NFC] polish colossalai/builder/__init__.py code style (#1560)
|
2022-09-08 22:11:04 +08:00 |
cli
|
[NFC] polish colossalai/cli/benchmark/benchmark.py code style (#2287)
|
2023-01-04 15:09:57 +08:00 |
communication
|
improved allgather & reducescatter for 3d
|
2023-01-03 17:46:08 +08:00 |
context
|
[hotfix] Fixing the bug related to ipv6 support
|
2022-12-27 12:42:46 +08:00 |
device
|
[device] update flatten device mesh usage (#2079)
|
2022-12-05 16:16:07 +08:00 |
engine
|
[engin/schedule] use p2p_v2 to recontruct pipeline_schedule (#1408)
|
2022-08-12 11:33:26 +08:00 |
fx
|
[auto-parallel] refactoring ColoTracer (#2118)
|
2023-01-04 14:44:22 +08:00 |
gemini
|
[Gemini] fix the convert_to_torch_module bug (#2269)
|
2023-01-03 15:55:35 +08:00 |
kernel
|
[builder] MOE builder (#2277)
|
2023-01-03 20:29:39 +08:00 |
logging
|
[logger] hotfix, missing _FORMAT (#2231)
|
2022-12-29 22:59:39 +08:00 |
nn
|
[builder] MOE builder (#2277)
|
2023-01-03 20:29:39 +08:00 |
pipeline
|
[example] add benchmark (#2276)
|
2023-01-03 17:20:59 +08:00 |
registry
|
Remove duplication registry (#1078)
|
2022-06-08 07:47:24 +08:00 |
tensor
|
[autoparallel] fix runtime apply memory estimation (#2281)
|
2023-01-03 17:18:07 +08:00 |
testing
|
[amp] add gradient clipping for unit tests (#2283)
|
2023-01-04 11:59:56 +08:00 |
trainer
|
[polish] remove useless file _mem_tracer_hook.py (#1963)
|
2022-11-16 15:55:10 +08:00 |
utils
|
[builder] unified cpu_optim fused_optim inferface (#2190)
|
2022-12-23 20:57:41 +08:00 |
zero
|
[zero] polish low level zero optimizer (#2275)
|
2023-01-03 17:22:34 +08:00 |
__init__.py
|
[setup] supported conda-installed torch (#2048)
|
2022-11-30 16:45:15 +08:00 |
constants.py
|
updated tp layers
|
2022-11-02 12:19:38 +08:00 |
core.py
|
[Tensor] distributed view supports inter-process hybrid parallel (#1169)
|
2022-06-27 09:45:26 +08:00 |
global_variables.py
|
updated tp layers
|
2022-11-02 12:19:38 +08:00 |
initialize.py
|
[hotfix] remove potiential circle import (#1307)
|
2022-07-14 13:44:26 +08:00 |