ColossalAI/colossalai
アマデウス 49715a78f0 [NFC] polish colossalai/cli/benchmark/benchmark.py code style (#2287) 2023-01-04 15:09:57 +08:00
_C [optimizer] add div_scale for optimizers (#2117) 2022-12-12 17:58:57 +08:00
amp [amp] add gradient clipping for unit tests (#2283) 2023-01-04 11:59:56 +08:00
auto_parallel [NFC] polish colossalai/auto_parallel/tensor_shard/node_handler/getitem_handler.py code style (#2289) 2023-01-04 15:09:57 +08:00
builder [NFC] polish colossalai/builder/__init__.py code style (#1560) 2022-09-08 22:11:04 +08:00
cli [NFC] polish colossalai/cli/benchmark/benchmark.py code style (#2287) 2023-01-04 15:09:57 +08:00
communication improved allgather & reducescatter for 3d 2023-01-03 17:46:08 +08:00
context [hotfix] Fixing the bug related to ipv6 support 2022-12-27 12:42:46 +08:00
device [device] update flatten device mesh usage (#2079) 2022-12-05 16:16:07 +08:00
engine [engin/schedule] use p2p_v2 to recontruct pipeline_schedule (#1408) 2022-08-12 11:33:26 +08:00
fx [auto-parallel] refactoring ColoTracer (#2118) 2023-01-04 14:44:22 +08:00
gemini [Gemini] fix the convert_to_torch_module bug (#2269) 2023-01-03 15:55:35 +08:00
kernel [builder] MOE builder (#2277) 2023-01-03 20:29:39 +08:00
logging [logger] hotfix, missing _FORMAT (#2231) 2022-12-29 22:59:39 +08:00
nn [builder] MOE builder (#2277) 2023-01-03 20:29:39 +08:00
pipeline [example] add benchmark (#2276) 2023-01-03 17:20:59 +08:00
registry Remove duplication registry (#1078) 2022-06-08 07:47:24 +08:00
tensor [autoparallel] fix runtime apply memory estimation (#2281) 2023-01-03 17:18:07 +08:00
testing [amp] add gradient clipping for unit tests (#2283) 2023-01-04 11:59:56 +08:00
trainer [polish] remove useless file _mem_tracer_hook.py (#1963) 2022-11-16 15:55:10 +08:00
utils [builder] unified cpu_optim fused_optim inferface (#2190) 2022-12-23 20:57:41 +08:00
zero [zero] polish low level zero optimizer (#2275) 2023-01-03 17:22:34 +08:00
__init__.py [setup] supported conda-installed torch (#2048) 2022-11-30 16:45:15 +08:00
constants.py updated tp layers 2022-11-02 12:19:38 +08:00
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2022-06-27 09:45:26 +08:00
global_variables.py updated tp layers 2022-11-02 12:19:38 +08:00
initialize.py [hotfix] remove potiential circle import (#1307) 2022-07-14 13:44:26 +08:00