ColossalAI/colossalai
Jiarui Fang d42afd30f8
[builder] runtime adam and fused_optim builder (#2184)
2022-12-23 14:14:21 +08:00
..
_C [optimizer] add div_scale for optimizers (#2117) 2022-12-12 17:58:57 +08:00
amp [builder] runtime adam and fused_optim builder (#2184) 2022-12-23 14:14:21 +08:00
auto_parallel [autoparallel] integrate_gpt_related_tests (#2134) 2022-12-23 12:36:59 +08:00
builder [NFC] polish colossalai/builder/__init__.py code style (#1560) 2022-09-08 22:11:04 +08:00
cli [cli] updated installation cheheck with more inforamtion (#2050) 2022-11-30 17:53:55 +08:00
communication [communication] add p2p_v2.py to support communication with List[Any] (#1407) 2022-08-09 11:40:04 +08:00
context updated tp layers 2022-11-02 12:19:38 +08:00
device [device] update flatten device mesh usage (#2079) 2022-12-05 16:16:07 +08:00
engine [engin/schedule] use p2p_v2 to recontruct pipeline_schedule (#1408) 2022-08-12 11:33:26 +08:00
fx [Pipeline Middleware ] Fix deadlock when num_microbatch=num_stage (#2156) 2022-12-23 11:38:43 +08:00
gemini [hotfix] fix auto policy of test_sharded_optim_v2 (#2157) 2022-12-20 23:03:18 +08:00
kernel [builder] runtime adam and fused_optim builder (#2184) 2022-12-23 14:14:21 +08:00
logging fixed logger 2022-11-15 16:00:07 +08:00
nn [builder] runtime adam and fused_optim builder (#2184) 2022-12-23 14:14:21 +08:00
pipeline [Pipeline Middleware ] Fix deadlock when num_microbatch=num_stage (#2156) 2022-12-23 11:38:43 +08:00
registry Remove duplication registry (#1078) 2022-06-08 07:47:24 +08:00
tensor [autoparallel] memory estimation for shape consistency (#2144) 2022-12-21 10:39:37 +08:00
testing [zero] test gradient accumulation (#1964) 2022-11-29 13:00:30 +08:00
trainer [polish] remove useless file _mem_tracer_hook.py (#1963) 2022-11-16 15:55:10 +08:00
utils [Gemini] Update coloinit_ctx to support meta_tensor (#2147) 2022-12-19 22:37:07 +08:00
zero [example] add zero1, zero2 example in GPT examples (#2146) 2022-12-20 14:30:27 +08:00
__init__.py [setup] supported conda-installed torch (#2048) 2022-11-30 16:45:15 +08:00
constants.py updated tp layers 2022-11-02 12:19:38 +08:00
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2022-06-27 09:45:26 +08:00
global_variables.py updated tp layers 2022-11-02 12:19:38 +08:00
initialize.py [hotfix] remove potiential circle import (#1307) 2022-07-14 13:44:26 +08:00