ColossalAI/colossalai
Ziyue Jiang fef5c949c3
polish pp middleware (#2476)
Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>
2023-01-13 16:56:01 +08:00
..
_C [setup] support pre-build and jit-build of cuda kernels (#2374) 2023-01-06 20:50:26 +08:00
amp [setup] support pre-build and jit-build of cuda kernels (#2374) 2023-01-06 20:50:26 +08:00
auto_parallel [autoparallel] update binary elementwise handler (#2451) 2023-01-12 09:35:10 +08:00
autochunk adapt new fx 2023-01-10 11:56:00 +08:00
builder [NFC] polish colossalai/builder/__init__.py code style (#1560) 2022-09-08 22:11:04 +08:00
cli [cli] fixed hostname mismatch error (#2465) 2023-01-12 14:52:09 +08:00
communication [NFC] polish communication/p2p_v2.py code style (#2303) 2023-01-04 15:09:57 +08:00
context Update parallel_context.py (#2408) 2023-01-10 11:27:23 +08:00
device [autoparallel] integrate device mesh initialization into autoparallelize (#2393) 2023-01-11 14:03:49 +08:00
engine [engin/schedule] use p2p_v2 to recontruct pipeline_schedule (#1408) 2022-08-12 11:33:26 +08:00
fx [fx] allow native ckpt trace and codegen. (#2438) 2023-01-11 13:49:59 +08:00
gemini [zero] add warning for ignored parameters (#2446) 2023-01-11 15:30:09 +08:00
kernel [example] integrate seq-parallel tutorial with CI (#2463) 2023-01-13 14:40:05 +08:00
logging [logger] hotfix, missing _FORMAT (#2231) 2022-12-29 22:59:39 +08:00
nn [zero] add warning for ignored parameters (#2446) 2023-01-11 15:30:09 +08:00
pipeline polish pp middleware (#2476) 2023-01-13 16:56:01 +08:00
registry Remove duplication registry (#1078) 2022-06-08 07:47:24 +08:00
tensor [hotfix] fix implement error in diffusers 2023-01-07 07:56:39 +08:00
testing [amp] add gradient clipping for unit tests (#2283) 2023-01-04 11:59:56 +08:00
trainer [polish] remove useless file _mem_tracer_hook.py (#1963) 2022-11-16 15:55:10 +08:00
utils [ddp] add is_ddp_ignored (#2434) 2023-01-11 12:22:45 +08:00
zero [zero] polish low level optimizer (#2473) 2023-01-13 14:56:17 +08:00
__init__.py [setup] supported conda-installed torch (#2048) 2022-11-30 16:45:15 +08:00
constants.py updated tp layers 2022-11-02 12:19:38 +08:00
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2022-06-27 09:45:26 +08:00
global_variables.py updated tp layers 2022-11-02 12:19:38 +08:00
initialize.py Fix False warning in initialize.py (#2456) 2023-01-12 13:49:01 +08:00