ColossalAI/colossalai
Frank Lee ed19290560
[booster] implemented mixed precision class (#3151)
* [booster] implemented mixed precision class

* polish code
2023-03-17 11:00:15 +08:00
..
_C [setup] support pre-build and jit-build of cuda kernels (#2374) 2023-01-06 20:50:26 +08:00
_analyzer [analyzer] a minimal implementation of static graph analyzer (#2852) 2023-03-10 13:21:05 +08:00
amp [test] fixed the triton version for testing (#2608) 2023-02-07 13:49:38 +08:00
auto_parallel [hotfix] add shard dim to aviod backward communication error (#2954) 2023-03-01 11:41:53 +08:00
autochunk [autochunk] support vit (#3084) 2023-03-10 10:23:26 +08:00
booster [booster] implemented mixed precision class (#3151) 2023-03-17 11:00:15 +08:00
builder [NFC] polish colossalai/builder/__init__.py code style (#1560) 2022-09-08 22:11:04 +08:00
cli [cli] handled version check exceptions (#2848) 2023-02-21 17:04:49 +08:00
communication [NFC] polish communication/p2p_v2.py code style (#2303) 2023-01-04 15:09:57 +08:00
context [NFC] polish colossalai/context/process_group_initializer/initializer_2d.py code style (#2726) 2023-02-15 22:27:13 +08:00
device [hotfix] add copyright for solver and device mesh (#2803) 2023-02-18 21:14:38 +08:00
engine [doc] add deepspeed citation and copyright (#2996) 2023-03-04 20:08:11 +08:00
fx [analyzer] a minimal implementation of static graph analyzer (#2852) 2023-03-10 13:21:05 +08:00
gemini [doc] add deepspeed citation and copyright (#2996) 2023-03-04 20:08:11 +08:00
kernel [kernel] added kernel loader to softmax autograd function (#3093) 2023-03-10 14:27:09 +08:00
logging [logger] hotfix, missing _FORMAT (#2231) 2022-12-29 22:59:39 +08:00
nn [format] applied code formatting on changed files in pull request 2997 (#3008) 2023-03-06 10:42:22 +08:00
pipeline [pipeline] Add Simplified Alpa DP Partition (#2507) 2023-03-07 10:34:31 +08:00
registry Remove duplication registry (#1078) 2022-06-08 07:47:24 +08:00
tensor [DTensor] refactor dtensor with new components (#3089) 2023-03-14 16:25:47 +08:00
testing [amp] add gradient clipping for unit tests (#2283) 2023-01-04 11:59:56 +08:00
trainer [polish] remove useless file _mem_tracer_hook.py (#1963) 2022-11-16 15:55:10 +08:00
utils [lazyinit] refactor lazy tensor and lazy init ctx (#3131) 2023-03-14 15:37:12 +08:00
zero [doc] add deepspeed citation and copyright (#2996) 2023-03-04 20:08:11 +08:00
__init__.py [setup] supported conda-installed torch (#2048) 2022-11-30 16:45:15 +08:00
constants.py updated tp layers 2022-11-02 12:19:38 +08:00
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2022-06-27 09:45:26 +08:00
global_variables.py updated tp layers 2022-11-02 12:19:38 +08:00
initialize.py Fix False warning in initialize.py (#2456) 2023-01-12 13:49:01 +08:00