_C | [setup] support pre-build and jit-build of cuda kernels (#2374) | 2023-01-06 20:50:26 +08:00 |
_analyzer | [FX] refactor experimental tracer and adapt it with hf models (#3157) | 2023-03-22 10:40:33 +08:00 |
amp | [test] fixed the triton version for testing (#2608) | 2023-02-07 13:49:38 +08:00 |
auto_parallel | [auto-parallel] add auto-offload feature (#3154) | 2023-03-21 14:17:41 +08:00 |
autochunk | [autochunk] support vit (#3084) | 2023-03-10 10:23:26 +08:00 |
booster | [booster] added the plugin base and torch ddp plugin (#3180) | 2023-03-21 17:39:30 +08:00 |
builder | [NFC] polish colossalai/builder/__init__.py code style (#1560) | 2022-09-08 22:11:04 +08:00 |
checkpoint_io | [api] implemented the checkpoint io module (#3205) | 2023-03-23 10:53:17 +08:00 |
cli | [cli] handled version check exceptions (#2848) | 2023-02-21 17:04:49 +08:00 |
cluster | [API] implement device mesh manager (#3221) | 2023-03-24 13:39:12 +08:00 |
communication | [NFC] polish communication/p2p_v2.py code style (#2303) | 2023-01-04 15:09:57 +08:00 |
context | [NFC] polish colossalai/context/process_group_initializer/initializer_2d.py code style (#2726) | 2023-02-15 22:27:13 +08:00 |
device | [hotfix] add copyright for solver and device mesh (#2803) | 2023-02-18 21:14:38 +08:00 |
engine | [doc] add deepspeed citation and copyright (#2996) | 2023-03-04 20:08:11 +08:00 |
fx | [analyzer] a minimal implementation of static graph analyzer (#2852) | 2023-03-10 13:21:05 +08:00 |
gemini | [doc] add deepspeed citation and copyright (#2996) | 2023-03-04 20:08:11 +08:00 |
kernel | updated flash attention usage | 2023-03-20 17:57:04 +08:00 |
logging | [logger] hotfix, missing _FORMAT (#2231) | 2022-12-29 22:59:39 +08:00 |
nn | [format] applied code formatting on changed files in pull request 2997 (#3008) | 2023-03-06 10:42:22 +08:00 |
pipeline | [pipeline] Add Simplified Alpa DP Partition (#2507) | 2023-03-07 10:34:31 +08:00 |
registry | Remove duplication registry (#1078) | 2022-06-08 07:47:24 +08:00 |
tensor | [hotfix] layout converting issue (#3188) | 2023-03-21 13:24:18 +08:00 |
testing | [amp] add gradient clipping for unit tests (#2283) | 2023-01-04 11:59:56 +08:00 |
trainer | [polish] remove useless file _mem_tracer_hook.py (#1963) | 2022-11-16 15:55:10 +08:00 |
utils | [lazyinit] combine lazy tensor with dtensor (#3204) | 2023-03-23 10:53:06 +08:00 |
zero | [zero] Refactor ZeroContextConfig class using dataclass (#3186) | 2023-03-21 12:36:47 +08:00 |
__init__.py | [setup] supported conda-installed torch (#2048) | 2022-11-30 16:45:15 +08:00 |
constants.py | updated tp layers | 2022-11-02 12:19:38 +08:00 |
core.py | [Tensor] distributed view supports inter-process hybrid parallel (#1169) | 2022-06-27 09:45:26 +08:00 |
global_variables.py | updated tp layers | 2022-11-02 12:19:38 +08:00 |
initialize.py | Fix False warning in initialize.py (#2456) | 2023-01-12 13:49:01 +08:00 |