.. |
amp
|
[doc] update rst and docstring (#1351)
|
2022-07-21 15:54:53 +08:00 |
auto_parallel
|
[autoparellel]add strategies constructor (#1505)
|
2022-08-30 16:32:09 +08:00 |
builder
|
[NFC] polish colossalai/builder/builder.py code style (#1265)
|
2022-07-13 12:08:21 +08:00 |
cli
|
[hotfix] fix some bugs caused by size mismatch. (#1011)
|
2022-05-23 14:02:28 +08:00 |
communication
|
[communication] add p2p_v2.py to support communication with List[Any] (#1407)
|
2022-08-09 11:40:04 +08:00 |
context
|
[doc] update rst and docstring (#1351)
|
2022-07-21 15:54:53 +08:00 |
device
|
[tensor]add 1D device mesh (#1492)
|
2022-08-25 16:48:12 +08:00 |
engine
|
[engin/schedule] use p2p_v2 to recontruct pipeline_schedule (#1408)
|
2022-08-12 11:33:26 +08:00 |
fx
|
[fx]patch nn.functional convolution (#1528)
|
2022-09-01 19:05:07 +08:00 |
gemini
|
[zero] add chunk_managerV2 for all-gather chunk (#1441)
|
2022-08-11 19:17:24 +08:00 |
kernel
|
[hotfix] fix CPUAdam kernel nullptr (#1410)
|
2022-08-05 19:45:45 +08:00 |
logging
|
[doc] improved docstring in the logging module (#861)
|
2022-04-25 13:42:00 +08:00 |
nn
|
[embedding] add tablewise sharding for FAW (#1526)
|
2022-09-01 17:55:41 +08:00 |
pipeline
|
[pipeline/pipleline_process_group] finish PipelineProcessGroup to manage local abd global rank in TP,DP and PP (#1508)
|
2022-09-01 17:45:47 +08:00 |
registry
|
Remove duplication registry (#1078)
|
2022-06-08 07:47:24 +08:00 |
tensor
|
[tensor]add 1D device mesh (#1492)
|
2022-08-25 16:48:12 +08:00 |
testing
|
[test] skip tests when not enough GPUs are detected (#1090)
|
2022-06-09 17:19:13 +08:00 |
trainer
|
fix issue #1080 (#1071)
|
2022-06-07 17:21:11 +08:00 |
utils
|
[utils] Add use_reetrant=False in utils.activation_checkpoint (#1460)
|
2022-08-16 15:39:20 +08:00 |
zero
|
[utils] Impl clip_grad_norm for ColoTensor and ZeroOptimizer (#1442)
|
2022-08-11 22:58:58 +08:00 |
__init__.py
|
[NFC] polish colossalai/__init__.py code style (#1285)
|
2022-07-13 12:08:21 +08:00 |
constants.py
|
fix typo in constants (#1027)
|
2022-05-26 08:45:08 +08:00 |
core.py
|
[Tensor] distributed view supports inter-process hybrid parallel (#1169)
|
2022-06-27 09:45:26 +08:00 |
global_variables.py
|
[MOE] add unitest for MOE experts layout, gradient handler and kernel (#469)
|
2022-03-21 13:35:04 +08:00 |
initialize.py
|
[hotfix] remove potiential circle import (#1307)
|
2022-07-14 13:44:26 +08:00 |