ColossalAI/colossalai/utils
Kirigaya Kazuto 3b2a59b0ba
[pipeline/rank_recorder] fix bug when process data before backward | add a tool for multiple ranks debug (#1681)
* [pipeline/tuning] improve dispatch performance both time and space cost

* [pipeline/converge] add interface for testing convergence

* [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style

* Update PipelineBase.py

* [pipeline/chimera] reconstruct PipelineBase and Worker to support more feasible custom schedule | finish Chimera

* [pipeline/chimera] test chimera | fix bug of initializing

* [pipeline/pytree] add pytree to process args and kwargs | provide  to process args and kwargs after forward
2022-10-09 17:32:57 +08:00
checkpoint [hotfix] fix a running error in test_colo_checkpoint.py (#1387) 2022-07-29 15:58:06 +08:00
data_sampler Refactored docstring to google style 2022-03-29 17:17:47 +08:00
model [hotfix] fix init context (#1543) 2022-09-06 11:45:08 +08:00
multi_tensor_apply [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style (#1559) 2022-09-08 22:04:34 +08:00
profiler [hotfix] remove potiential circle import (#1307) 2022-07-14 13:44:26 +08:00
rank_recorder [pipeline/rank_recorder] fix bug when process data before backward | add a tool for multiple ranks debug (#1681) 2022-10-09 17:32:57 +08:00
tensor_detector [NFC] polish utils/tensor_detector/__init__.py code style (#1573) 2022-09-08 22:11:04 +08:00
__init__.py [refactory] add nn.parallel module (#1068) 2022-06-06 15:34:41 +08:00
activation_checkpoint.py [utils] Add use_reetrant=False in utils.activation_checkpoint (#1460) 2022-08-16 15:39:20 +08:00
checkpointing.py [utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548) 2022-09-06 20:18:35 +08:00
common.py [test] fixed the activation codegen test (#1447) 2022-08-12 14:52:31 +08:00
cuda.py [refactor] refactor the memory utils (#715) 2022-04-11 16:47:57 +08:00
memory.py [gemini] APIs to set cpu memory capacity (#809) 2022-04-19 16:05:22 +08:00
moe.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
timer.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00