ColossalAI/colossalai/utils
HELSON 1468e4bcfc
[zero] add constant placement policy (#1705)
* fixes memory leak when paramter is in fp16 in ZeroDDP init.
* bans chunk releasement in CUDA. Only when a chunk is about to offload, it is allowed to release.
* adds a constant placement policy. With it, users can allocate a reserved caching memory space for parameters.
2022-10-14 17:53:16 +08:00
..
checkpoint [hotfix] fix a running error in test_colo_checkpoint.py (#1387) 2022-07-29 15:58:06 +08:00
data_sampler Refactored docstring to google style 2022-03-29 17:17:47 +08:00
model [zero] add constant placement policy (#1705) 2022-10-14 17:53:16 +08:00
multi_tensor_apply [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style (#1559) 2022-09-08 22:04:34 +08:00
profiler [hotfix] remove potiential circle import (#1307) 2022-07-14 13:44:26 +08:00
rank_recorder [pipeline/rank_recorder] fix bug when process data before backward | add a tool for multiple ranks debug (#1681) 2022-10-09 17:32:57 +08:00
tensor_detector [NFC] polish utils/tensor_detector/__init__.py code style (#1573) 2022-09-08 22:11:04 +08:00
__init__.py [refactory] add nn.parallel module (#1068) 2022-06-06 15:34:41 +08:00
activation_checkpoint.py [utils] Add use_reetrant=False in utils.activation_checkpoint (#1460) 2022-08-16 15:39:20 +08:00
checkpointing.py [utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548) 2022-09-06 20:18:35 +08:00
common.py [test] fixed the activation codegen test (#1447) 2022-08-12 14:52:31 +08:00
cuda.py [refactor] refactor the memory utils (#715) 2022-04-11 16:47:57 +08:00
memory.py [gemini] APIs to set cpu memory capacity (#809) 2022-04-19 16:05:22 +08:00
moe.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
timer.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00