ColossalAI/colossalai/utils
HELSON 1468e4bcfc
[zero] add constant placement policy (#1705)
* fixes memory leak when paramter is in fp16 in ZeroDDP init.
* bans chunk releasement in CUDA. Only when a chunk is about to offload, it is allowed to release.
* adds a constant placement policy. With it, users can allocate a reserved caching memory space for parameters.
2022-10-14 17:53:16 +08:00
..
checkpoint
data_sampler
model [zero] add constant placement policy (#1705) 2022-10-14 17:53:16 +08:00
multi_tensor_apply [NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style (#1559) 2022-09-08 22:04:34 +08:00
profiler
rank_recorder [pipeline/rank_recorder] fix bug when process data before backward | add a tool for multiple ranks debug (#1681) 2022-10-09 17:32:57 +08:00
tensor_detector [NFC] polish utils/tensor_detector/__init__.py code style (#1573) 2022-09-08 22:11:04 +08:00
__init__.py
activation_checkpoint.py [utils] Add use_reetrant=False in utils.activation_checkpoint (#1460) 2022-08-16 15:39:20 +08:00
checkpointing.py [utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548) 2022-09-06 20:18:35 +08:00
common.py
cuda.py
memory.py
moe.py
timer.py