ColossalAI

History

HELSON 1468e4bcfc [zero] add constant placement policy (#1705) * fixes memory leak when parameter is in fp16 in ZeroDDP init. * bans chunk release in CUDA. A chunk may be released only when it is about to be offloaded. * adds a constant placement policy. With it, users can allocate a reserved caching memory space for parameters.		2022-10-14 17:53:16 +08:00
..
checkpoint	…
data_sampler	…
model	[zero] add constant placement policy (#1705)	2022-10-14 17:53:16 +08:00
multi_tensor_apply	[NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style (#1559)	2022-09-08 22:04:34 +08:00
profiler	…
rank_recorder	[pipeline/rank_recorder] fix bug when process data before backward | add a tool for multiple ranks debug (#1681)	2022-10-09 17:32:57 +08:00
tensor_detector	[NFC] polish utils/tensor_detector/__init__.py code style (#1573)	2022-09-08 22:11:04 +08:00
__init__.py	…
activation_checkpoint.py	[utils] Add use_reentrant=False in utils.activation_checkpoint (#1460)	2022-08-16 15:39:20 +08:00
checkpointing.py	[utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548)	2022-09-06 20:18:35 +08:00
common.py	…
cuda.py	…
memory.py	…
moe.py	…
timer.py	…