ColossalAI/colossalai/utils
ver217 3c9cd5bb5e
[zero] stateful tensor manager (#687)
* [WIP] stateful tensor manager

* add eviction strategy

* polish code

* polish code

* polish comment

* add unit test

* fix sampler bug

* polish code

* fix max sampling cnt resetting bug

* fix sampler bug

* polish code

* fix bug

* fix unit test

Co-authored-by: jiaruifang <fangjiarui123@gmail.com>
2022-04-08 17:51:34 +08:00
data_sampler Refactored docstring to google style 2022-03-29 17:17:47 +08:00
gradient_accumulation Refactored docstring to google style 2022-03-29 17:17:47 +08:00
memory_tracer [zero] stateful tensor manager (#687) 2022-04-08 17:51:34 +08:00
memory_utils [refactor] memory utils (#577) 2022-04-01 09:22:33 +08:00
multi_tensor_apply Refactored docstring to google style 2022-03-29 17:17:47 +08:00
profiler polish utils docstring (#620) 2022-04-01 16:36:47 +08:00
tensor_detector Refactored docstring to google style 2022-03-29 17:17:47 +08:00
__init__.py [model checkpoint] updated checkpoint save/load utils (#592) 2022-04-01 16:49:21 +08:00
activation_checkpoint.py [hotfix] fix bugs in testing (#659) 2022-04-02 21:58:47 +08:00
checkpointing.py polish checkpoint docstring (#637) 2022-04-02 13:34:33 +08:00
common.py moved ensure_path_exists to utils.common (#591) 2022-04-01 09:46:33 +08:00
cuda.py Fixed docstring in colossalai (#171) 2022-01-21 10:44:30 +08:00
moe.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
timer.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00