ColossalAI/colossalai
ver217 3c9cd5bb5e
[zero] stateful tensor manager (#687)
* [WIP] stateful tensor manager

* add eviction strategy

* polish code

* polish code

* polish comment

* add unit test

* fix sampler bug

* polish code

* fix max sampling cnt resetting bug

* fix sampler bug

* polish code

* fix bug

* fix unit test

Co-authored-by: jiaruifang <fangjiarui123@gmail.com>
2022-04-08 17:51:34 +08:00
..
amp fix format (#570) 2022-04-06 11:40:59 +08:00
builder [NFC] polish colossalai/builder/builder.py code style (#662) 2022-04-06 11:40:59 +08:00
communication [NFC] polish colossalai/communication/utils.py code style (#656) 2022-04-06 11:40:59 +08:00
context [NFC] polish colossalai/context/process_group_initializer/initializer_sequence.py colossalai/context/process_group_initializer initializer_tensor.py code style (#639) 2022-04-06 11:40:59 +08:00
engine [zero] stateful tensor manager (#687) 2022-04-08 17:51:34 +08:00
kernel [NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_adam.cu code style (#667) 2022-04-06 11:40:59 +08:00
logging Refactored docstring to google style 2022-03-29 17:17:47 +08:00
nn fix bugs in CPU adam (#633) 2022-04-02 17:04:05 +08:00
registry Refactored docstring to google style 2022-03-29 17:17:47 +08:00
testing [test] fixed rerun_on_exception and adapted test cases (#487) 2022-03-25 17:25:12 +08:00
trainer [pipeline] refactor pipeline (#679) 2022-04-07 15:54:14 +08:00
utils [zero] stateful tensor manager (#687) 2022-04-08 17:51:34 +08:00
zero [zero] stateful tensor manager (#687) 2022-04-08 17:51:34 +08:00
__init__.py Develop/experiments (#59) 2021-12-09 15:08:29 +08:00
constants.py fix format constants.py (#358) 2022-03-11 15:50:28 +08:00
core.py [polish] polish singleton and global context (#500) 2022-03-23 18:03:39 +08:00
global_variables.py [MOE] add unitest for MOE experts layout, gradient handler and kernel (#469) 2022-03-21 13:35:04 +08:00
initialize.py [pipeline] refactor pipeline (#679) 2022-04-07 15:54:14 +08:00