ColossalAI/colossalai/zero/sharded_param
ver217 9506a8beb2 use double buffer to handle grad 2022-03-16 14:24:09 +08:00
..
__init__.py [zero] yet an improved sharded param (#311) 2022-03-11 15:50:28 +08:00
sharded_param.py use double buffer to handle grad 2022-03-16 14:24:09 +08:00
sharded_tensor.py [zero] find miss code (#378) 2022-03-11 15:50:28 +08:00