ColossalAI/colossalai/zero
Jiarui Fang 00670c870e
[zero] bucketized tensor cpu gpu copy (#368)
3 years ago
init_ctx [zero] able to place params on cpu after zero init context (#365) 3 years ago
shard_utils [zero] able to place params on cpu after zero init context (#365) 3 years ago
sharded_model fix grad shape 3 years ago
sharded_optim [zero] bucketized tensor cpu gpu copy (#368) 3 years ago
sharded_param [zero] bucketized tensor cpu gpu copy (#368) 3 years ago
__init__.py added buffer sync to naive amp model wrapper (#291) 3 years ago