ColossalAI/colossalai/checkpoint_io
Jun Gao a4489384d5
[shardformer] Fix serialization error with Tensor Parallel state saving (#5018)
* Fix serialization error with Tensor Parallel state saving

* Refactor state_dict CPU transfer using tree_map
2023-11-09 17:00:25 +08:00
__init__.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
checkpoint_io_base.py [shardformer] fix master param sync for hybrid plugin/rewrite unwrapping logic (#4758) 2023-09-20 18:29:37 +08:00
general_checkpoint_io.py [shardformer] fix master param sync for hybrid plugin/rewrite unwrapping logic (#4758) 2023-09-20 18:29:37 +08:00
hybrid_parallel_checkpoint_io.py [checkpointio] support unsharded checkpointIO for hybrid parallel (#4774) 2023-09-26 10:58:03 +08:00
index_file.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
utils.py [shardformer] Fix serialization error with Tensor Parallel state saving (#5018) 2023-11-09 17:00:25 +08:00