ColossalAI/colossalai/interface
Hongxin Liu 014837e725
[shardformer] support pipeline for deepseek v3 and optimize lora save (#6188)
* [shardformer] support pipeline for deepseek v3

* [checkpointio] fix lora save

* [devops] update ci env

* [booster] optimize lora

* fix test

* fix test
2025-02-14 14:48:54 +08:00
__init__.py    [misc] update pre-commit and run all files (#4752)                             2023-09-19 14:20:26 +08:00
model.py       [shardformer] support pipeline for deepseek v3 and optimize lora save (#6188)  2025-02-14 14:48:54 +08:00
optimizer.py   [Zerobubble] merge main. (#6142)                                               2024-11-19 19:00:36 +08:00
pretrained.py  [lazy] support from_pretrained (#4801)                                         2023-09-26 11:04:11 +08:00