Hongxin Liu
56fe130b15
[hotfix] fix lora load ( #6231 )
...
* [hotfix] fix lora load
* [hotfix] fix hp load
* accelerate deepseek loading
2025-03-01 19:04:14 +08:00
Hongxin Liu
014837e725
[shardformer] support pipeline for deepseek v3 and optimize lora save ( #6188 )
...
* [shardformer] support pipeline for deepseek v3
* [checkpointio] fix lora save
* [devops] update ci env
* [booster] optimize lora
* fix test
* fix test
2025-02-14 14:48:54 +08:00
Hongxin Liu
079bf3cb26
[misc] update pre-commit and run all files ( #4752 )
...
* [misc] update pre-commit
* [misc] run pre-commit
* [misc] remove useless configuration files
* [misc] ignore cuda for clang-format
2023-09-19 14:20:26 +08:00
Hongxin Liu
807e01a4ba
[zero] hotfix master param sync ( #4618 )
...
* [zero] add method to update master params
* [zero] update zero plugin
* [plugin] update low level zero plugin
2023-09-05 15:04:02 +08:00
Frank Lee
73d3e4d309
[booster] implemented the torch ddd + resnet example ( #3232 )
...
* [booster] implemented the torch ddd + resnet example
* polish code
2023-03-27 10:24:14 +08:00