Commit Graph

20 Commits (3e05c07bb8921f2a8f9736b6f6673d4e9f1697d0)

Author SHA1 Message Date
Wenhao Chen 901ab1eedd
[chat]: add lora merge weights config (#4766)
1 year ago
Wenhao Chen 7b9b86441f
[chat]: update rm, add wandb and fix bugs (#4471)
1 year ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752)
1 year ago
yingliu-hpc 1467e3b41b
[coati] add chatglm model (#4539)
1 year ago
Wenhao Chen da4f7b855f
[chat] fix bugs and add unit tests (#4213)
1 year ago
Yuanchen 5187c96b7c
support session-based training (#4313)
1 year ago
Wenhao Chen 3d8d5d0d58
[chat] use official transformers and fix some issues (#4117)
1 year ago
Wenhao Chen edd75a59ea
[chat] remove naive strategy and split colossalai strategy (#4094)
1 year ago
Wenhao Chen b03d64d010
[chat] refactor trainer class (#4080)
1 year ago
Wenhao Chen 153b957a1b
[chat] refactor strategy class with booster api (#3987)
1 year ago
digger-yu 65bdc3159f
fix some spelling error with applications/Chat/examples/ (#3692)
2 years ago
tanitna 1a60dc07a8
[chat] typo accimulation_steps -> accumulation_steps (#3662)
2 years ago
binmakeswell 268b3cd80d
[chat] set default zero2 strategy (#3667)
2 years ago
Hongxin Liu 842768a174
[chat] refactor model save/load logic (#3654)
2 years ago
Hongxin Liu 6ef7011462
[chat] remove lm model class (#3653)
2 years ago
Hongxin Liu 2a951955ad
[chat] refactor trainer (#3648)
2 years ago
tingfeng cao 7788e0b0a5
fix: fix sft (#3568)
2 years ago
ver217 89fd10a1c9
[chat] add zero2 cpu strategy for sft training (#3520)
2 years ago
Camille Zhong 72cb4dd433
[Chat] fix the tokenizer "int too big to convert" error in SFT training (#3453)
2 years ago
Fazzie-Maqianli b0ce5a1032
[Coati] first commit (#3283)
2 years ago