Commit Graph

18 Commits (191daf74111251df7327f4ef7a069d0254554d2c)

Author           SHA1        Message                                                    Date
hiko2MSP         191daf7411  [chatgpt] type miss of kwargs (#3107)                      2 years ago
BlueRum          c9dd036592  [chatgpt] fix lora save bug (#3099)                        2 years ago
Fazzie-Maqianli  02ae80bf9c  [chatgpt]add flag of action mask in critic(#3086)          2 years ago
wenjunyang       b51bfec357  [chatgpt] change critic input as state (#3042)             2 years ago
Fazzie-Maqianli  c21b11edce  change nn to models (#3032)                                2 years ago
LuGY             287d60499e  [chatgpt] Add saving ckpt callback for PPO (#2880)         2 years ago
ver217           0ff8406b00  [chatgpt] allow shard init and display warning (#2986)     2 years ago
BlueRum          f5ca0397dd  [chatgpt] fix lora gemini conflict in RM training (#2984)  2 years ago
ver217           19ad49fb3b  [chatgpt] making experience support dp (#2971)             2 years ago
BlueRum          c9e27f0d1b  [chatgpt]fix lora bug (#2974)                              2 years ago
BlueRum          2e16f842a9  [chatgpt]support opt & gpt for rm training (#2876)         2 years ago
BlueRum          3eebc4dff7  [chatgpt] fix rm eval (#2829)                              2 years ago
ver217           4ee311c026  [chatgpt] startegy add prepare method (#2766)              2 years ago
ver217           a88bc828d5  [chatgpt] disable shard init for colossalai (#2767)        2 years ago
BlueRum          613efebc5c  [chatgpt] support colossalai strategy to train rm (#2742)  2 years ago
BlueRum          648183a960  [chatgpt]fix train_rm bug with lora (#2741)                2 years ago
ver217           9c0943ecdb  [chatgpt] optimize generation kwargs (#2717)               2 years ago
ver217           1b34701027  [app] add chatgpt application (#2698)                      2 years ago