249 Commits (9df016fc4520a5a5c95a11ed04a8ac62bde039c4)

Author SHA1 Message Date
ver217 78fd31f9c1
[chatgpt] add precision option for colossalai (#3233)
2 years ago
Fazzie-Maqianli bd39877da4
support instrcut training (#3230)
2 years ago
Camille Zhong 9bc702ab48
[doc] update chatgpt doc paper link (#3229)
2 years ago
Fazzie-Maqianli bbac6760e5
fix torch version (#3225)
2 years ago
Fazzie-Maqianli fa97a9cab4
[chatgpt] unnify datasets (#3218)
2 years ago
Fazzie-Maqianli 4fd4bd9d9a
[chatgpt] support instuct training (#3216)
2 years ago
Yuanchen 9998d5ef64
[chatgpt]add reward model code for deberta (#3199)
2 years ago
Fazzie-Maqianli 1e1b9d2fea
[chatgpt]support llama (#3070)
2 years ago
pgzhang b429529365
[chatgpt] add supervised learning fine-tune code (#3183)
2 years ago
BlueRum 7548ca5a54
[chatgpt]Reward Model Training Process update (#3133)
2 years ago
ver217 1e58d31bb7
[chatgpt] fix trainer generate kwargs (#3166)
2 years ago
ver217 c474fda282
[chatgpt] fix ppo training hanging problem with gemini (#3162)
2 years ago
binmakeswell 3c01280a56
[doc] add community contribution guide (#3153)
2 years ago
BlueRum 23cd5e2ccf
[chatgpt]update ci (#3087)
2 years ago
BlueRum 68577fbc43
[chatgpt]Fix examples (#3116)
2 years ago
BlueRum 0672b5afac
[chatgpt] fix lora support for gpt (#3113)
2 years ago
hiko2MSP 191daf7411
[chatgpt] type miss of kwargs (#3107)
2 years ago
BlueRum c9dd036592
[chatgpt] fix lora save bug (#3099)
2 years ago
Fazzie-Maqianli 02ae80bf9c
[chatgpt]add flag of action mask in critic(#3086)
2 years ago
wenjunyang b51bfec357
[chatgpt] change critic input as state (#3042)
2 years ago
Fazzie-Maqianli c21b11edce
change nn to models (#3032)
2 years ago
github-actions[bot] e86d9bb2e1
[format] applied code formatting on changed files in pull request 3025 (#3026)
2 years ago
BlueRum 55dcd3051a
[chatgpt] fix readme (#3025)
2 years ago
LuGY 287d60499e
[chatgpt] Add saving ckpt callback for PPO (#2880)
2 years ago
BlueRum e588703454
[chatgpt]fix inference model load (#2988)
2 years ago
ver217 0ff8406b00
[chatgpt] allow shard init and display warning (#2986)
2 years ago
BlueRum f5ca0397dd
[chatgpt] fix lora gemini conflict in RM training (#2984)
2 years ago
ver217 19ad49fb3b
[chatgpt] making experience support dp (#2971)
2 years ago
BlueRum c9e27f0d1b
[chatgpt]fix lora bug (#2974)
2 years ago
BlueRum 82149e9d1b
[chatgpt] fix inference demo loading bug (#2969)
2 years ago
Fazzie-Maqianli bbf9c827c3
[ChatGPT] fix README (#2966)
2 years ago
binmakeswell b0a8766381
[doc] fix chatgpt inference typo (#2964)
2 years ago
BlueRum 489a9566af
[chatgpt]add inference example (#2944)
2 years ago
binmakeswell 8264cd7ef1
[doc] add env scope (#2933)
2 years ago
BlueRum 2e16f842a9
[chatgpt]support opt & gpt for rm training (#2876)
2 years ago
BlueRum 34ca324b0d
[chatgpt] Support saving ckpt in examples (#2846)
2 years ago
BlueRum 3eebc4dff7
[chatgpt] fix rm eval (#2829)
2 years ago
ver217 b6a108cb91
[chatgpt] add test checkpoint (#2797)
2 years ago
ver217 a619a190df
[chatgpt] update readme about checkpoint (#2792)
2 years ago
ver217 4ee311c026
[chatgpt] startegy add prepare method (#2766)
2 years ago
ver217 a88bc828d5
[chatgpt] disable shard init for colossalai (#2767)
2 years ago
BlueRum 613efebc5c
[chatgpt] support colossalai strategy to train rm (#2742)
2 years ago
BlueRum 648183a960
[chatgpt]fix train_rm bug with lora (#2741)
2 years ago
CH.Li 7aacfad8af
fix typo (#2721)
2 years ago
ver217 9c0943ecdb
[chatgpt] optimize generation kwargs (#2717)
2 years ago
binmakeswell d4d3387f45
[doc] add open-source contribution invitation (#2714)
2 years ago
binmakeswell 94f000515b
[doc] add Quick Preview (#2706)
2 years ago
binmakeswell 8408c852a6
[app] fix ChatGPT requirements (#2704)
2 years ago
ver217 1b34701027
[app] add chatgpt application (#2698)
2 years ago