Guoteng
6b6295aea3
Feat add checkpoint fraction (#151)
* feat(config): add checkpoint_fraction into config
* feat: remove checkpoint_fraction from configs/7B_sft.py
---------
Co-authored-by: wangguoteng.p <wangguoteng925@qq.com>
2023-07-31 13:57:01 +08:00
huangting4201
26205c1edf
feat(solver): fix code to adapt to torch2.0 and provide docker images (#128)
* feat(solver): fix code to adapt to torch2.0
* docs(install.md): publish internlm environment image
* docs(install.md): update dependency packages version
* docs(install.md): update default image
---------
Co-authored-by: 黄婷 <huangting3@CN0014010744M.local>
2023-07-25 19:34:52 +08:00
huangting4201
acea4554ec
docs(install.md): update dependency package transformers version to >= 4.28.0 (#124)
Co-authored-by: 黄婷 <huangting3@CN0014010744M.local>
2023-07-24 11:33:26 +08:00
Yang Gao
555ed60a2c
fix(tokenizer): refactor tokenizer and update usage in readme (#51)
* update tokenizer example
2023-07-13 17:16:27 +08:00
Sun Peng
6150e4daed
fix/fix_submodule_err (#61)
* fix/fix_submodule_err
---------
Co-authored-by: ChenQiaoling00 <qiaoling_chen@u.nus.edu>
2023-07-12 18:59:31 +08:00
Sun Peng
c18bec9361
fix[performance]: fix the performance evaluation mistakes (#40)
* fix(no_pp_scheduler): drop out and label if not used
* Update train_performance.md
* Update readme with new tested data
* update some typos
* doc(performance): fix some typos
2023-07-08 20:42:34 +08:00
Sun Peng
4a3d15650e
fix(no_pp_scheduler): drop model out data and label if not used (#39)
* fix(no_pp_scheduler): drop out and label if not used
* Update train_performance.md
* Update readme with new tested data
* update some typos
2023-07-08 18:55:31 +08:00
Sun Peng
912fc8f8aa
doc: update the training examples (#27)
* doc: update the training examples
* update README
* change all "++++" log
* Update pylint
* solve lint err
2023-07-07 15:54:09 +08:00
Sun Peng
fa7337b37b
initial commit
2023-07-06 12:55:23 +08:00