Commit Graph

9 Commits (58108413bde753f56363815e2327311ff9db9d3b)

Author SHA1 Message Date
Guoteng 6b6295aea3
Feat add checkpoint fraction (#151)
* feat(config): add checkpoint_fraction into config

* feat: remove checkpoint_fraction from configs/7B_sft.py

---------

Co-authored-by: wangguoteng.p <wangguoteng925@qq.com>
2023-07-31 13:57:01 +08:00
huangting4201 26205c1edf
feat(solver): fix code to adapt to torch2.0 and provide docker images (#128)
* feat(solver): fix code to adapt to torch2.0

* docs(install.md): publish internlm environment image

* docs(install.md): update dependency packages version

* docs(install.md): update default image

---------

Co-authored-by: 黄婷 <huangting3@CN0014010744M.local>
2023-07-25 19:34:52 +08:00
huangting4201 acea4554ec
docs(install.md): update dependency package transformers version to >= 4.28.0 (#124)
Co-authored-by: 黄婷 <huangting3@CN0014010744M.local>
2023-07-24 11:33:26 +08:00
Yang Gao 555ed60a2c
fix(tokenizer): refactor tokenizer and update usage in readme (#51)
* update tokenizer example
2023-07-13 17:16:27 +08:00
Sun Peng 6150e4daed
fix/fix_submodule_err (#61)
* fix/fix_submodule_err

---------

Co-authored-by: ChenQiaoling00 <qiaoling_chen@u.nus.edu>
2023-07-12 18:59:31 +08:00
Sun Peng c18bec9361
fix[performance]: fix the performance evaluation mistakes (#40)
* fix(no_pp_scheduler): drop out and label if not used

* Update train_performance.md

* Update readme with new tested data

* update some typos

* doc(performance): fix some typos
2023-07-08 20:42:34 +08:00
Sun Peng 4a3d15650e
fix(no_pp_scheduler): drop model out data and label if not used (#39)
* fix(no_pp_scheduler): drop out and label if not used

* Update train_performance.md

* Update readme with new tested data

* update some typos
2023-07-08 18:55:31 +08:00
Sun Peng 912fc8f8aa
doc: update the training examples (#27)
* doc: update the training examples

* update README

* change all "++++" log

* Update pylint

* solve lint err
2023-07-07 15:54:09 +08:00
Sun Peng fa7337b37b
initial commit
2023-07-06 12:55:23 +08:00