ver217
dbe62c67b8
add an example of ViT-B/16 and remove w_norm clipping in LAMB ( #29 )
2021-11-18 23:45:09 +08:00
Frank Lee
3defa32aee
Support TP-compatible Torch AMP and Update trainer API ( #27 )
...
* Add gradient accumulation, fix lr scheduler
* fix FP16 optimizer and adapted torch amp with tensor parallel (#18 )
* fixed bugs in compatibility between torch amp and tensor parallel and performed some minor fixes
* fixed trainer
* Revert "fixed trainer"
This reverts commit 2e0b0b7699
.
* improved consistency between trainer, engine and schedule (#23 )
Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: ver217 <lhx0217@gmail.com>
2021-11-18 19:45:06 +08:00
ver217
2b05de4c64
use env to control the language of doc ( #24 ) ( #25 )
2021-11-15 16:53:56 +08:00
ver217
9942fd5bfa
remove redundancy func in setup ( #19 ) ( #20 )
2021-11-15 16:43:28 +08:00
ver217
0aa07e600c
Merge pull request #15 from hpcaitech/feature/zhdoc
...
made some modifications to the documents
2021-11-04 14:26:28 +08:00
binmakeswell
05e7069a5b
fixed some typos in the documents, added blog link and paper author information in README
2021-11-03 17:18:43 +08:00
Frank Lee
ccb44882e1
Merge pull request #10 from hpcaitech/feature/zhdoc
...
added Chinese documents and fixed some typos in English documents
2021-11-03 11:38:06 +08:00
Fan Cui
18ba66e012
added Chinese documents and fixed some typos in English documents
2021-11-02 23:28:44 +08:00
Frank Lee
ccbc918c11
Merge pull request #4 from hpcaitech/hotfix/doc
...
reoder parallelization methods in parallelization documentation
2021-11-02 14:35:06 +08:00
ver217
50982c0b7d
reoder parallelization methods in parallelization documentation
2021-11-01 14:31:55 +08:00
ver217
3c7604ba30
update documentation
2021-10-29 09:29:20 +08:00
アマデウス
3245a69fc2
cleaned test scripts
2021-10-29 00:48:14 +08:00
アマデウス
da2042f5c1
updated readme
2021-10-29 00:39:21 +08:00
zbian
404ecbdcc6
Migrated project
2021-10-28 18:21:23 +02:00
アマデウス
2ebaefc542
Initial commit
2021-10-29 00:19:45 +08:00