Frank Lee
6f7d1362c9
[doc] removed outdated installation command ( #730 )
3 years ago
binmakeswell
896ade15d6
add PaLM link ( #704 ) ( #705 )
3 years ago
binmakeswell
270157e9e7
add PaLM link ( #704 )
...
* add PaLM link
3 years ago
Sze-qq
ce8a3eae5b
update GPT-2 experiment result ( #666 )
3 years ago
Jie Zhu
73d36618a6
[profiler] add MemProfiler ( #356 )
...
* add memory trainer hook
* fix bug
* add memory trainer hook
* fix import bug
* fix import bug
* add trainer hook
* fix #370 git log bug
* modify `to_tensorboard` function to support better output
* remove useless output
* change the name of `MemProfiler`
* complete memory profiler
* replace error with warning
* finish trainer hook
* modify interface of MemProfiler
* modify `__init__.py` in profiler
* remove unnecessary pass statement
* add usage to doc string
* add usage to trainer hook
* new location to store temp data file
3 years ago
fastalgo
a513164379
Update README.md ( #514 )
3 years ago
Sze-qq
7f5e4592eb
Update Experiment result about Colossal-AI with ZeRO ( #479 )
...
* [readme] add experimental visualisation regarding ColossalAI with ZeRO (#476 )
* Hotfix/readme (#478 )
* add experimental visualisation regarding ColossalAI with ZeRO
* adjust newly-added figure size
3 years ago
Frank Lee
4f85b687cf
[misc] replace codebeat with codefactor on readme ( #436 )
3 years ago
Frank Lee
62b08acc72
update hf badge link ( #410 )
3 years ago
Frank Lee
cf92a779dc
added huggingface badge ( #407 )
3 years ago
Frank Lee
6d3a4f51bf
fixed broken badge link
3 years ago
binmakeswell
ce7b2c9ae3
update README and images path ( #384 )
3 years ago
Shen Chenhui
1c88dd43e2
Fix/format ( #366 )
3 years ago
binmakeswell
d275b98b7d
add badge and contributor list
3 years ago
binmakeswell
08eccfe681
add community group and update issue template( #271 )
3 years ago
Sze-qq
3312d716a0
update experimental visualization ( #253 )
3 years ago
binmakeswell
753035edd3
add Chinese README
3 years ago
Frank Lee
eb3fda4c28
updated readme and change log ( #224 )
3 years ago
ver217
578ea0583b
update setup and workflow ( #222 )
3 years ago
Frank Lee
02f13fa9d1
add code quality badge ( #201 )
3 years ago
Frank Lee
812357d63c
fixed utils docstring and add example to readme ( #200 )
3 years ago
BoxiangW
a2f1565672
Update GitHub action and pre-commit settings ( #196 )
...
* Update GitHub action and pre-commit settings
* Update GitHub action and pre-commit settings (#198 )
3 years ago
Frank Lee
a2e649da39
update readme ( #168 )
3 years ago
BoxiangW
bd4840f1f1
Update workflow files and README.md ( #166 )
3 years ago
Frank Lee
be85a0f366
removed tutorial markdown and refreshed rst files for consistency
3 years ago
binmakeswell
17ce8569a8
add logo at homepage, add forum in issue template ( #161 )
3 years ago
Frank Lee
a1da3900c8
added docker documentation ( #152 )
3 years ago
Frank Lee
35813ed3c4
update examples and sphnix docs for the new api ( #63 )
3 years ago
Frank Lee
9a0466534c
update markdown docs (english) ( #60 )
3 years ago
Frank Lee
3defa32aee
Support TP-compatible Torch AMP and Update trainer API ( #27 )
...
* Add gradient accumulation, fix lr scheduler
* fix FP16 optimizer and adapted torch amp with tensor parallel (#18 )
* fixed bugs in compatibility between torch amp and tensor parallel and performed some minor fixes
* fixed trainer
* Revert "fixed trainer"
This reverts commit 2e0b0b7699
.
* improved consistency between trainer, engine and schedule (#23 )
Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: ver217 <lhx0217@gmail.com>
3 years ago
binmakeswell
05e7069a5b
fixed some typos in the documents, added blog link and paper author information in README
3 years ago
ver217
3c7604ba30
update documentation
3 years ago
アマデウス
da2042f5c1
updated readme
3 years ago
zbian
404ecbdcc6
Migrated project
3 years ago