binmakeswell
1c9ec32734
[NFC] add OPT serving ( #1581 )
2022-09-09 16:56:45 +08:00
Sze-qq
3b6a5e2593
update OPT experiment result for 8 GPUs ( #1503 )
2022-08-26 15:09:13 +08:00
fastalgo
0f438d15ee
Update README.md
2022-08-23 06:58:12 +08:00
Sze-qq
1750d6f573
[doc] update readme with the new xTrimoMultimer project ( #1477 )
...
* update xTrimoMultimer project
* update xTrimoMultimer project
* latest update of xTrimoMultimer project info
2022-08-22 20:53:14 +08:00
Jiarui Fang
4f5f8f77d1
update nvme on readme ( #1397 )
2022-08-02 11:39:37 +08:00
fastalgo
db89600cf2
Update README.md
2022-07-30 22:11:07 +08:00
binmakeswell
92b0b139eb
[NFC] add OPT ( #1345 )
2022-07-20 15:02:07 +08:00
fastalgo
7857fd7616
Update README.md
2022-07-16 19:00:59 -07:00
Sze-qq
49114d8df0
update GPT-3 visualisation
2022-07-12 15:50:32 +08:00
Frank Lee
51f1ec96b0
[workflow] polish readme and dockerfile ( #1165 )
...
* [workflow] polish readme and dockerfile
* polish
2022-06-23 15:12:15 +08:00
binmakeswell
626dd187e4
add inference submodule ( #1047 )
2022-05-31 19:57:39 +08:00
binmakeswell
0dac86866b
[NFC] add inference ( #1044 )
2022-05-30 23:06:49 +08:00
Frank Lee
8d06186ff9
[doc] update docker instruction ( #1020 )
2022-05-24 17:51:50 +08:00
binmakeswell
9833d814d5
[NFC] fix paper link
2022-05-21 18:34:36 +08:00
binmakeswell
c27ea0d980
fix download link ( #998 )
2022-05-18 18:05:18 +08:00
binmakeswell
7471f97fc3
update results on a single GPU, highlight quick view ( #981 )
2022-05-16 21:14:35 +08:00
binmakeswell
deaf99f4c9
[readme] sync CN readme ( #766 )
2022-04-14 21:04:51 +08:00
Jiarui Fang
1f698f4406
[readme] polish readme ( #764 )
...
* [readme] polish readme
* centering image
2022-04-14 17:34:08 +08:00
binmakeswell
600e769a42
add video ( #732 )
2022-04-12 13:41:56 +08:00
Frank Lee
6f7d1362c9
[doc] removed outdated installation command ( #730 )
2022-04-12 11:56:45 +08:00
binmakeswell
896ade15d6
add PaLM link ( #704 ) ( #705 )
2022-04-08 18:42:12 +08:00
binmakeswell
270157e9e7
add PaLM link ( #704 )
...
* add PaLM link
2022-04-08 18:26:59 +08:00
Sze-qq
ce8a3eae5b
update GPT-2 experiment result ( #666 )
2022-04-04 13:47:43 +08:00
Jie Zhu
73d36618a6
[profiler] add MemProfiler ( #356 )
...
* add memory trainer hook
* fix bug
* add memory trainer hook
* fix import bug
* fix import bug
* add trainer hook
* fix #370 git log bug
* modify `to_tensorboard` function to support better output
* remove useless output
* change the name of `MemProfiler`
* complete memory profiler
* replace error with warning
* finish trainer hook
* modify interface of MemProfiler
* modify `__init__.py` in profiler
* remove unnecessary pass statement
* add usage to doc string
* add usage to trainer hook
* new location to store temp data file
2022-03-29 12:48:34 +08:00
fastalgo
a513164379
Update README.md ( #514 )
2022-03-25 12:12:05 +08:00
Sze-qq
7f5e4592eb
Update Experiment result about Colossal-AI with ZeRO ( #479 )
...
* [readme] add experimental visualisation regarding ColossalAI with ZeRO (#476 )
* Hotfix/readme (#478 )
* add experimental visualisation regarding ColossalAI with ZeRO
* adjust newly-added figure size
2022-03-21 16:34:07 +08:00
Frank Lee
4f85b687cf
[misc] replace codebeat with codefactor on readme ( #436 )
2022-03-16 17:43:52 +08:00
Frank Lee
62b08acc72
update hf badge link ( #410 )
2022-03-14 17:07:01 +08:00
Frank Lee
cf92a779dc
added huggingface badge ( #407 )
2022-03-14 16:23:02 +08:00
Frank Lee
6d3a4f51bf
fixed broken badge link
2022-03-13 09:11:48 +08:00
binmakeswell
ce7b2c9ae3
update README and images path ( #384 )
2022-03-11 15:50:28 +08:00
Shen Chenhui
1c88dd43e2
Fix/format ( #366 )
2022-03-11 15:50:28 +08:00
binmakeswell
d275b98b7d
add badge and contributor list
2022-03-11 15:50:28 +08:00
binmakeswell
08eccfe681
add community group and update issue template( #271 )
2022-03-11 15:50:28 +08:00
Sze-qq
3312d716a0
update experimental visualization ( #253 )
2022-03-11 15:50:28 +08:00
binmakeswell
753035edd3
add Chinese README
2022-03-11 15:50:28 +08:00
Frank Lee
eb3fda4c28
updated readme and change log ( #224 )
2022-02-15 11:31:13 +08:00
ver217
578ea0583b
update setup and workflow ( #222 )
2022-02-15 11:31:13 +08:00
Frank Lee
02f13fa9d1
add code quality badge ( #201 )
2022-02-03 14:01:09 +08:00
Frank Lee
812357d63c
fixed utils docstring and add example to readme ( #200 )
2022-02-03 11:37:17 +08:00
BoxiangW
a2f1565672
Update GitHub action and pre-commit settings ( #196 )
...
* Update GitHub action and pre-commit settings
* Update GitHub action and pre-commit settings (#198 )
2022-01-28 16:59:53 +08:00
Frank Lee
a2e649da39
update readme ( #168 )
2022-01-20 13:26:38 +08:00
BoxiangW
bd4840f1f1
Update workflow files and README.md ( #166 )
2022-01-19 20:15:14 +08:00
Frank Lee
be85a0f366
removed tutorial markdown and refreshed rst files for consistency
2022-01-19 17:01:37 +08:00
binmakeswell
17ce8569a8
add logo at homepage, add forum in issue template ( #161 )
2022-01-19 14:29:31 +08:00
Frank Lee
a1da3900c8
added docker documentation ( #152 )
2022-01-18 13:35:18 +08:00
Frank Lee
35813ed3c4
update examples and sphnix docs for the new api ( #63 )
2021-12-13 22:07:01 +08:00
Frank Lee
9a0466534c
update markdown docs (english) ( #60 )
2021-12-10 14:37:33 +08:00
Frank Lee
3defa32aee
Support TP-compatible Torch AMP and Update trainer API ( #27 )
...
* Add gradient accumulation, fix lr scheduler
* fix FP16 optimizer and adapted torch amp with tensor parallel (#18 )
* fixed bugs in compatibility between torch amp and tensor parallel and performed some minor fixes
* fixed trainer
* Revert "fixed trainer"
This reverts commit 2e0b0b7699
.
* improved consistency between trainer, engine and schedule (#23 )
Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: ver217 <lhx0217@gmail.com>
2021-11-18 19:45:06 +08:00
binmakeswell
05e7069a5b
fixed some typos in the documents, added blog link and paper author information in README
2021-11-03 17:18:43 +08:00