Commit Graph

53 Commits (3abf98a6337ae39f11b3c259a0af8d40477fe7f7)

Author SHA1 Message Date
binmakeswell 1c9ec32734
[NFC] add OPT serving (#1581) 2022-09-09 16:56:45 +08:00
Sze-qq 3b6a5e2593
update OPT experiment result for 8 GPUs (#1503) 2022-08-26 15:09:13 +08:00
fastalgo 0f438d15ee
Update README.md 2022-08-23 06:58:12 +08:00
Sze-qq 1750d6f573
[doc] update readme with the new xTrimoMultimer project (#1477)
* update xTrimoMultimer project

* update xTrimoMultimer project

* latest update of xTrimoMultimer project info
2022-08-22 20:53:14 +08:00
Jiarui Fang 4f5f8f77d1
update nvme on readme (#1397) 2022-08-02 11:39:37 +08:00
fastalgo db89600cf2
Update README.md 2022-07-30 22:11:07 +08:00
binmakeswell 92b0b139eb
[NFC] add OPT (#1345) 2022-07-20 15:02:07 +08:00
fastalgo 7857fd7616
Update README.md 2022-07-16 19:00:59 -07:00
Sze-qq 49114d8df0 update GPT-3 visualisation 2022-07-12 15:50:32 +08:00
Frank Lee 51f1ec96b0
[workflow] polish readme and dockerfile (#1165)
* [workflow] polish readme and dockerfile

* polish
2022-06-23 15:12:15 +08:00
binmakeswell 626dd187e4
add inference submodule (#1047) 2022-05-31 19:57:39 +08:00
binmakeswell 0dac86866b
[NFC] add inference (#1044) 2022-05-30 23:06:49 +08:00
Frank Lee 8d06186ff9
[doc] update docker instruction (#1020) 2022-05-24 17:51:50 +08:00
binmakeswell 9833d814d5 [NFC] fix paper link 2022-05-21 18:34:36 +08:00
binmakeswell c27ea0d980
fix download link (#998) 2022-05-18 18:05:18 +08:00
binmakeswell 7471f97fc3
update results on a single GPU, highlight quick view (#981) 2022-05-16 21:14:35 +08:00
binmakeswell deaf99f4c9
[readme] sync CN readme (#766) 2022-04-14 21:04:51 +08:00
Jiarui Fang 1f698f4406
[readme] polish readme (#764)
* [readme] polish readme

* centering image
2022-04-14 17:34:08 +08:00
binmakeswell 600e769a42
add video (#732) 2022-04-12 13:41:56 +08:00
Frank Lee 6f7d1362c9
[doc] removed outdated installation command (#730) 2022-04-12 11:56:45 +08:00
binmakeswell 896ade15d6
add PaLM link (#704) (#705) 2022-04-08 18:42:12 +08:00
binmakeswell 270157e9e7
add PaLM link (#704)
* add PaLM link
2022-04-08 18:26:59 +08:00
Sze-qq ce8a3eae5b
update GPT-2 experiment result (#666) 2022-04-04 13:47:43 +08:00
Jie Zhu 73d36618a6
[profiler] add MemProfiler (#356)
* add memory trainer hook

* fix bug

* add memory trainer hook

* fix import bug

* fix import bug

* add trainer hook

* fix #370 git log bug

* modify `to_tensorboard` function to support better output

* remove useless output

* change the name of `MemProfiler`

* complete memory profiler

* replace error with warning

* finish trainer hook

* modify interface of MemProfiler

* modify `__init__.py` in profiler

* remove unnecessary pass statement

* add usage to doc string

* add usage to trainer hook

* new location to store temp data file
2022-03-29 12:48:34 +08:00
fastalgo a513164379
Update README.md (#514) 2022-03-25 12:12:05 +08:00
Sze-qq 7f5e4592eb
Update Experiment result about Colossal-AI with ZeRO (#479)
* [readme] add experimental visualisation regarding ColossalAI with ZeRO (#476)

* Hotfix/readme (#478)

* add experimental visualisation regarding ColossalAI with ZeRO

* adjust newly-added figure size
2022-03-21 16:34:07 +08:00
Frank Lee 4f85b687cf
[misc] replace codebeat with codefactor on readme (#436) 2022-03-16 17:43:52 +08:00
Frank Lee 62b08acc72
update hf badge link (#410) 2022-03-14 17:07:01 +08:00
Frank Lee cf92a779dc
added huggingface badge (#407) 2022-03-14 16:23:02 +08:00
Frank Lee 6d3a4f51bf fixed broken badge link 2022-03-13 09:11:48 +08:00
binmakeswell ce7b2c9ae3 update README and images path (#384) 2022-03-11 15:50:28 +08:00
Shen Chenhui 1c88dd43e2 Fix/format (#366) 2022-03-11 15:50:28 +08:00
binmakeswell d275b98b7d add badge and contributor list 2022-03-11 15:50:28 +08:00
binmakeswell 08eccfe681 add community group and update issue template(#271) 2022-03-11 15:50:28 +08:00
Sze-qq 3312d716a0 update experimental visualization (#253) 2022-03-11 15:50:28 +08:00
binmakeswell 753035edd3 add Chinese README 2022-03-11 15:50:28 +08:00
Frank Lee eb3fda4c28 updated readme and change log (#224) 2022-02-15 11:31:13 +08:00
ver217 578ea0583b update setup and workflow (#222) 2022-02-15 11:31:13 +08:00
Frank Lee 02f13fa9d1
add code quality badge (#201) 2022-02-03 14:01:09 +08:00
Frank Lee 812357d63c
fixed utils docstring and add example to readme (#200) 2022-02-03 11:37:17 +08:00
BoxiangW a2f1565672
Update GitHub action and pre-commit settings (#196)
* Update GitHub action and pre-commit settings

* Update GitHub action and pre-commit settings (#198)
2022-01-28 16:59:53 +08:00
Frank Lee a2e649da39
update readme (#168) 2022-01-20 13:26:38 +08:00
BoxiangW bd4840f1f1
Update workflow files and README.md (#166) 2022-01-19 20:15:14 +08:00
Frank Lee be85a0f366 removed tutorial markdown and refreshed rst files for consistency 2022-01-19 17:01:37 +08:00
binmakeswell 17ce8569a8
add logo at homepage, add forum in issue template (#161) 2022-01-19 14:29:31 +08:00
Frank Lee a1da3900c8
added docker documentation (#152) 2022-01-18 13:35:18 +08:00
Frank Lee 35813ed3c4
update examples and sphnix docs for the new api (#63) 2021-12-13 22:07:01 +08:00
Frank Lee 9a0466534c
update markdown docs (english) (#60) 2021-12-10 14:37:33 +08:00
Frank Lee 3defa32aee
Support TP-compatible Torch AMP and Update trainer API (#27)
* Add gradient accumulation, fix lr scheduler

* fix FP16 optimizer and adapted torch amp with tensor parallel (#18)

* fixed bugs in compatibility between torch amp and tensor parallel and performed some minor fixes

* fixed trainer

* Revert "fixed trainer"

This reverts commit 2e0b0b7699.

* improved consistency between trainer, engine and schedule (#23)

Co-authored-by: 1SAA <c2h214748@gmail.com>

Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: ver217 <lhx0217@gmail.com>
2021-11-18 19:45:06 +08:00
binmakeswell 05e7069a5b fixed some typos in the documents, added blog link and paper author information in README 2021-11-03 17:18:43 +08:00