43 Commits (14e5b11d7f0a5f427a5c0b6de3075b41c3f736e2)

Author SHA1 Message Date
binmakeswell 626dd187e4
add inference submodule (#1047) 2022-05-31 19:57:39 +08:00
binmakeswell 0dac86866b
[NFC] add inference (#1044) 2022-05-30 23:06:49 +08:00
Frank Lee 8d06186ff9
[doc] update docker instruction (#1020) 2022-05-24 17:51:50 +08:00
binmakeswell 9833d814d5
[NFC] fix paper link 2022-05-21 18:34:36 +08:00
binmakeswell c27ea0d980
fix download link (#998) 2022-05-18 18:05:18 +08:00
binmakeswell 7471f97fc3
update results on a single GPU, highlight quick view (#981) 2022-05-16 21:14:35 +08:00
binmakeswell deaf99f4c9
[readme] sync CN readme (#766) 2022-04-14 21:04:51 +08:00
Jiarui Fang 1f698f4406
[readme] polish readme (#764)
* [readme] polish readme

* centering image
2022-04-14 17:34:08 +08:00
binmakeswell 600e769a42
add video (#732) 2022-04-12 13:41:56 +08:00
Frank Lee 6f7d1362c9
[doc] removed outdated installation command (#730) 2022-04-12 11:56:45 +08:00
binmakeswell 896ade15d6
add PaLM link (#704) (#705) 2022-04-08 18:42:12 +08:00
binmakeswell 270157e9e7
add PaLM link (#704)
* add PaLM link
2022-04-08 18:26:59 +08:00
Sze-qq ce8a3eae5b
update GPT-2 experiment result (#666) 2022-04-04 13:47:43 +08:00
Jie Zhu 73d36618a6
[profiler] add MemProfiler (#356)
* add memory trainer hook

* fix bug

* add memory trainer hook

* fix import bug

* fix import bug

* add trainer hook

* fix #370 git log bug

* modify `to_tensorboard` function to support better output

* remove useless output

* change the name of `MemProfiler`

* complete memory profiler

* replace error with warning

* finish trainer hook

* modify interface of MemProfiler

* modify `__init__.py` in profiler

* remove unnecessary pass statement

* add usage to doc string

* add usage to trainer hook

* new location to store temp data file
2022-03-29 12:48:34 +08:00
fastalgo a513164379
Update README.md (#514) 2022-03-25 12:12:05 +08:00
Sze-qq 7f5e4592eb
Update Experiment result about Colossal-AI with ZeRO (#479)
* [readme] add experimental visualisation regarding ColossalAI with ZeRO (#476)

* Hotfix/readme (#478)

* add experimental visualisation regarding ColossalAI with ZeRO

* adjust newly-added figure size
2022-03-21 16:34:07 +08:00
Frank Lee 4f85b687cf
[misc] replace codebeat with codefactor on readme (#436) 2022-03-16 17:43:52 +08:00
Frank Lee 62b08acc72
update hf badge link (#410) 2022-03-14 17:07:01 +08:00
Frank Lee cf92a779dc
added huggingface badge (#407) 2022-03-14 16:23:02 +08:00
Frank Lee 6d3a4f51bf
fixed broken badge link 2022-03-13 09:11:48 +08:00
binmakeswell ce7b2c9ae3
update README and images path (#384) 2022-03-11 15:50:28 +08:00
Shen Chenhui 1c88dd43e2
Fix/format (#366) 2022-03-11 15:50:28 +08:00
binmakeswell d275b98b7d
add badge and contributor list 2022-03-11 15:50:28 +08:00
binmakeswell 08eccfe681
add community group and update issue template(#271) 2022-03-11 15:50:28 +08:00
Sze-qq 3312d716a0
update experimental visualization (#253) 2022-03-11 15:50:28 +08:00
binmakeswell 753035edd3
add Chinese README 2022-03-11 15:50:28 +08:00
Frank Lee eb3fda4c28
updated readme and change log (#224) 2022-02-15 11:31:13 +08:00
ver217 578ea0583b
update setup and workflow (#222) 2022-02-15 11:31:13 +08:00
Frank Lee 02f13fa9d1
add code quality badge (#201) 2022-02-03 14:01:09 +08:00
Frank Lee 812357d63c
fixed utils docstring and add example to readme (#200) 2022-02-03 11:37:17 +08:00
BoxiangW a2f1565672
Update GitHub action and pre-commit settings (#196)
* Update GitHub action and pre-commit settings

* Update GitHub action and pre-commit settings (#198)
2022-01-28 16:59:53 +08:00
Frank Lee a2e649da39
update readme (#168) 2022-01-20 13:26:38 +08:00
BoxiangW bd4840f1f1
Update workflow files and README.md (#166) 2022-01-19 20:15:14 +08:00
Frank Lee be85a0f366
removed tutorial markdown and refreshed rst files for consistency 2022-01-19 17:01:37 +08:00
binmakeswell 17ce8569a8
add logo at homepage, add forum in issue template (#161) 2022-01-19 14:29:31 +08:00
Frank Lee a1da3900c8
added docker documentation (#152) 2022-01-18 13:35:18 +08:00
Frank Lee 35813ed3c4
update examples and sphnix docs for the new api (#63) 2021-12-13 22:07:01 +08:00
Frank Lee 9a0466534c
update markdown docs (english) (#60) 2021-12-10 14:37:33 +08:00
Frank Lee 3defa32aee
Support TP-compatible Torch AMP and Update trainer API (#27)
* Add gradient accumulation, fix lr scheduler

* fix FP16 optimizer and adapted torch amp with tensor parallel (#18)

* fixed bugs in compatibility between torch amp and tensor parallel and performed some minor fixes

* fixed trainer

* Revert "fixed trainer"

This reverts commit 2e0b0b7699.

* improved consistency between trainer, engine and schedule (#23)

Co-authored-by: 1SAA <c2h214748@gmail.com>

Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: ver217 <lhx0217@gmail.com>
2021-11-18 19:45:06 +08:00
binmakeswell 05e7069a5b
fixed some typos in the documents, added blog link and paper author information in README 2021-11-03 17:18:43 +08:00
ver217 3c7604ba30
update documentation 2021-10-29 09:29:20 +08:00
アマデウス da2042f5c1
updated readme 2021-10-29 00:39:21 +08:00
zbian 404ecbdcc6
Migrated project 2021-10-28 18:21:23 +02:00