binmakeswell
46f20bac41
[doc] update auto parallel paper link ( #2686 )
* [doc] update auto parallel paper link
* [doc] update auto parallel paper link
2 years ago
binmakeswell
9ab14b20b5
[doc] add CVPR tutorial ( #2666 )
2 years ago
binmakeswell
a020eecc70
[doc] fix typo of BLOOM ( #2643 )
* [doc] fix typo of BLOOM
2 years ago
Frank Lee
c375563653
[doc] removed pre-built wheel installation from readme ( #2637 )
2 years ago
Frank Lee
291b051171
[doc] fixed broken badge ( #2623 )
2 years ago
binmakeswell
039b0c487b
[tutorial] polish README ( #2568 )
2 years ago
binmakeswell
a360b9bc44
[doc] update example link ( #2520 )
* [doc] update example link
* [doc] update example link
2 years ago
binmakeswell
a6a10616ec
[doc] update opt and tutorial links ( #2509 )
2 years ago
Frank Lee
cd38167c1a
[doc] added documentation for CI/CD ( #2420 )
* [doc] added documentation for CI/CD
* polish markdown
* polish markdown
* polish markdown
2 years ago
Frank Lee
85e045b063
[doc] updated readme regarding pypi installation ( #2406 )
2 years ago
Jiarui Fang
c3d9e23277
[builder] correct readme ( #2375 )
* [example] add google doc for benchmark results of GPT
* add tencet doc
* [example] gpt, shard init on all processes
* polish comments
* polish code
* [builder] update readme
2 years ago
binmakeswell
e512ca9c24
[doc] update stable diffusion link ( #2322 )
* [doc] update link
2 years ago
Sze-qq
da1c47f060
update ColossalAI logo ( #2316 )
Co-authored-by: siqi <siqi@siqis-MacBook-Pro.local>
2 years ago
binmakeswell
2fac699923
[doc] update news ( #2295 )
2 years ago
binmakeswell
c719798abe
[doc] add feature diffusion v2, bloom, auto-parallel ( #2282 )
2 years ago
binmakeswell
04a200573c
[NFC] update news link ( #2191 )
2 years ago
binmakeswell
c13c22c481
[doc] add news ( #1901 )
2 years ago
binmakeswell
9d3124ac8b
[doc] remove obsolete API demo ( #1833 )
2 years ago
binmakeswell
76e64cb67c
[doc] add diffusion ( #1827 )
2 years ago
binmakeswell
16b0abf94f
[doc] add FastFold ( #1766 )
2 years ago
binmakeswell
0d87c4e20d
[doc] update recommendation system catalogue ( #1732 )
2 years ago
Jiarui Fang
c626b23960
[doc] update recommedation system urls ( #1725 )
2 years ago
Jiarui Fang
b0a23dc4fc
[embeddings] add doc in readme ( #1711 )
2 years ago
binmakeswell
1c9ec32734
[NFC] add OPT serving ( #1581 )
2 years ago
Sze-qq
3b6a5e2593
update OPT experiment result for 8 GPUs ( #1503 )
2 years ago
fastalgo
0f438d15ee
Update README.md
2 years ago
Sze-qq
1750d6f573
[doc] update readme with the new xTrimoMultimer project ( #1477 )
* update xTrimoMultimer project
* update xTrimoMultimer project
* latest update of xTrimoMultimer project info
2 years ago
Jiarui Fang
4f5f8f77d1
update nvme on readme ( #1397 )
2 years ago
fastalgo
db89600cf2
Update README.md
2 years ago
binmakeswell
92b0b139eb
[NFC] add OPT ( #1345 )
2 years ago
fastalgo
7857fd7616
Update README.md
2 years ago
Sze-qq
49114d8df0
update GPT-3 visualisation
2 years ago
Frank Lee
51f1ec96b0
[workflow] polish readme and dockerfile ( #1165 )
* [workflow] polish readme and dockerfile
* polish
2 years ago
binmakeswell
626dd187e4
add inference submodule ( #1047 )
3 years ago
binmakeswell
0dac86866b
[NFC] add inference ( #1044 )
3 years ago
Frank Lee
8d06186ff9
[doc] update docker instruction ( #1020 )
3 years ago
binmakeswell
9833d814d5
[NFC] fix paper link
3 years ago
binmakeswell
c27ea0d980
fix download link ( #998 )
3 years ago
binmakeswell
7471f97fc3
update results on a single GPU, highlight quick view ( #981 )
3 years ago
binmakeswell
deaf99f4c9
[readme] sync CN readme ( #766 )
3 years ago
Jiarui Fang
1f698f4406
[readme] polish readme ( #764 )
* [readme] polish readme
* centering image
3 years ago
binmakeswell
600e769a42
add video ( #732 )
3 years ago
Frank Lee
6f7d1362c9
[doc] removed outdated installation command ( #730 )
3 years ago
binmakeswell
896ade15d6
add PaLM link ( #704 ) ( #705 )
3 years ago
binmakeswell
270157e9e7
add PaLM link ( #704 )
* add PaLM link
3 years ago
Sze-qq
ce8a3eae5b
update GPT-2 experiment result ( #666 )
3 years ago
Jie Zhu
73d36618a6
[profiler] add MemProfiler ( #356 )
* add memory trainer hook
* fix bug
* add memory trainer hook
* fix import bug
* fix import bug
* add trainer hook
* fix #370 git log bug
* modify `to_tensorboard` function to support better output
* remove useless output
* change the name of `MemProfiler`
* complete memory profiler
* replace error with warning
* finish trainer hook
* modify interface of MemProfiler
* modify `__init__.py` in profiler
* remove unnecessary pass statement
* add usage to doc string
* add usage to trainer hook
* new location to store temp data file
3 years ago
fastalgo
a513164379
Update README.md ( #514 )
3 years ago
Sze-qq
7f5e4592eb
Update Experiment result about Colossal-AI with ZeRO ( #479 )
* [readme] add experimental visualisation regarding ColossalAI with ZeRO (#476 )
* Hotfix/readme (#478 )
* add experimental visualisation regarding ColossalAI with ZeRO
* adjust newly-added figure size
3 years ago
Frank Lee
4f85b687cf
[misc] replace codebeat with codefactor on readme ( #436 )
3 years ago
Frank Lee
62b08acc72
update hf badge link ( #410 )
3 years ago
Frank Lee
cf92a779dc
added huggingface badge ( #407 )
3 years ago
Frank Lee
6d3a4f51bf
fixed broken badge link
3 years ago
binmakeswell
ce7b2c9ae3
update README and images path ( #384 )
3 years ago
Shen Chenhui
1c88dd43e2
Fix/format ( #366 )
3 years ago
binmakeswell
d275b98b7d
add badge and contributor list
3 years ago
binmakeswell
08eccfe681
add community group and update issue template( #271 )
3 years ago
Sze-qq
3312d716a0
update experimental visualization ( #253 )
3 years ago
binmakeswell
753035edd3
add Chinese README
3 years ago
Frank Lee
eb3fda4c28
updated readme and change log ( #224 )
3 years ago
ver217
578ea0583b
update setup and workflow ( #222 )
3 years ago
Frank Lee
02f13fa9d1
add code quality badge ( #201 )
3 years ago
Frank Lee
812357d63c
fixed utils docstring and add example to readme ( #200 )
3 years ago
BoxiangW
a2f1565672
Update GitHub action and pre-commit settings ( #196 )
* Update GitHub action and pre-commit settings
* Update GitHub action and pre-commit settings (#198 )
3 years ago
Frank Lee
a2e649da39
update readme ( #168 )
3 years ago
BoxiangW
bd4840f1f1
Update workflow files and README.md ( #166 )
3 years ago
Frank Lee
be85a0f366
removed tutorial markdown and refreshed rst files for consistency
3 years ago
binmakeswell
17ce8569a8
add logo at homepage, add forum in issue template ( #161 )
3 years ago
Frank Lee
a1da3900c8
added docker documentation ( #152 )
3 years ago
Frank Lee
35813ed3c4
update examples and sphnix docs for the new api ( #63 )
3 years ago
Frank Lee
9a0466534c
update markdown docs (english) ( #60 )
3 years ago
Frank Lee
3defa32aee
Support TP-compatible Torch AMP and Update trainer API ( #27 )
* Add gradient accumulation, fix lr scheduler
* fix FP16 optimizer and adapted torch amp with tensor parallel (#18 )
* fixed bugs in compatibility between torch amp and tensor parallel and performed some minor fixes
* fixed trainer
* Revert "fixed trainer"
This reverts commit 2e0b0b7699.
* improved consistency between trainer, engine and schedule (#23 )
Co-authored-by: 1SAA <c2h214748@gmail.com>
Co-authored-by: ver217 <lhx0217@gmail.com>
3 years ago
binmakeswell
05e7069a5b
fixed some typos in the documents, added blog link and paper author information in README
3 years ago
ver217
3c7604ba30
update documentation
3 years ago
アマデウス
da2042f5c1
updated readme
3 years ago
zbian
404ecbdcc6
Migrated project
3 years ago