YuliangLiu0306
f477a14f4a
[hotfix] fix autoparallel demo ( #2533 )
2 years ago
HELSON
6e0faa70e0
[gemini] add profiler in the demo ( #2534 )
2 years ago
Fazzie
f35326881c
fix README
2 years ago
HELSON
66dfcf5281
[gemini] update the gpt example ( #2527 )
2 years ago
LuGY
ecbad93b65
[example] Add fastfold tutorial ( #2528 )
...
* add fastfold example
* pre-commit polish
* pre-commit polish readme and add empty test ci
* Add test_ci and reduce the default sequence length
2 years ago
Jiarui Fang
fd8d19a6e7
[example] update lightning dependency for stable diffusion ( #2522 )
2 years ago
HELSON
707b11d4a0
[gemini] update ddp strict mode ( #2518 )
...
* [zero] add strict ddp mode for chunk init
* [gemini] update gpt example
2 years ago
HELSON
2d1a7dfe5f
[zero] add strict ddp mode ( #2508 )
...
* [zero] add strict ddp mode
* [polish] add comments for strict ddp mode
* [zero] fix test error
2 years ago
jiaruifang
32390cbe8f
add test_ci.sh to dreambooth
2 years ago
jiaruifang
025b482dc1
[example] dreambooth example
2 years ago
Jiarui Fang
e327e95144
[hotfix] gpt example titans bug #2493 ( #2494 )
2 years ago
jiaruifang
e58cc441e2
polish code and fix dataloader bugs
2 years ago
jiaruifang
a4b75b78a0
[hotfix] gpt example titans bug #2493
2 years ago
binmakeswell
fcc6d61d92
[example] fix requirements ( #2488 )
2 years ago
Jiarui Fang
3a21485ead
[example] titans for gpt ( #2484 )
2 years ago
jiaruifang
438ea608f3
update readme
2 years ago
jiaruifang
38424db6ff
polish code
2 years ago
jiaruifang
92f65fbbe3
remove license
2 years ago
jiaruifang
315e1433ce
polish readme
2 years ago
jiaruifang
37baea20cb
[example] titans for gpt
2 years ago
jiaruifang
e64a05b38b
polish code
2 years ago
Jiarui Fang
7c31706227
[CI] add test_ci.sh for palm, opt and gpt ( #2475 )
2 years ago
Jiarui Fang
e4c38ba367
[example] stable diffusion add roadmap ( #2482 )
2 years ago
jiaruifang
9cba38b492
add dummy test_ci.sh
2 years ago
jiaruifang
f78bad21ed
[example] stable diffusion add roadmap
2 years ago
ver217
f525d1f528
[example] update gpt gemini example ci test ( #2477 )
2 years ago
Ziyue Jiang
fef5c949c3
polish pp middleware ( #2476 )
...
Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>
2 years ago
Frank Lee
8b7495dd54
[example] integrate seq-parallel tutorial with CI ( #2463 )
2 years ago
ver217
8e85d2440a
[example] update vit ci script ( #2469 )
...
* [example] update vit ci script
* [example] update requirements
* [example] update requirements
2 years ago
Jiarui Fang
867c8c2d3a
[zero] low level optim supports ProcessGroup ( #2464 )
2 years ago
Frank Lee
e6943e2d11
[example] integrate autoparallel demo with CI ( #2466 )
...
* [example] integrate autoparallel demo with CI
* polish code
* polish code
* polish code
* polish code
2 years ago
YuliangLiu0306
c20529fe78
[examples] update autoparallel tutorial demo ( #2449 )
...
* [examples] update autoparallel tutorial demo
* add test_ci.sh
* polish
* add conda yaml
2 years ago
Haofan Wang
cfd1d5ee49
[example] fixed seed error in train_dreambooth_colossalai.py ( #2445 )
2 years ago
Frank Lee
ac18a445fa
[example] updated large-batch optimizer tutorial ( #2448 )
...
* [example] updated large-batch optimizer tutorial
* polish code
* polish code
2 years ago
Frank Lee
39163417a1
[example] updated the hybrid parallel tutorial ( #2444 )
...
* [example] updated the hybrid parallel tutorial
* polish code
2 years ago
YuliangLiu0306
2731531bc2
[autoparallel] integrate device mesh initialization into autoparallelize ( #2393 )
...
* [autoparallel] integrate device mesh initialization into autoparallelize
* add megatron solution
* update gpt autoparallel examples with latest api
* adapt beta value to fit the current computation cost
2 years ago
Frank Lee
a3e5496156
[example] improved the clarity yof the example readme ( #2427 )
...
* [example] improved the clarity yof the example readme
* polish workflow
* polish workflow
* polish workflow
* polish workflow
* polish workflow
* polish workflow
2 years ago
Frank Lee
63be79d505
[example] removed duplicated stable diffusion example ( #2424 )
2 years ago
ZijianYY
fe0f7970a2
[examples] adding tflops to PaLM ( #2365 )
2 years ago
HELSON
d84e747975
[hotfix] add DISTPAN argument for benchmark ( #2412 )
...
* change the benchmark config file
* change config
* revert config file
* rename distpan to distplan
2 years ago
Frank Lee
8327932d2c
[workflow] refactored the example check workflow ( #2411 )
...
* [workflow] refactored the example check workflow
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
2 years ago
HELSON
498b5ca993
[hotfix] fix gpt gemini example ( #2404 )
...
* [hotfix] fix gpt gemini example
* [example] add new assertions
2 years ago
jiaruifang
b2e0d502b8
[doc] hotfix #2377
2 years ago
1SAA
33f3023e19
[hotfix] fix implement error in diffusers
2 years ago
Ziyue Jiang
ad00894f7f
polish
2 years ago
Jiarui Fang
1aaeb596c6
[example] gpt, shard init on all processes ( #2366 )
2 years ago
Ziyue Jiang
3a15b20421
Move GPT PP Example
2 years ago
HELSON
48d33b1b17
[gemini] add get static torch model ( #2356 )
2 years ago
YuliangLiu0306
8b1e0dfd80
[example] upload auto parallel gpt2 demo ( #2354 )
2 years ago
Jiarui Fang
00a9c781fd
[example] add google doc for benchmark results of GPT ( #2355 )
2 years ago