HELSON
6e0faa70e0
[gemini] add profiler in the demo ( #2534 )
2 years ago
HELSON
66dfcf5281
[gemini] update the gpt example ( #2527 )
2 years ago
HELSON
707b11d4a0
[gemini] update ddp strict mode ( #2518 )
...
* [zero] add strict ddp mode for chunk init
* [gemini] update gpt example
2 years ago
HELSON
2d1a7dfe5f
[zero] add strict ddp mode ( #2508 )
...
* [zero] add strict ddp mode
* [polish] add comments for strict ddp mode
* [zero] fix test error
2 years ago
Jiarui Fang
e327e95144
[hotfix] gpt example titans bug #2493 ( #2494 )
2 years ago
binmakeswell
fcc6d61d92
[example] fix requirements ( #2488 )
2 years ago
Jiarui Fang
3a21485ead
[example] titans for gpt ( #2484 )
2 years ago
Jiarui Fang
7c31706227
[CI] add test_ci.sh for palm, opt and gpt ( #2475 )
2 years ago
ver217
f525d1f528
[example] update gpt gemini example ci test ( #2477 )
2 years ago
Ziyue Jiang
fef5c949c3
polish pp middleware ( #2476 )
...
Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>
2 years ago
Jiarui Fang
867c8c2d3a
[zero] low level optim supports ProcessGroup ( #2464 )
2 years ago
YuliangLiu0306
2731531bc2
[autoparallel] integrate device mesh initialization into autoparallelize ( #2393 )
...
* [autoparallel] integrate device mesh initialization into autoparallelize
* add megatron solution
* update gpt autoparallel examples with latest api
* adapt beta value to fit the current computation cost
2 years ago
ZijianYY
fe0f7970a2
[examples] adding tflops to PaLM ( #2365 )
2 years ago
HELSON
d84e747975
[hotfix] add DISTPAN argument for benchmark ( #2412 )
...
* change the benchmark config file
* change config
* revert config file
* rename distpan to distplan
2 years ago
HELSON
498b5ca993
[hotfix] fix gpt gemini example ( #2404 )
...
* [hotfix] fix gpt gemini example
* [example] add new assertions
2 years ago
Jiarui Fang
12c8bf38d7
[Pipeline] Refine GPT PP Example
2 years ago
Ziyue Jiang
ad00894f7f
polish
2 years ago
Jiarui Fang
1aaeb596c6
[example] gpt, shard init on all processes ( #2366 )
2 years ago
Ziyue Jiang
3a15b20421
Move GPT PP Example
2 years ago
YuliangLiu0306
8b1e0dfd80
[example] upload auto parallel gpt2 demo ( #2354 )
2 years ago
Jiarui Fang
00a9c781fd
[example] add google doc for benchmark results of GPT ( #2355 )
2 years ago
Jiarui Fang
509a87f3ff
[example] make gpt example directory more clear ( #2353 )
2 years ago
Ikko Eltociear Ashimine
5e4bced0a3
[NFC] Update roberta/README.md ( #2350 )
2 years ago
Jiarui Fang
35e22be2f6
[example] simplify opt example ( #2344 )
2 years ago
ziyuhuang123
7080a8edb0
[workflow]New version: Create workflow files for examples' auto check ( #2298 )
...
* [workflows]bug_repair
* [workflow]new_pr_fixing_bugs
Co-authored-by: binmakeswell <binmakeswell@gmail.com>
2 years ago
binmakeswell
d7352bef2c
[example] add example requirement ( #2345 )
2 years ago
ZijianYY
f7fd592bf4
[examples]adding tp to PaLM ( #2319 )
2 years ago
HELSON
e00cedd181
[example] update gemini benchmark bash ( #2306 )
2 years ago
ZijianYY
df1d6dc553
[examples] using args and combining two versions for PaLM ( #2284 )
2 years ago
Ziyue Jiang
ac863a01d6
[example] add benchmark ( #2276 )
...
* add benchmark
* merge common func
* add total and avg tflops
Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>
2 years ago
Jiarui Fang
879df8b943
[example] GPT polish readme ( #2274 )
2 years ago
Ziyue Jiang
9654df0e9a
Add GPT PP Example ( #2272 )
...
Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>
2 years ago
YuliangLiu0306
4b29112ab2
[autoparallel] gpt2 autoparallel examples ( #2267 )
...
* [autoparallel] gpt2 autoparallel examples
* polish code
* polish code
2 years ago
HELSON
09c0102fe6
[example] fix gpt example with 0.1.10 ( #2265 )
2 years ago
Jiarui Fang
db4cbdc7fb
[builder] builder for scaled_upper_triang_masked_softmax ( #2234 )
2 years ago
HELSON
31fe84237b
[example] fix benchmark.sh for gpt example ( #2229 )
2 years ago
Jiarui Fang
2cdecc9f38
[example] make palm + GeminiDPP work ( #2227 )
2 years ago
ZijianYY
63cc77173b
[example] Palm adding gemini, still has bugs ( #2221 )
2 years ago
HELSON
7010e18134
[example] update gpt example ( #2225 )
2 years ago
Jiarui Fang
49c601da21
[example] add benchmark.sh for gpt ( #2226 )
2 years ago
HELSON
3629e611cd
[example] update gpt benchmark ( #2219 )
2 years ago
ZijianYY
92de90dfb3
[examples] replace einsum with matmul ( #2210 )
2 years ago
Jiarui Fang
7675792100
[builder] raise Error when CUDA_HOME is not set ( #2213 )
2 years ago
Jiarui Fang
d96cc37e32
[example] update GPT example benchmark results ( #2212 )
2 years ago
Jiarui Fang
d5e3e3ec01
[example] update gpt example for larger model scale ( #2211 )
2 years ago
Jiarui Fang
29868a9ec1
[example] update gpt readme with performance ( #2206 )
2 years ago
Jiarui Fang
65f56f49e8
[example] gpt demo more accuracy tflops ( #2178 )
2 years ago
Jiarui Fang
27327a4c90
[example] add palm pytorch version ( #2172 )
2 years ago
HELSON
a7d95b7024
[example] add zero1, zero2 example in GPT examples ( #2146 )
...
* [example] add zero1 and zero2 for GPT
* update readme in gpt example
* polish code
* change init value
* update readme
2 years ago
ZijianYY
fa9d1aea71
[example] update GPT README ( #2095 )
2 years ago
mandoxzhang
52bd106627
add RoBERTa ( #1980 )
...
* update roberta
* update roberta & readme
* update roberta & readme
* update roberta & readme
2 years ago
Jiarui Fang
f7e276fa71
[Gemini] add GeminiAdamOptimizer ( #1960 )
2 years ago
Jiarui Fang
60abd86d6a
[example] enhance GPT demo ( #1959 )
...
* [example] enhence GPT demo
* Update README.md
Co-authored-by: binmakeswell <binmakeswell@gmail.com>
2 years ago
Jiarui Fang
a25f755331
[example] add TP to GPT example ( #1828 )
2 years ago
Jiarui Fang
b1263d32ba
[example] simplify the GPT2 huggingface example ( #1826 )
2 years ago
Jiarui Fang
cd5a0d56fa
[Gemini] make gemini usage simple ( #1821 )
2 years ago
Jiarui Fang
350ccc0481
[example] opt does not depend on Titans ( #1811 )
2 years ago
Jiarui Fang
203ca57aed
[example] add GPT
2 years ago
Jiarui Fang
fd2c8d8156
[example] add opt model in lauguage ( #1809 )
2 years ago