HELSON
a5dc4253c6
[zero] polish low level optimizer ( #2473 )
2 years ago
Frank Lee
8b7495dd54
[example] integrate seq-parallel tutorial with CI ( #2463 )
2 years ago
ver217
8e85d2440a
[example] update vit ci script ( #2469 )
* [example] update vit ci script
* [example] update requirements
* [example] update requirements
2 years ago
Jiarui Fang
867c8c2d3a
[zero] low level optim supports ProcessGroup ( #2464 )
2 years ago
Frank Lee
e6943e2d11
[example] integrate autoparallel demo with CI ( #2466 )
* [example] integrate autoparallel demo with CI
* polish code
* polish code
* polish code
* polish code
2 years ago
Frank Lee
14d9299360
[cli] fixed hostname mismatch error ( #2465 )
2 years ago
YuliangLiu0306
c20529fe78
[examples] update autoparallel tutorial demo ( #2449 )
* [examples] update autoparallel tutorial demo
* add test_ci.sh
* polish
* add conda yaml
2 years ago
Haofan Wang
9358262992
Fix False warning in initialize.py ( #2456 )
* Update initialize.py
* pre-commit run check
2 years ago
Frank Lee
32c46e146e
[workflow] automated bdist wheel build ( #2459 )
* [workflow] automated bdist wheel build
* polish workflow
* polish readme
* polish readme
2 years ago
YuliangLiu0306
8221fd7485
[autoparallel] update binary elementwise handler ( #2451 )
* [autoparallel] update binary elementwise handler
* polish
2 years ago
Frank Lee
c9ec5190a0
[workflow] automated the compatibility test ( #2453 )
* [workflow] automated the compatibility test
* polish code
2 years ago
Frank Lee
483efdabc5
[workflow] fixed the on-merge condition check ( #2452 )
2 years ago
Haofan Wang
cfd1d5ee49
[example] fixed seed error in train_dreambooth_colossalai.py ( #2445 )
2 years ago
Frank Lee
ac18a445fa
[example] updated large-batch optimizer tutorial ( #2448 )
* [example] updated large-batch optimizer tutorial
* polish code
* polish code
2 years ago
HELSON
2bfeb24308
[zero] add warning for ignored parameters ( #2446 )
2 years ago
Frank Lee
39163417a1
[example] updated the hybrid parallel tutorial ( #2444 )
* [example] updated the hybrid parallel tutorial
* polish code
2 years ago
HELSON
5521af7877
[zero] fix state_dict and load_state_dict for ddp ignored parameters ( #2443 )
* [ddp] add is_ddp_ignored
[ddp] rename to is_ddp_ignored
* [zero] fix state_dict and load_state_dict
* fix bugs
* [zero] update unit test for ZeroDDP
2 years ago
YuliangLiu0306
2731531bc2
[autoparallel] integrate device mesh initialization into autoparallelize ( #2393 )
* [autoparallel] integrate device mesh initialization into autoparallelize
* add megatron solution
* update gpt autoparallel examples with latest api
* adapt beta value to fit the current computation cost
2 years ago
Frank Lee
c72c827e95
[cli] provided more details if colossalai run fails ( #2442 )
2 years ago
Super Daniel
c41e59e5ad
[fx] allow native ckpt trace and codegen. ( #2438 )
2 years ago
YuliangLiu0306
41429b9b28
[autoparallel] add shard option ( #2423 )
2 years ago
Frank Lee
1b7587d958
[workflow] make test coverage report collapsable ( #2436 )
2 years ago
HELSON
7829aa094e
[ddp] add is_ddp_ignored ( #2434 )
[ddp] rename to is_ddp_ignored
2 years ago
Frank Lee
a3e5496156
[example] improved the clarity of the example readme ( #2427 )
* [example] improved the clarity of the example readme
* polish workflow
* polish workflow
* polish workflow
* polish workflow
* polish workflow
* polish workflow
2 years ago
Frank Lee
21256674e9
[workflow] report test coverage even if below threshold ( #2431 )
2 years ago
HELSON
bb4e9a311a
[zero] add inference mode and its unit test ( #2418 )
2 years ago
Frank Lee
63be79d505
[example] removed duplicated stable diffusion example ( #2424 )
2 years ago
Frank Lee
cd38167c1a
[doc] added documentation for CI/CD ( #2420 )
* [doc] added documentation for CI/CD
* polish markdown
* polish markdown
* polish markdown
2 years ago
Frank Lee
b3472d32e0
[workflow] auto comment with test coverage report ( #2419 )
* [workflow] auto comment with test coverage report
* polish code
* polish yaml
2 years ago
ZijianYY
fe0f7970a2
[examples] adding tflops to PaLM ( #2365 )
2 years ago
Jiarui Fang
93f62dd152
[autochunk] add autochunk feature
2 years ago
HELSON
dddacd2d2c
[hotfix] add norm clearing for the overflow step ( #2416 )
2 years ago
Frank Lee
57b6157b6c
[workflow] auto comment if precommit check fails ( #2417 )
2 years ago
oahzxl
61fdd3464a
update doc
2 years ago
oahzxl
36ab2cb783
change import
2 years ago
Frank Lee
2445279a08
[setup] refactored setup.py for dependency graph ( #2413 )
2 years ago
Frank Lee
9d432230ba
[workflow] added translation for non-english comments ( #2414 )
2 years ago
oahzxl
7ab2db206f
adapt new fx
2 years ago
Frank Lee
4befaabace
[workflow] added precommit check for code consistency ( #2401 )
* [workflow] added precommit check for code consistency
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
2 years ago
HELSON
d84e747975
[hotfix] add DISTPLAN argument for benchmark ( #2412 )
* change the benchmark config file
* change config
* revert config file
* rename distpan to distplan
2 years ago
oahzxl
e532679c95
Merge branch 'main' of https://github.com/oahzxl/ColossalAI into chunk
2 years ago
Haofan Wang
7d5640b9db
Update parallel_context.py ( #2408 )
2 years ago
Frank Lee
8327932d2c
[workflow] refactored the example check workflow ( #2411 )
* [workflow] refactored the example check workflow
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
2 years ago
oahzxl
c1492e5013
add test in import
2 years ago
oahzxl
fd818cf144
change imports
2 years ago
oahzxl
a591d45b29
add available
2 years ago
oahzxl
615e7e68d9
update doc
2 years ago
oahzxl
7d4abaa525
add doc
2 years ago
Frank Lee
8de8de9fa3
[docker] updated Dockerfile and release workflow ( #2410 )
2 years ago
oahzxl
1be0ac3cbf
add doc for trace indice
2 years ago