Hongxin Liu
f83ea813f5
[example] add train resnet/vit with booster example ( #3694 )
...
* [example] add train vit with booster example
* [example] update readme
* [example] add train resnet with booster example
* [example] enable ci
* [example] enable ci
* [example] add requirements
* [hotfix] fix analyzer init
* [example] update requirements
2 years ago
Hongxin Liu
d556648885
[example] add finetune bert with booster example ( #3693 )
2 years ago
github-actions[bot]
d544ed4345
[bot] Automated submodule synchronization ( #3596 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
binmakeswell
f1b3d60cae
[example] reorganize for community examples ( #3557 )
2 years ago
Frank Lee
80eba05b0a
[test] refactor tests with spawn ( #3452 )
...
* [test] added spawn decorator
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
2 years ago
Frank Lee
7d8d825681
[booster] fixed the torch ddp plugin with the new checkpoint api ( #3442 )
2 years ago
ver217
573af84184
[example] update examples related to zero/gemini ( #3431 )
...
* [zero] update legacy import
* [zero] update examples
* [example] fix opt tutorial
* [example] fix opt tutorial
* [example] fix opt tutorial
* [example] fix opt tutorial
* [example] fix import
2 years ago
ver217
26b7aac0be
[zero] reorganize zero/gemini folder structure ( #3424 )
...
* [zero] refactor low-level zero folder structure
* [zero] fix legacy zero import path
* [zero] fix legacy zero import path
* [zero] remove useless import
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor legacy zero import path
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor legacy zero import path
* [zero] fix test import path
* [zero] fix test
* [zero] fix circular import
* [zero] update import
2 years ago
YuliangLiu0306
fd6add575d
[examples] polish AutoParallel readme ( #3270 )
2 years ago
Frank Lee
73d3e4d309
[booster] implemented the torch ddd + resnet example ( #3232 )
...
* [booster] implemented the torch ddd + resnet example
* polish code
2 years ago
github-actions[bot]
0aa92c0409
Automated submodule synchronization ( #3105 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
binmakeswell
018936a3f3
[tutorial] update notes for TransformerEngine ( #3098 )
2 years ago
Kirthi Shankar Sivamani
65a4dbda6c
[NVIDIA] Add FP8 example using TE ( #3080 )
...
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
2 years ago
binmakeswell
52a5078988
[doc] add ISC tutorial ( #2997 )
...
* [doc] add ISC tutorial
* [doc] add ISC tutorial
* [doc] add ISC tutorial
* [doc] add ISC tutorial
2 years ago
github-actions[bot]
827a0af8cc
Automated submodule synchronization ( #2982 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
binmakeswell
0afb55fc5b
[doc] add os scope, update tutorial install and tips ( #2914 )
2 years ago
Zheng Zeng
597914317b
[doc] fix typo in opt inference tutorial ( #2849 )
2 years ago
github-actions[bot]
a5721229d9
Automated submodule synchronization ( #2740 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
github-actions[bot]
d701ef81b1
Automated submodule synchronization ( #2707 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
github-actions[bot]
88416019e7
Automated submodule synchronization ( #2648 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
binmakeswell
9ab14b20b5
[doc] add CVPR tutorial ( #2666 )
2 years ago
Frank Lee
4ae02c4b1c
[tutorial] added energonai to opt inference requirements ( #2625 )
2 years ago
binmakeswell
0556f5d468
[tutorial] add video link ( #2619 )
2 years ago
github-actions[bot]
ae86be1fd2
Automated submodule synchronization ( #2607 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
binmakeswell
039b0c487b
[tutorial] polish README ( #2568 )
2 years ago
oahzxl
4f5ef73a43
[tutorial] update fastfold tutorial ( #2565 )
...
* update readme
* update
* update
2 years ago
YuliangLiu0306
f477a14f4a
[hotfix] fix autoparallel demo ( #2533 )
2 years ago
LuGY
ecbad93b65
[example] Add fastfold tutorial ( #2528 )
...
* add fastfold example
* pre-commit polish
* pre-commit polish readme and add empty test ci
* Add test_ci and reduce the default sequence length
2 years ago
Frank Lee
8b7495dd54
[example] integrate seq-parallel tutorial with CI ( #2463 )
2 years ago
Frank Lee
e6943e2d11
[example] integrate autoparallel demo with CI ( #2466 )
...
* [example] integrate autoparallel demo with CI
* polish code
* polish code
* polish code
* polish code
2 years ago
YuliangLiu0306
c20529fe78
[examples] update autoparallel tutorial demo ( #2449 )
...
* [examples] update autoparallel tutorial demo
* add test_ci.sh
* polish
* add conda yaml
2 years ago
Frank Lee
ac18a445fa
[example] updated large-batch optimizer tutorial ( #2448 )
...
* [example] updated large-batch optimizer tutorial
* polish code
* polish code
2 years ago
Frank Lee
39163417a1
[example] updated the hybrid parallel tutorial ( #2444 )
...
* [example] updated the hybrid parallel tutorial
* polish code
2 years ago
Frank Lee
63be79d505
[example] removed duplicated stable diffusion example ( #2424 )
2 years ago
Frank Lee
8327932d2c
[workflow] refactored the example check workflow ( #2411 )
...
* [workflow] refactored the example check workflow
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
2 years ago
binmakeswell
d7352bef2c
[example] add example requirement ( #2345 )
2 years ago
YuliangLiu0306
edf4cd46c5
[examples] update autoparallel demo ( #2061 )
2 years ago
Super Daniel
2edbef13cc
[fx] add more meta_registry for MetaTensor execution. ( #2000 )
...
* [sc] add examples for auto checkpoint.
* merge upstream
* [fx] add more meta_registry for MetaTensor execution.
2 years ago
Jiarui Fang
f7e276fa71
[Gemini] add GeminiAdamOptimizer ( #1960 )
2 years ago
binmakeswell
9183e0dec5
[tutorial] polish all README ( #1946 )
2 years ago
Frank Lee
de56b563b9
[tutorial] added missing dummy dataloader ( #1944 )
2 years ago
Frank Lee
c6ea65011f
[tutorial] fixed pipeline bug for sequence parallel ( #1943 )
2 years ago
YuliangLiu0306
c7925c5d08
[sc demo] add requirements to spmd README ( #1941 )
2 years ago
Boyuan Yao
d5f5e06d82
[SC] remove redundant hands on ( #1939 )
...
* [sc] SC tutorial for auto checkpoint
* [sc] polish examples
* [sc] polish readme
* [sc] polish readme and help information
* [sc] polish readme and help information
* [sc] modify auto checkpoint benchmark
* [sc] remove imgs
* [sc] remove redundant handson
2 years ago
binmakeswell
41868f7605
[tutorial] polish README and OPT files ( #1930 )
...
* [tutorial] polish README and OPT files
* [tutorial] polish README and OPT files
* [tutorial] polish README and OPT files
2 years ago
ver217
b0b7a786b7
[tutorial] add synthetic dataset for opt ( #1924 )
2 years ago
Frank Lee
0486048453
[tutorial] updated hybrid parallel readme ( #1928 )
...
* [tutorial] updated hybrid parallel readme
* polish code
2 years ago
Frank Lee
807cbdb87d
[tutorial] added synthetic data for sequence parallel ( #1927 )
...
* [tutorial] added synthetic data for sequence parallel
* polish code
2 years ago
Frank Lee
abf4c27f6a
[tutorial] removed huggingface model warning ( #1925 )
2 years ago
Frank Lee
d43a671ad6
Hotfix/tutorial readme index ( #1922 )
...
* [tutorial] removed tutorial index in readme
* [tutorial] removed tutorial index in readme
2 years ago