littsk
|
1e0e080837
|
[bug] Fix the version check bug in colossalai run when generating the cmd. (#4713)
* Fix the version check bug in colossalai run when generating the cmd.
* polish code
|
1 year ago |
Hongxin Liu
|
079bf3cb26
|
[misc] update pre-commit and run all files (#4752)
* [misc] update pre-commit
* [misc] run pre-commit
* [misc] remove useless configuration files
* [misc] ignore cuda for clang-format
|
1 year ago |
Hongxin Liu
|
0b00def881
|
[example] add llama2 example (#4527)
* [example] transfer llama-1 example
* [example] fit llama-2
* [example] refactor scripts folder
* [example] fit new gemini plugin
* [cli] fix multinode runner
* [example] fit gemini optim checkpoint
* [example] refactor scripts
* [example] update requirements
* [example] update requirements
* [example] rename llama to llama2
* [example] update readme and pretrain script
* [example] refactor scripts
|
1 year ago |
LuGY
|
03654c0ce2
|
fix localhost measurement (#4320)
|
1 year ago |
Hongxin Liu
|
1908caad38
|
[cli] hotfix launch command for multi-nodes (#4165)
|
1 year ago |
Liu Ziming
|
8065cc5fba
|
Modify torch version requirement to adapt torch 2.0 (#3896)
|
1 year ago |
digger yu
|
70c8cdecf4
|
[nfc] fix typo colossalai/cli fx kernel (#3847)
* fix typo colossalai/autochunk auto_parallel amp
* fix typo colossalai/auto_parallel nn utils etc.
* fix typo colossalai/auto_parallel autochunk fx/passes etc.
* fix typo docs/
* change placememt_policy to placement_policy in docs/ and examples/
* fix typo colossalai/ applications/
* fix typo colossalai/cli fx kernel
|
2 years ago |
Zihao
|
b3d10db5f1
|
[NFC] polish colossalai/cli/launcher/__init__.py code style (#2709)
|
2 years ago |
Frank Lee
|
14d9299360
|
[cli] fixed hostname mismatch error (#2465)
|
2 years ago |
Frank Lee
|
39163417a1
|
[example] updated the hybrid parallel tutorial (#2444)
* [example] updated the hybrid parallel tutorial
* polish code
|
2 years ago |
Frank Lee
|
c72c827e95
|
[cli] provided more details if colossalai run fail (#2442)
|
2 years ago |
Frank Lee
|
1467d83edf
|
[cli] remove unused imports (#1001)
|
3 years ago |
Frank Lee
|
cf6d1c9284
|
[CLI] refactored the launch CLI and fixed bugs in multi-node launching (#844)
* [cli] fixed multi-node job launching
* [cli] fixed a bug in version comparison
* [cli] support launching with env var
* [cli] fixed multi-node job launching
* [cli] fixed a bug in version comparison
* [cli] support launching with env var
* added docstring
* [cli] added extra launch arguments
* [cli] added default launch rdzv args
* [cli] fixed version comparison
* [cli] added docstring examples and requierment
* polish docstring
* polish code
* polish code
|
3 years ago |
FrankLeeeee
|
d522cb704e
|
[cli] fixed single-node process launching
|
3 years ago |
FrankLeeeee
|
f63e91d280
|
[cli] fixed a bug in user args and refactored the module structure
|
3 years ago |
YuliangLiu0306
|
cfadc9df8e
|
[cli] added distributed launcher command (#791)
* [CLI] add CLI launcher
* Revert "[CLI] add CLI launcher"
This reverts commit df7e6506d4 .
* [CLI]add cli launcher feature
* remove testing message used during developing
* refactor the module structure.
|
3 years ago |