Commit Graph

2546 Commits (fc5cef2c79265e36b585ef22c5e1d7f18be52a4e)
 

Author SHA1 Message Date
wukong1992 6b30dfb7ce [shardformer] support llama model using shardformer (#3969)
1 year ago
FoolPlayer 45927d5527 [shardformer] Add dropout layer in shard model and refactor policy api (#3949)
1 year ago
FoolPlayer a73130482d [shardformer] Unit test (#3928)
1 year ago
FoolPlayer f1cb5ac6bf [shardformer] Align bert value (#3907)
1 year ago
FoolPlayer 79f8d5d54b [shardformer] add gpt2 policy and modify shard and slicer to support (#3883)
1 year ago
FoolPlayer 70173e3123 update README (#3909)
1 year ago
FoolPlayer ab8a47f830 [shardformer] add Dropout layer support different dropout pattern (#3856)
1 year ago
FoolPlayer c594dc2f1c [shardformer] update readme with modules implement doc (#3834)
1 year ago
Frank Lee 4972e1f40e [shardformer] refactored the user api (#3828)
1 year ago
Frank Lee 235792f170 [shardformer] updated readme (#3827)
1 year ago
FoolPlayer 8cc11235c0 [shardformer]: Feature/shardformer, add some docstring and readme (#3816)
1 year ago
FoolPlayer 8d68de767d [shardformer] init shardformer code structure (#3731)
1 year ago
Wenhao Chen 3d8d5d0d58
[chat] use official transformers and fix some issues (#4117)
1 year ago
Baizhou Zhang 1350ece492
[hotfix] fix import bug in checkpoint_io (#4142)
1 year ago
digger yu 8abc87798f
fix Tensor is not defined (#4129)
1 year ago
digger yu 7e46bc87b6
fix CheckpointIndexFile is not defined (#4109)
1 year ago
digger yu 09fe9dc704
[nfc]fix ColossalaiOptimizer is not defined (#4122)
1 year ago
Wenhao Chen edd75a59ea
[chat] remove naive strategy and split colossalai strategy (#4094)
1 year ago
Wenhao Chen b03d64d010
[chat] refactor trainer class (#4080)
1 year ago
Jianghai 711e2b4c00
[doc] update and revise some typos and errs in docs (#4107)
1 year ago
digger yu 769cddcb2c
fix typo docs/ (#4033)
1 year ago
digger yu 2d40759a53
fix #3852 path error (#4058)
1 year ago
Frank Lee 1ee947f617
[workflow] added status check for test coverage workflow (#4106)
1 year ago
Jianghai 31dc302017
[examples] copy resnet example to image (#4090)
1 year ago
Frank Lee 95e95b6d58
[testing] move pytest to be inside the function (#4087)
1 year ago
Baizhou Zhang 4da324cd60
[hotfix]fix argument naming in docs and examples (#4083)
1 year ago
Michelle e89b127d8e
[chat]: fix chat evaluation possible bug (#4064)
1 year ago
Baizhou Zhang 2c8ae37f61
Merge pull request #4056 from Fridge003/hotfix/fix_gemini_chunk_config_searching
1 year ago
Wenhao Chen 153b957a1b
[chat] refactor strategy class with booster api (#3987)
1 year ago
Baizhou Zhang 0bb0b481b4 [gemini] fix argument naming during chunk configuration searching
1 year ago
Frank Lee b463651f3e
[workflow] cover all public repositories in weekly report (#4069)
1 year ago
Hongxin Liu 4a81faa5f3
[devops] fix build on pr ci (#4043)
1 year ago
github-actions[bot] a52f62082d
[format] applied code formatting on changed files in pull request 4021 (#4022)
1 year ago
LuGY 160c64c645
[example] fix bucket size in example of gpt gemini (#4028)
1 year ago
digger yu 727c4598a9
[nfc] fix dim not defined and fix typo (#3991)
1 year ago
Frank Lee ca768eb62d
Merge pull request #4025 from hpcaitech/develop
1 year ago
Frank Lee a5883aa790
[test] fixed codefactor format report (#4026)
1 year ago
Baizhou Zhang 822c3d4d66
[checkpointio] sharded optimizer checkpoint for DDP plugin (#4002)
1 year ago
Wenhao Chen 725af3eeeb
[booster] make optimizer argument optional for boost (#3993)
1 year ago
Baizhou Zhang c9cff7e7fa
[checkpointio] General Checkpointing of Sharded Optimizers (#3984)
1 year ago
digger yu d4fb7bfda7
fix typo applications/Chat/coati/ (#3947)
1 year ago
Baizhou Zhang e8ad3c88f5
[doc] add a note about unit-testing to CONTRIBUTING.md (#3970)
1 year ago
Yuanchen 2925f47399
[evaluate] support gpt evaluation with reference (#3972)
1 year ago
Frank Lee 8bcad73677
[workflow] fixed the directory check in build (#3980)
1 year ago
Wenhao Chen 9d02590c9a
[chat] refactor actor class (#3968)
1 year ago
Frank Lee 2bf6547ad7
Merge pull request #3967 from ver217/update-develop
1 year ago
Frank Lee 6718a2f285 [workflow] cancel duplicated workflow jobs (#3960)
1 year ago
Frank Lee 71fe52769c [gemini] fixed the gemini checkpoint io (#3934)
1 year ago
Baizhou Zhang b3ab7fbabf
[example] update ViT example using booster api (#3940)
1 year ago
Frank Lee 4110d1f0d4
[workflow] cancel duplicated workflow jobs (#3960)
1 year ago