2519 Commits (92f6791095491e44c5712e14f00f2e19b52dc9f6)
 

Author SHA1 Message Date
digger yu de0d7df33f
[nfc] fix typo colossalai/zero (#3923) 1 year ago
Hongxin Liu 12c90db3f3
[doc] add lazy init tutorial (#3922) 1 year ago
Maruyama_Aya c94a33579b modify shell for check 1 year ago
digger yu a9d1cadc49
fix typo with colossalai/trainer utils zero (#3908) 1 year ago
Liu Ziming b306cecf28
[example] Modify palm example with the new booster API (#3913) 1 year ago
wukong1992 a55fb00c18
[booster] update bert example, using booster api (#3885) 1 year ago
Frank Lee 5e2132dcff
[workflow] added docker latest tag for release (#3920) 1 year ago
Hongxin Liu c25d421f3e
[devops] hotfix testmon cache clean logic (#3917) 1 year ago
Frank Lee d51e83d642
Merge pull request #3916 from FrankLeeeee/sync/dtensor-with-develop 1 year ago
Frank Lee c622bb3630
Merge pull request #3915 from FrankLeeeee/update/develop 1 year ago
Hongxin Liu 9c88b6cbd1
[lazy] fix compatibility problem on torch 1.13 (#3911) 1 year ago
Maruyama_Aya 4fc8bc68ac modify file path 1 year ago
Hongxin Liu b5f0566363
[chat] add distributed PPO trainer (#3740) 1 year ago
Hongxin Liu 41fb7236aa
[devops] hotfix CI about testmon cache (#3910) 1 year ago
Maruyama_Aya b4437e88c3 fixed port 1 year ago
Maruyama_Aya 79c9f776a9 fixed port 1 year ago
Maruyama_Aya d3379f0be7 fixed model saving bugs 1 year ago
Maruyama_Aya b29e1f0722 change directory 1 year ago
Maruyama_Aya 1c1f71cbd2 fixing insecure hash function 1 year ago
Maruyama_Aya b56c7f4283 update shell file 1 year ago
Maruyama_Aya 176010f289 update performance evaluation 1 year ago
digger yu 0e484e6201
[nfc]fix typo colossalai/pipeline tensor nn (#3899) 1 year ago
Baizhou Zhang c1535ccbba
[doc] fix docs about booster api usage (#3898) 1 year ago
Hongxin Liu ec9bbc0094
[devops] improving testmon cache (#3902) 1 year ago
Yuanchen 57a6d7685c
support evaluation for english (#3880) 1 year ago
digger yu 1878749753
[nfc] fix typo colossalai/nn (#3887) 1 year ago
Hongxin Liu ae02d4e4f7
[bf16] add bf16 support (#3882) 1 year ago
jiangmingyan 07cb21142f
[doc]update moe chinese document. (#3890) 1 year ago
Liu Ziming 8065cc5fba
Modify torch version requirement to adapt torch 2.0 (#3896) 1 year ago
Hongxin Liu dbb32692d2
[lazy] refactor lazy init (#3891) 1 year ago
Maruyama_Aya 25447d4407 modify path 1 year ago
Maruyama_Aya 42e3232bc0 roll back 1 year ago
Maruyama_Aya 60ec33bb18 Add a new example of Dreambooth training using the booster API 1 year ago
digger yu 70c8cdecf4
[nfc] fix typo colossalai/cli fx kernel (#3847) 1 year ago
Maruyama_Aya 46503c35dd Modify torch version requirement to adapt torch 2.0 2 years ago
jiangmingyan 281b33f362
[doc] update document of zero with chunk. (#3855) 2 years ago
jiangmingyan 5f79008c4a
[example] update gemini examples (#3868) 2 years ago
Yuanchen 2506e275b8
[evaluation] improvement on evaluation (#3862) 2 years ago
jiangmingyan b0474878bf
[doc] update nvme offload documents. (#3850) 2 years ago
Frank Lee ae959a72a5
[workflow] fixed workflow check for docker build (#3849) 2 years ago
Frank Lee d42b1be09d
[release] bump to v0.3.0 (#3830) 2 years ago
digger yu e2d81eba0d
[nfc] fix typo colossalai/ applications/ (#3831) 2 years ago
jiangmingyan a64df3fa97
[doc] update document of gemini instruction. (#3842) 2 years ago
Frank Lee 54e97ed7ea
[workflow] supported test on CUDA 10.2 (#3841) 2 years ago
wukong1992 3229f93e30
[booster] add warning for torch fsdp plugin doc (#3833) 2 years ago
Hongxin Liu 7c9f2ed6dd
[dtensor] polish sharding spec docstring (#3838) 2 years ago
Frank Lee 84500b7799
[workflow] fixed testmon cache in build CI (#3806) 2 years ago
digger yu 518b31c059
[docs] change placememt_policy to placement_policy (#3829) 2 years ago
digger yu e90fdb1000 fix typo docs/ 2 years ago
Yuanchen 34966378e8
[evaluation] add automatic evaluation pipeline (#3821) 2 years ago