12 Commits (8241c0c054b38a109ed3ce7be1052a1e600b8471)

Author SHA1 Message Date
Jianghai 61a1b2e798 [Inference] Fix bugs and docs for feat/online-server (#5598) 7 months ago
CjhHa1 7bbb28e48b [Inference] resolve rebase conflicts 7 months ago
flybird11111 a0ad587c24 [shardformer] refactor embedding resize (#5603) 7 months ago
flybird11111 576a2f7b10 [gemini] gemini support tensor parallelism. (#4942) 1 year ago
Hongxin Liu 079bf3cb26 [misc] update pre-commit and run all files (#4752) 1 year ago
ver217 73a4144b91 [shardformer] fix embedding 1 year ago
FoolPlayer 9ee4ebea83 [shardformer] support whisper (#4212) 1 year ago
Hongxin Liu d921ce8391 [shardformer] support inplace sharding (#4251) 1 year ago
Hongxin Liu 890774b2fb [shardformer] support lazy init (#4202) 1 year ago
Frank Lee 70c58cfd4f [shardformer] supported fused qkv checkpoint (#4073) 1 year ago
Frank Lee 8eb09a4c69 [shardformer] support module saving and loading (#4062) 1 year ago
Frank Lee f22ddacef0 [shardformer] refactored the shardformer layer structure (#4053) 1 year ago
FoolPlayer 4021b9a8a2 [shardformer] add gpt2 test and layer class refactor (#4041) 1 year ago