58 Commits (refactor/inference)

Author SHA1 Message Date
Jianghai cf579ff46d [Inference] Dynamic Batching Inference, online and offline (#4953) 1 year ago
Hongxin Liu b8e770c832 [test] merge old components to test to model zoo (#4945) 1 year ago
Zhongkai Zhao db40e086c8 [test] modify model supporting part of low_level_zero plugin (including correspoding docs) 1 year ago
Jianghai ce7ade3882 [inference] chatglm2 infer demo (#4724) 1 year ago
Hongxin Liu 079bf3cb26 [misc] update pre-commit and run all files (#4752) 1 year ago
digger yu 9c2feb2f0b fix some typo with colossalai/device colossalai/tensor/ etc. (#4171) 1 year ago
flybird11111 eedaa3e1ef [shardformer]fix gpt2 double head (#4663) 1 year ago
flybird11111 7486ed7d3a [shardformer] update llama2/opt finetune example and fix llama2 policy (#4645) 1 year ago
Hongxin Liu 27061426f7 [gemini] improve compatibility and add static placement policy (#4479) 1 year ago
flybird11111 59e252ecdb [shardformer] chatglm support sequence parallel (#4482) 1 year ago
Jianghai 5545114fd8 rename chatglm to chatglm2 (#4484) 1 year ago
flybird11111 108e54a0b4 [shardformer]update t5 tests for using all optimizations. (#4407) 1 year ago
flybird11111 1edc9b5fb3 [shardformer] update tests for all optimization (#4413) 1 year ago
Baizhou Zhang 7711bd524a [shardformer] rewrite tests for opt/bloom/llama/vit/chatglm (#4395) 1 year ago
Jianghai 7596e9ae08 [pipeline] rewrite bert tests and fix some bugs (#4409) 1 year ago
flybird1111 906426cb44 [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
Jianghai a88e92251d [pipeline] add chatglm (#4363) 1 year ago
Baizhou Zhang b1feeced8e [shardformer] add util functions for shardformer tests/fix sync_shared_param (#4366) 1 year ago
Bin Jia 5c6f183192 [test] Hotfix/fix some model test and refactor check util api (#4369) 1 year ago
FoolPlayer 879301d0da [shardformer] support Blip2 (#4243) 1 year ago
klhhhhh 8120eca0c0 [shardformer] support ChatGLMForConditionalGeneration & add fusedlayernorm for vit 1 year ago
klhhhhh 4da05052f4 [shardformer] pre-commit check files 1 year ago
klhhhhh f155ae89c4 [shardformer] ChatGLM support layernorm sharding 1 year ago
klhhhhh 00f6ef159d [shardformer] delete some file 1 year ago
klhhhhh dad00c42aa [shardformer] support chatglm without layernorm 1 year ago
klhhhhh cbb54d3202 [shardformer] polish code 1 year ago
klhhhhh 6ee4c9ee21 [shardformer] add test kit in model zoo for chatglm 1 year ago
klhhhhh 7377be7a53 import chatglm 1 year ago
Kun Lin ed34bb1310 Feature/chatglm (#4240) 1 year ago
FoolPlayer 9ee4ebea83 [shardformer] support whisper (#4212) 1 year ago
FoolPlayer dd2bf02679 [shardformer] support SAM (#4231) 1 year ago
Baizhou Zhang 0ceec8f9a9 [pipeline] support fp32 for HybridPlugin/merge shardformer test and pipeline test into one file (#4354) 1 year ago
FoolPlayer b3f5d7a3ba [shardformer] support pipeline base vit model (#4284) 1 year ago
Baizhou Zhang 36e546b2cc [pipeline] add pipeline support for T5Stack/T5EncoderModel (#4300) 1 year ago
Baizhou Zhang 2a2eacfaf1 [pipeline] support shardformer for GPT2ForQuestionAnswering & complete pipeline support for GPT2 (#4245) 1 year ago
Jianghai e7cc62d735 [pipeline] All bert models (#4233) 1 year ago
Jianghai 37d22f6878 [pipeline] add bloom model pipeline (#4210) 1 year ago
Jianghai 31bcf867ae [pipeline] Llama causal lm and llama for sequence classification pipeline (#4208) 1 year ago
Jianghai 1622031058 [pipeline] Llama pipeline (#4205) 1 year ago
Frank Lee ae035d305d [shardformer] added embedding gradient check (#4124) 1 year ago
Frank Lee b1c2901530 [shardformer] supported bloom model (#4098) 1 year ago
jiangmingyan ac80937138 [shardformer] shardformer support opt models (#4091) 1 year ago
FoolPlayer 7740c55c55 support kit use for bert/gpt test (#4055) 1 year ago
Frank Lee 58df720570 [shardformer] adapted T5 and LLaMa test to use kit (#4049) 1 year ago
digger yu e61ffc77c6 fix typo tests/ (#3936) 1 year ago
Hongxin Liu 4b3240cb59 [booster] add low level zero plugin (#3594) 2 years ago
YuliangLiu0306 f57d34958b [FX] refactor experimental tracer and adapt it with hf models (#3157) 2 years ago
Frank Lee 085e7f4eff [test] fixed torchrec registration in model zoo (#3177) 2 years ago
Frank Lee 1ad3a636b1 [test] fixed torchrec model test (#3167) 2 years ago
ver217 6ae8ed0407 [lazyinit] add correctness verification (#3147) 2 years ago