ColossalAI

Wenhao Chen 7172459e74 [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)	1 year ago
* [shardformer] implement policy for all GPT-J models and test
* [shardformer] support interleaved pipeline parallel for bert finetune
* [shardformer] shardformer support falcon (#4883)
* [shardformer]: fix interleaved pipeline for bert model (#5048)
* [hotfix]: disable seq parallel for gptj and falcon, and polish code (#5093)
* Add Mistral support for Shardformer (#5103)
* [shardformer] add tests to mistral (#5105)
---------
Co-authored-by: Pengtai Xu <henryxu880@gmail.com>
Co-authored-by: ppt0011 <143150326+ppt0011@users.noreply.github.com>
Co-authored-by: flybird11111 <1829166702@qq.com>
Co-authored-by: eric8607242 <e0928021388@gmail.com>
kit	[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)	1 year ago
test_analyzer	…
test_auto_parallel	…
test_autochunk	…
test_booster	[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)	1 year ago
test_checkpoint_io	…
test_cluster	…
test_config	…
test_device	…
test_fx	…
test_gptq	…
test_infer	[Hotfix] Fix model policy matching strategy in ShardFormer (#5064)	1 year ago
test_infer_ops/triton	…
test_lazy	…
test_legacy	[npu] add npu support for gemini and zero (#5067)	1 year ago
test_moe	…
test_optimizer	…
test_pipeline	[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)	1 year ago
test_shardformer	[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)	1 year ago
test_smoothquant	…
test_tensor	…
test_utils	…
test_zero	[npu] add npu support for gemini and zero (#5067)	1 year ago
__init__.py	…