ColossalAI/tests/kit/model_zoo/transformers
Hongxin Liu bbb2c21f16
[shardformer] fix chatglm implementation (#5644)
* [shardformer] fix chatglm policy

* [shardformer] fix chatglm flash attn

* [shardformer] update readme

* [shardformer] fix chatglm init

* [shardformer] fix chatglm test

* [pipeline] fix chatglm merge batch
2024-04-25 14:41:17 +08:00
..
__init__.py [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 2023-11-28 16:54:42 +08:00
albert.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
bert.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
blip2.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
bloom.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
chatglm2.py [shardformer] fix chatglm implementation (#5644) 2024-04-25 14:41:17 +08:00
falcon.py [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 2023-11-28 16:54:42 +08:00
gpt.py [shardformer] Sequence Parallelism Optimization (#5533) 2024-04-03 17:15:47 +08:00
gptj.py [workflow] fixed oom tests (#5275) 2024-01-16 18:55:13 +08:00
llama.py [shardformer] update transformers (#5583) 2024-04-24 22:51:50 +08:00
mistral.py [shardformer] update transformers (#5583) 2024-04-24 22:51:50 +08:00
opt.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
sam.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
t5.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
vit.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
whisper.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00