bert.py
|
[fp8] support hybrid parallel plugin (#5982)
|
2024-08-12 18:17:05 +08:00 |
bloom.py
|
[fp8] support hybrid parallel plugin (#5982)
|
2024-08-12 18:17:05 +08:00 |
deepseek.py
|
fix the merge
|
2024-08-21 02:58:23 +00:00 |
gpt2.py
|
[fp8] support hybrid parallel plugin (#5982)
|
2024-08-12 18:17:05 +08:00 |
gptj.py
|
[fp8] support hybrid parallel plugin (#5982)
|
2024-08-12 18:17:05 +08:00 |
llama.py
|
fix
|
2024-08-21 03:58:21 +00:00 |
mistral.py
|
[FP8] rebase main (#5963)
|
2024-08-06 16:29:37 +08:00 |
mixtral.py
|
fix the merge
|
2024-08-21 02:58:23 +00:00 |
opt.py
|
[FP8] rebase main (#5963)
|
2024-08-06 16:29:37 +08:00 |
qwen2.py
|
[fp8] support hybrid parallel plugin (#5982)
|
2024-08-12 18:17:05 +08:00 |
sam.py
|
[shardformer]delete xformers (#5859)
|
2024-06-28 11:20:04 +08:00 |