__init__.py
|
feat(moe): add moe module (#182)
|
2023-09-27 15:54:53 +08:00 |
linear.py
|
feat(linear): optimize mlp by using jit (#321)
|
2023-09-19 14:57:43 +08:00 |
loss.py
|
initial commit
|
2023-07-06 12:55:23 +08:00 |
metrics.py
|
feat(train): add fsdp training option (#293)
|
2023-10-09 18:59:31 +08:00 |
modeling_moe.py
|
implement overlap moe forward
|
2023-12-01 15:07:32 +08:00 |
norm.py
|
Merge develop to main (#233)
|
2023-08-24 22:03:04 +08:00 |
utils.py
|
fix(moe): remove norm&gate force sync (#448)
|
2023-11-01 11:29:55 +08:00 |