__init__.py
|
feat(moe): add moe module (#182)
|
2023-09-27 15:54:53 +08:00 |
linear.py
|
feat(linear): optimize mlp by using jit (#321)
|
2023-09-19 14:57:43 +08:00 |
loss.py
|
initial commit
|
2023-07-06 12:55:23 +08:00 |
modeling_moe.py
|
feat(moe): add moe module (#182)
|
2023-09-27 15:54:53 +08:00 |
moe.py
|
feat(moe): add moe module (#182)
|
2023-09-27 15:54:53 +08:00 |
multi_head_attention.py
|
Fit to flash attention 1.0.5.
|
2023-10-09 21:03:16 +08:00 |
norm.py
|
Merge develop to main (#233)
|
2023-08-24 22:03:04 +08:00 |
utils.py
|
feat(moe): add moe module (#182)
|
2023-09-27 15:54:53 +08:00 |