core | add output embedding tf32 option (#523) | 2023-12-06 13:50:59 +08:00
model | fix default behavior | 2023-12-11 17:43:30 +08:00
moe | fix(moe): remove norm&gate force sync (#448) | 2023-11-01 11:29:55 +08:00
monitor | fix(alert): send exception of all ranks (#491) | 2023-11-10 19:04:31 +08:00
solver | feat(grad_norm): vocab grad norm profiling (#519) | 2023-12-06 13:52:42 +08:00
utils | fix(storage): unify the name of ak & sk (#527) | 2023-12-06 15:31:44 +08:00
__init__.py | initial commit | 2023-07-06 12:55:23 +08:00