InternLM/internlm
Qu Wenwen 7aced82ec7 remove some unused function (is norm/gate group) 2023-11-01 11:20:16 +08:00
..
apis initial commit 2023-07-06 12:55:23 +08:00
core fix(*)/all-reduce for norm in sequence parallel (#443) 2023-10-25 14:16:32 +08:00
data Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
initialize fix merged error 2023-10-27 15:10:16 +08:00
model Merge upstream/develop into fix/add_zero_broadcast_sync 2023-10-27 11:05:53 +08:00
moe delete old sync logic 2023-10-27 11:05:17 +08:00
monitor feat(monitor): send exception to light monitor (#420) 2023-10-18 21:00:21 +08:00
solver remove some unused function (is norm/gate group) 2023-11-01 11:20:16 +08:00
train Merge upstream/develop into fix/add_zero_broadcast_sync 2023-10-27 17:45:34 +08:00
utils feat(optimizer): zero gradient count (#449) 2023-10-27 16:26:55 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00