InternLM/internlm
Pryest a3580acb6c Fit to flash attention 1.0 2023-10-09 20:46:17 +08:00
..
apis initial commit 2023-07-06 12:55:23 +08:00
core feat(moe): add local data parallel support for experts (#376) 2023-09-28 13:38:02 +08:00
data Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
initialize feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
model Fit to flash attention 1.0 2023-10-09 20:46:17 +08:00
moe feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
monitor doc(monitor): add light monitoring doc (#352) 2023-09-25 19:28:09 +08:00
solver feat(moe): add local data parallel support for experts (#376) 2023-09-28 13:38:02 +08:00
train feat(moe): add local data parallel support for experts (#376) 2023-09-28 13:38:02 +08:00
utils feat(moe): add local data parallel support for experts (#376) 2023-09-28 13:38:02 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00