InternLM/internlm/train
ytxiong 809ad9ebc8
fix the type_ids when micro_num=1 and use_flash_attn=False (#516)
2023-12-06 14:38:28 +08:00
..
__init__.py feat(train): add fsdp training option (#293) 2023-10-09 18:59:31 +08:00
training_internlm.py fix the type_ids when micro_num=1 and use_flash_attn=False (#516) 2023-12-06 14:38:28 +08:00
utils.py fix(moe): remove norm&gate force sync (#448) 2023-11-01 11:29:55 +08:00