InternLM/internlm/train
Wenwen Qu 6cf0fec314 replace flashatten experts by feedforward experts 2023-09-08 18:04:57 +08:00
__init__.py feat(init): add skip args check flag and add zero overlap flag (#222) 2023-08-24 16:44:18 +08:00
training_internlm.py replace flashatten experts by feedforward experts 2023-09-08 18:04:57 +08:00