768 Commits (788e07dbc5dc5acaf34e24d98238780ecf134ef2)

Author SHA1 Message Date
1SAA 219df6e685 Optimized MoE layer and fixed some bugs; 3 years ago
zbian 3dba070580 fixed padding index issue for vocab parallel embedding layers; updated 3D linear to be compatible with examples in the tutorial 3 years ago
アマデウス 9ee197d0e9 moved env variables to global variables; (#215) 3 years ago
Jiarui Fang 569357fea0
add pytorch hooks (#179) 3 years ago
Frank Lee e2089c5c15
adapted for sequence parallel (#163) 3 years ago
ver217 7bf1e98b97
pipeline last stage supports multi output (#151) 3 years ago
ver217 96780e6ee4
Optimize pipeline schedule (#94) 3 years ago
アマデウス 01a80cd86d
Hotfix/Colossalai layers (#92) 3 years ago
アマデウス 0fedef4f3c
Layer integration (#83) 3 years ago
ver217 8f02a88db2
add interleaved pipeline, fix naive amp and update pipeline model initializer (#80) 3 years ago
Frank Lee 91c327cb44
fixed zero level 3 dtype bug (#76) 3 years ago
Frank Lee cd9c28e055
added CI for unit testing (#69) 3 years ago
Frank Lee da01c234e1
Develop/experiments (#59) 3 years ago
Frank Lee 3defa32aee
Support TP-compatible Torch AMP and Update trainer API (#27) 3 years ago
アマデウス 3245a69fc2
cleaned test scripts 3 years ago
zbian 404ecbdcc6 Migrated project 3 years ago