Hongxin Liu
|
079bf3cb26
|
[misc] update pre-commit and run all files (#4752)
* [misc] update pre-commit
* [misc] run pre-commit
* [misc] remove useless configuration files
* [misc] ignore cuda for clang-format
|
2023-09-19 14:20:26 +08:00 |
Hongxin Liu
|
0b00def881
|
[example] add llama2 example (#4527)
* [example] transfer llama-1 example
* [example] fit llama-2
* [example] refactor scripts folder
* [example] fit new gemini plugin
* [cli] fix multinode runner
* [example] fit gemini optim checkpoint
* [example] refactor scripts
* [example] update requirements
* [example] update requirements
* [example] rename llama to llama2
* [example] update readme and pretrain script
* [example] refactor scripts
|
2023-08-28 17:59:11 +08:00 |
flybird1111
|
38b792aab2
|
[coloattention] fix import error (#4380)
fixed an import error
|
2023-08-04 16:28:41 +08:00 |
flybird1111
|
25c57b9fb4
|
[fix] coloattention support flash attention 2 (#4347)
Improved ColoAttention interface to support flash attention 2. Solved #4322
|
2023-08-04 13:46:22 +08:00 |