Wenhao Chen
|
6d41c3f2aa
|
[doc] update Coati README (#4405)
* style: apply formatter
* fix: add outdated warnings
* docs: add dataset format and polish
* docs: polish README
* fix: fix json format
* fix: fix typos
* revert: revert 7b example
|
2023-08-14 15:26:27 +08:00 |
Wenhao Chen
|
da4f7b855f
|
[chat] fix bugs and add unit tests (#4213)
* style: rename replay buffer
Experience replay is typically for off policy algorithms.
Use this name in PPO maybe misleading.
* fix: fix wrong zero2 default arg
* test: update experience tests
* style: rename zero_pad fn
* fix: defer init in CycledDataLoader
* test: add benchmark test
* style: rename internal fn of generation
* style: rename internal fn of lora
* fix: remove unused loss fn
* fix: remove unused utils fn
* refactor: remove generate_with_actor fn
* fix: fix type annotation
* test: add models tests
* fix: skip llama due to long execution time
* style: modify dataset
* style: apply formatter
* perf: update reward dataset
* fix: fix wrong IGNORE_INDEX in sft dataset
* fix: remove DataCollatorForSupervisedDataset
* test: add dataset tests
* style: apply formatter
* style: rename test_ci to test_train
* feat: add llama in inference
* test: add inference tests
* test: change test scripts directory
* fix: update ci
* fix: fix typo
* fix: skip llama due to oom
* fix: fix file mod
* style: apply formatter
* refactor: remove duplicated llama_gptq
* style: apply formatter
* to: update rm test
* feat: add tokenizer arg
* feat: add download model script
* test: update train tests
* fix: modify gemini load and save pretrained
* test: update checkpoint io test
* to: modify nproc_per_node
* fix: do not remove existing dir
* fix: modify save path
* test: add random choice
* fix: fix sft path
* fix: enlarge nproc_per_node to avoid oom
* fix: add num_retry
* fix: make lora config of rm and critic consistent
* fix: add warning about lora weights
* fix: skip some gpt2 tests
* fix: remove grad ckpt in rm and critic due to errors
* refactor: directly use Actor in train_sft
* test: add more arguments
* fix: disable grad ckpt when using lora
* fix: fix save_pretrained and related tests
* test: enable zero2 tests
* revert: remove useless fn
* style: polish code
* test: modify test args
|
2023-08-02 10:17:36 +08:00 |
Yuanchen
|
dc1b6127f9
|
[NFC] polish applications/Chat/inference/server.py code style (#4274)
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
|
2023-07-26 14:12:57 +08:00 |
Camille Zhong
|
915ed8bed1
|
[NFC] polish applications/Chat/inference/requirements.txt code style (#4265)
|
2023-07-26 14:12:57 +08:00 |
digger-yu
|
ad6460cf2c
|
[NFC] fix typo applications/ and colossalai/ (#3735)
|
2023-05-15 11:46:25 +08:00 |
kingkingofall
|
57a3c4db6d
|
[chat]fix readme (#3429)
* fix stage 2
fix stage 2
* add torch
|
2023-04-06 10:58:53 +08:00 |
ver217
|
62f7156131
|
[coati] fix inference profanity check (#3299)
|
2023-03-29 04:26:35 +08:00 |
ver217
|
73b542a124
|
[coati] inference supports profanity check (#3295)
|
2023-03-29 02:14:35 +08:00 |
ver217
|
ce2cafae76
|
[coati] add repetition_penalty for inference (#3294)
|
2023-03-29 01:18:45 +08:00 |
ver217
|
4905b21b94
|
[coati] fix inference output (#3285)
* [coati] fix inference requirements
* [coati] add output postprocess
* [coati] update inference readme
* [coati] fix inference requirements
|
2023-03-28 21:20:28 +08:00 |
Fazzie-Maqianli
|
b0ce5a1032
|
[Coati] first commit (#3283)
|
2023-03-28 20:25:36 +08:00 |