Commit Graph

2941 Commits (196b85368b9a30d2f7c7436cccdd623d106491ad)
 

Author SHA1 Message Date
Wenhao Chen 196b85368b [pipeline]: add p2p fallback order and fix interleaved pp deadlock (#5214)
11 months ago
Wenhao Chen 931d0e0731 [pipeline]: support arbitrary batch size in forward_only mode (#5201)
11 months ago
Wenhao Chen 1810b9100f [pipeline]: fix p2p comm, add metadata cache and support llama interleaved pp (#5134)
11 months ago
Xuanlei Zhao 6b69f3085b update
11 months ago
Xuanlei Zhao 8ca8cf8ec3 update optim
11 months ago
Xuanlei Zhao f037583bd2 update train
11 months ago
Xuanlei Zhao 0b8c33f474 update
11 months ago
Xuanlei Zhao c1c6af6368 update
11 months ago
Xuanlei Zhao 0bb317d9e6 update
11 months ago
Xuanlei Zhao ccad7014c6 update optim
11 months ago
Xuanlei Zhao 44014faa67 fix optim
11 months ago
Xuanlei Zhao 0a3aae509b update utils and fwd bwd
11 months ago
Xuanlei Zhao a5580e6289 update test
11 months ago
Xuanlei Zhao 73aa406b96 update
11 months ago
Xuanlei Zhao 570f5cd693 update pytest
11 months ago
Xuanlei Zhao 54b197cc02 update readme
11 months ago
Xuanlei Zhao 4922641098 script
11 months ago
Xuanlei Zhao d660a41850 update
11 months ago
Xuanlei Zhao b8fadb68a7 add pad
11 months ago
Xuanlei Zhao 23341687ed update
11 months ago
Xuanlei Zhao aa2e091dc6 update
11 months ago
Xuanlei Zhao 7c5b1a585f update
12 months ago
Xuanlei Zhao ebd8cc579a update script
12 months ago
Xuanlei Zhao f66469e209 update
12 months ago
Xuanlei Zhao 8aef2dba02 init
12 months ago
flybird11111 79718fae04
[shardformer] llama support DistCrossEntropy (#5176)
12 months ago
Yuanchen cefdc32615
[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169)
12 months ago
Michelle b07a6f4e27
[colossalqa] fix pangu api (#5170)
12 months ago
flybird11111 21aa5de00b
[gemini] hotfix NaN loss while using Gemini + tensor_parallel (#5150)
12 months ago
Yuanchen b397104438
[Colossal-Llama-2] Add finetuning Colossal-Llama-2 example (#4878)
12 months ago
flybird11111 3dbbf83f1c
fix (#5158)
12 months ago
Michelle 368b5e3d64
[doc] fix colossalqa document (#5146)
1 year ago
Michelle c7fd9a5213
[ColossalQA] refactor server and webui & add new feature (#5138)
1 year ago
flybird11111 2a2ec49aa7
[plugin]fix 3d checkpoint load when booster boost without optimizer. (#5135)
1 year ago
github-actions[bot] f6731db67c
[format] applied code formatting on changed files in pull request 5115 (#5118)
1 year ago
github-actions[bot] 9b36640f28
[format] applied code formatting on changed files in pull request 5124 (#5125)
1 year ago
github-actions[bot] d10ee42f68
[format] applied code formatting on changed files in pull request 5088 (#5127)
1 year ago
digger yu 9110406a47
fix typo change JOSNL TO JSONL etc. (#5116)
1 year ago
Frank Lee 2899cfdabf
[doc] updated paper citation (#5131)
1 year ago
binmakeswell 177c79f2d1
[doc] add moe news (#5128)
1 year ago
Wenhao Chen 7172459e74
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)
1 year ago
アマデウス 126cf180bc
[hotfix] fixed memory usage of shardformer module replacement (#5122)
1 year ago
Zian(Andy) Zheng 7b789f4dd2 [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095)
1 year ago
digger yu d5661f0f25
[nfc] fix typo change directoty to directory (#5111)
1 year ago
digger yu 2bdf76f1f2
fix typo change lazy_iniy to lazy_init (#5099)
1 year ago
Xuanlei Zhao 68fcaa2225
remove duplicate import (#5100)
1 year ago
YeAnbang e53e729d8e
[Feature] Add document retrieval QA (#5020)
1 year ago
Xuanlei Zhao 3acbf6d496
[npu] add npu support for hybrid plugin and llama (#5090)
1 year ago
flybird11111 aae496631c
[shardformer]fix flash attention, when mask is casual, just don't unpad it (#5084)
1 year ago
Zhongkai Zhao 75af66cd81
[Hotfix] Fix model policy matching strategy in ShardFormer (#5064)
1 year ago