Tong Li
dacc04ef75
fix bug
2 weeks ago
Tong Li
375e356a16
update prm
2 weeks ago
Tong Li
a8b4afb747
add prm
2 weeks ago
pre-commit-ci[bot]
ab992b89e4
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2 weeks ago
Tong Li
852333423d
update prm
2 weeks ago
Tong Li
f2f5ff5e24
update init
2 weeks ago
Tong Li
5719974a1e
Merge branch 'feat/prm' of github.com:TongLi3701/ColossalAI into feat/prm
2 weeks ago
Tong Li
797a81a8e2
add loss
2 weeks ago
pre-commit-ci[bot]
38a7f3846d
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2 weeks ago
Tong Li
9995119c28
update init
2 weeks ago
Tong Li
9ff9dc3d4a
Merge branch 'feat/prm' of github.com:TongLi3701/ColossalAI into feat/prm
2 weeks ago
Tong Li
c606d1101c
add tokenize func
2 weeks ago
Tong Li
b6ec337f3d
update tokenize function
2 weeks ago
pre-commit-ci[bot]
0bfb0d32a8
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
3 weeks ago
Tong Li
794e0d4f4a
update conversation
3 weeks ago
Tong Li
73ebbef3a3
add prm dataset example
3 weeks ago
Tong Li
1210dbea97
update tokenization function
3 weeks ago
Tong Li
dcb509c8e3
Merge branch 'feat/prm' of github.com:TongLi3701/ColossalAI into feat/prm
3 weeks ago
Tong Li
ed817a29f9
update
3 weeks ago
pre-commit-ci[bot]
308960534f
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
3 weeks ago
Tong Li
6c619c9992
update best answer function
3 weeks ago
Tong Li
30a9443132
[Coati] Refine prompt for better inference (#6117)
* refine prompt
* update prompt
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
3 weeks ago
Tong Li
7a60161035
update readme (#6116)
3 weeks ago
Hongxin Liu
a15ab139ad
[plugin] support get_grad_norm (#6115)
3 weeks ago
Hongxin Liu
13ffa08cfa
[release] update version (#6109)
4 weeks ago
pre-commit-ci[bot]
2f583c1549
[pre-commit.ci] pre-commit autoupdate (#6078)
updates:
- [github.com/psf/black-pre-commit-mirror: 24.8.0 → 24.10.0](https://github.com/psf/black-pre-commit-mirror/compare/24.8.0...24.10.0)
- [github.com/pre-commit/mirrors-clang-format: v18.1.8 → v19.1.2](https://github.com/pre-commit/mirrors-clang-format/compare/v18.1.8...v19.1.2)
- [github.com/pre-commit/pre-commit-hooks: v4.6.0 → v5.0.0](https://github.com/pre-commit/pre-commit-hooks/compare/v4.6.0...v5.0.0)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
4 weeks ago
Hongxin Liu
c2e8f61592
[checkpointio] fix hybrid plugin model save (#6106)
4 weeks ago
Tong Li
89a9a600bc
[MCTS] Add self-refined MCTS (#6098)
* add reasoner
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update code
* delete llama
* update prompts
* update readme
* update readme
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
1 month ago
binmakeswell
4294ae83bb
[doc] sora solution news (#6100)
* [doc] sora solution news
* [doc] sora solution news
1 month ago
Hongxin Liu
80a8ca916a
[extension] hotfix compile check (#6099)
1 month ago
Hanks
dee63cc5ef
Merge pull request #6096 from BurkeHulk/hotfix/lora_ckpt
[hotfix] fix lora ckpt saving format
1 month ago
BurkeHulk
6d6cafabe2
pre-commit fix
1 month ago
BurkeHulk
b10339df7c
fix lora ckpt save format (ColoTensor to Tensor)
1 month ago
Hongxin Liu
19baab5fd5
[release] update version (#6094)
1 month ago
Hongxin Liu
58d8b8a2dd
[misc] fit torch api upgradation and remove legacy import (#6093)
* [amp] fit torch's new api
* [amp] fix api call
* [amp] fix api call
* [misc] fit torch pytree api upgrade
* [misc] remove legacy import
* [misc] fit torch amp api
* [misc] fit torch amp api
1 month ago
Hongxin Liu
5ddad486ca
[fp8] add fallback and make compile option configurable (#6092)
1 month ago
botbw
3b1d7d1ae8
[chore] refactor
1 month ago
botbw
2bcd0b6844
[ckpt] add safetensors util
1 month ago
Hongxin Liu
cd61353bae
[pipeline] hotfix backward for multiple outputs (#6090)
* [pipeline] hotfix backward for multiple outputs
* [pipeline] hotfix backward for multiple outputs
1 month ago
Wenxuan Tan
62c13e7969
[Ring Attention] Improve comments (#6085)
* improve comments
* improve comments
---------
Co-authored-by: Edenzzzz <wtan45@wisc.edu>
1 month ago
Wang Binluo
dcd41d0973
Merge pull request #6071 from wangbluo/ring_attention
[Ring Attention] fix the 2d ring attn when using multiple machine
1 month ago
wangbluo
83cf2f84fb
fix
1 month ago
wangbluo
bc7eeade33
fix
1 month ago
wangbluo
fd92789af2
fix
1 month ago
wangbluo
6be9862aaf
fix
2 months ago
wangbluo
3dc08c8a5a
fix
2 months ago
wangbluo
8ff7d0c780
fix
2 months ago
wangbluo
fe9208feac
fix
2 months ago
wangbluo
3201377e94
fix
2 months ago
wangbluo
23199e34cc
fix
2 months ago