ColossalAI

Commit Graph

Author	SHA1	Message	Date
ver217	19ad49fb3b	[chatgpt] making experience support dp (#2971 ) * [chatgpt] making experience support dp * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update sampler * [chatgpt] update example test ci * [chatgpt] refactor sampler * [chatgpt] update example test ci	2 years ago
github-actions[bot]	827a0af8cc	Automated submodule synchronization (#2982 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
binmakeswell	9b4ceefc21	[doc] update news (#2983 ) * [doc] update news * [doc] update news	2 years ago
BlueRum	c9e27f0d1b	[chatgpt]fix lora bug (#2974 ) * fix lora bug * polish	2 years ago
BlueRum	82149e9d1b	[chatgpt] fix inference demo loading bug (#2969 ) * [chatgpt] fix inference demo loading bug * polish	2 years ago
Fazzie-Maqianli	bbf9c827c3	[ChatGPT] fix README (#2966 ) * Update README.md * fix README * Update README.md * Update README.md --------- Co-authored-by: fastalgo <youyang@cs.berkeley.edu> Co-authored-by: BlueRum <70618399+ht-zhou@users.noreply.github.com>	2 years ago
binmakeswell	b0a8766381	[doc] fix chatgpt inference typo (#2964 )	2 years ago
github-actions[bot]	0d07514988	Automated submodule synchronization (#2951 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
YuliangLiu0306	e414e4092b	[DTensor] implementation of dtensor (#2946 ) * [DTensor] implementation of dtensor * test layout convert * polish	2 years ago
BlueRum	489a9566af	[chatgpt]add inference example (#2944 ) * [chatgpt] support inference example * Create inference.sh * Update README.md * Delete inference.sh * Update inference.py	2 years ago
YuliangLiu0306	47fb214b3b	[hotfix] add shard dim to aviod backward communication error (#2954 )	2 years ago
ver217	090f14fd6b	[misc] add reference (#2930 ) * [misc] add reference * [misc] add license	2 years ago
github-actions[bot]	dca98937f8	[format] applied code formatting on changed files in pull request 2933 (#2939 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
binmakeswell	8264cd7ef1	[doc] add env scope (#2933 )	2 years ago
Frank Lee	b8804aa60c	[doc] added readme for documentation (#2935 )	2 years ago
Frank Lee	9e3b8b7aff	[doc] removed read-the-docs (#2932 )	2 years ago
Frank Lee	77b88a3849	[workflow] added auto doc test on PR (#2929 ) * [workflow] added auto doc test on PR * [workflow] added doc test workflow * polish code * polish code * polish code * polish code * polish code * polish code * polish code	2 years ago
YuliangLiu0306	197d0bf4ed	[autoparallel] apply repeat block to reduce solving time (#2912 )	2 years ago
YH	a848091141	Fix port exception type (#2925 )	2 years ago
zbian	61e687831d	fixed using zero with tp cannot access weight correctly	2 years ago
github-actions[bot]	eb5cf94332	Automated submodule synchronization (#2927 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
github-actions[bot]	da056285f2	[format] applied code formatting on changed files in pull request 2922 (#2923 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
binmakeswell	12bafe057f	[doc] update installation for GPT (#2922 )	2 years ago
binmakeswell	0afb55fc5b	[doc] add os scope, update tutorial install and tips (#2914 )	2 years ago
YH	7b13f7db18	[zero] trivial zero optimizer refactoring (#2869 ) * Fix mionr grad store interface * Apply lint	2 years ago
fastalgo	dbc01b9c04	Update README.md	2 years ago
Frank Lee	e33c043dec	[workflow] moved pre-commit to post-commit (#2895 )	2 years ago
Jiatong (Julius) Han	8c8a39be95	[hotfix]: Remove math.prod dependency (#2837 ) * Remove math.prod dependency * Fix style * Fix style --------- Co-authored-by: Jiatong Han <jiatong.han@u.nus.edu>	2 years ago
YuliangLiu0306	819e25d8b1	[hotfix] fix autoparallel compatibility test issues (#2754 )	2 years ago
YuliangLiu0306	0f392d7403	[autoparallel] find repeat blocks (#2854 ) * [autoparallel] find repeat blocks * polish * polish * polish	2 years ago
BlueRum	2e16f842a9	[chatgpt]support opt & gpt for rm training (#2876 )	2 years ago
junxu	c52edcf0eb	Rename class method of ZeroDDP (#2692 )	2 years ago
HELSON	6e4ac08172	[hotfix] fix chunk size can not be divided (#2867 ) * [hotfix] fix chunk size can not be divided * [hotfix] use numpy for python3.8	2 years ago
Alex_996	a4fc125c34	Fix typos (#2863 ) Fix typos, `6.7 -> 6.7b`	2 years ago
dawei-wang	55424a16a5	[doc] fix GPT tutorial (#2860 ) Fix hpcaitech/ColossalAI#2851	2 years ago
Boyuan Yao	eae77c831d	[autoparallel] Patch meta information for nodes that will not be handled by SPMD solver (#2823 ) * [autoparallel] non spmd meta information generator * [autoparallel] patch meta information for non spmd nodes	2 years ago
Boyuan Yao	c7764d3f22	[autoparallel] Patch meta information of `torch.where` (#2822 ) * [autoparallel] patch meta information of torch.where * [autoparallel] pre-commit modified	2 years ago
Boyuan Yao	fcc4097efa	[autoparallel] Patch meta information of `torch.tanh()` and `torch.nn.Dropout` (#2773 ) * [autoparallel] tanh meta information * [autoparallel] remove redundant code * [autoparallel] patch meta information of torch.nn.Dropout	2 years ago
BlueRum	34ca324b0d	[chatgpt] Support saving ckpt in examples (#2846 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit * add support of saving ckpt in examples * fix single-gpu save	2 years ago
Zheng Zeng	597914317b	[doc] fix typo in opt inference tutorial (#2849 )	2 years ago
Frank Lee	935346430f	[cli] handled version check exceptions (#2848 ) * [cli] handled version check exceptions * polish code	2 years ago
BlueRum	3eebc4dff7	[chatgpt] fix rm eval (#2829 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit	2 years ago
Frank Lee	918bc94b6b	[triton] added copyright information for flash attention (#2835 ) * [triton] added copyright information for flash attention * polish code	2 years ago
Boyuan Yao	7ea6bc7f69	[autoparallel] Patch tensor related operations meta information (#2789 ) * [autoparallel] tensor related meta information prototype * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information	2 years ago
github-actions[bot]	a5721229d9	Automated submodule synchronization (#2740 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
Haofan Wang	47ecb22387	[example] add LoRA support (#2821 ) * add lora * format	2 years ago
ver217	b6a108cb91	[chatgpt] add test checkpoint (#2797 ) * [chatgpt] add test checkpoint * [chatgpt] test checkpoint use smaller model	2 years ago
Michelle	c008d4ad0c	[NFC] polish colossalai/engine/schedule/_pipeline_schedule.py code style (#2744 )	2 years ago
mickogoin	58abde2857	Update README.md (#2791 ) Fixed typo on line 285 from "defualt" to "default"	2 years ago
Marco Rodrigues	89f0017a9c	Typo (#2826 )	2 years ago

1 2 3 4 5 ...

2157 Commits (052b03e83f30f46a43f87e2c9739ab04f56b6460) All Branches Search

2157 Commits (052b03e83f30f46a43f87e2c9739ab04f56b6460)

All Branches