ColossalAI

Commit Graph

Author	SHA1	Message	Date
YuliangLiu0306	e414e4092b	[DTensor] implementation of dtensor (#2946 ) * [DTensor] implementation of dtensor * test layout convert * polish	2 years ago
BlueRum	489a9566af	[chatgpt]add inference example (#2944 ) * [chatgpt] support inference example * Create inference.sh * Update README.md * Delete inference.sh * Update inference.py	2 years ago
YuliangLiu0306	47fb214b3b	[hotfix] add shard dim to aviod backward communication error (#2954 )	2 years ago
ver217	090f14fd6b	[misc] add reference (#2930 ) * [misc] add reference * [misc] add license	2 years ago
github-actions[bot]	dca98937f8	[format] applied code formatting on changed files in pull request 2933 (#2939 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
binmakeswell	8264cd7ef1	[doc] add env scope (#2933 )	2 years ago
Frank Lee	b8804aa60c	[doc] added readme for documentation (#2935 )	2 years ago
Frank Lee	9e3b8b7aff	[doc] removed read-the-docs (#2932 )	2 years ago
Frank Lee	77b88a3849	[workflow] added auto doc test on PR (#2929 ) * [workflow] added auto doc test on PR * [workflow] added doc test workflow * polish code * polish code * polish code * polish code * polish code * polish code * polish code	2 years ago
YuliangLiu0306	197d0bf4ed	[autoparallel] apply repeat block to reduce solving time (#2912 )	2 years ago
YH	a848091141	Fix port exception type (#2925 )	2 years ago
zbian	61e687831d	fixed using zero with tp cannot access weight correctly	2 years ago
github-actions[bot]	eb5cf94332	Automated submodule synchronization (#2927 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
github-actions[bot]	da056285f2	[format] applied code formatting on changed files in pull request 2922 (#2923 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
binmakeswell	12bafe057f	[doc] update installation for GPT (#2922 )	2 years ago
binmakeswell	0afb55fc5b	[doc] add os scope, update tutorial install and tips (#2914 )	2 years ago
YH	7b13f7db18	[zero] trivial zero optimizer refactoring (#2869 ) * Fix mionr grad store interface * Apply lint	2 years ago
fastalgo	dbc01b9c04	Update README.md	2 years ago
Frank Lee	e33c043dec	[workflow] moved pre-commit to post-commit (#2895 )	2 years ago
Jiatong (Julius) Han	8c8a39be95	[hotfix]: Remove math.prod dependency (#2837 ) * Remove math.prod dependency * Fix style * Fix style --------- Co-authored-by: Jiatong Han <jiatong.han@u.nus.edu>	2 years ago
YuliangLiu0306	819e25d8b1	[hotfix] fix autoparallel compatibility test issues (#2754 )	2 years ago
YuliangLiu0306	0f392d7403	[autoparallel] find repeat blocks (#2854 ) * [autoparallel] find repeat blocks * polish * polish * polish	2 years ago
BlueRum	2e16f842a9	[chatgpt]support opt & gpt for rm training (#2876 )	2 years ago
junxu	c52edcf0eb	Rename class method of ZeroDDP (#2692 )	2 years ago
HELSON	6e4ac08172	[hotfix] fix chunk size can not be divided (#2867 ) * [hotfix] fix chunk size can not be divided * [hotfix] use numpy for python3.8	2 years ago
Alex_996	a4fc125c34	Fix typos (#2863 ) Fix typos, `6.7 -> 6.7b`	2 years ago
dawei-wang	55424a16a5	[doc] fix GPT tutorial (#2860 ) Fix hpcaitech/ColossalAI#2851	2 years ago
Boyuan Yao	eae77c831d	[autoparallel] Patch meta information for nodes that will not be handled by SPMD solver (#2823 ) * [autoparallel] non spmd meta information generator * [autoparallel] patch meta information for non spmd nodes	2 years ago
Boyuan Yao	c7764d3f22	[autoparallel] Patch meta information of `torch.where` (#2822 ) * [autoparallel] patch meta information of torch.where * [autoparallel] pre-commit modified	2 years ago
Boyuan Yao	fcc4097efa	[autoparallel] Patch meta information of `torch.tanh()` and `torch.nn.Dropout` (#2773 ) * [autoparallel] tanh meta information * [autoparallel] remove redundant code * [autoparallel] patch meta information of torch.nn.Dropout	2 years ago
BlueRum	34ca324b0d	[chatgpt] Support saving ckpt in examples (#2846 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit * add support of saving ckpt in examples * fix single-gpu save	2 years ago
Zheng Zeng	597914317b	[doc] fix typo in opt inference tutorial (#2849 )	2 years ago
Frank Lee	935346430f	[cli] handled version check exceptions (#2848 ) * [cli] handled version check exceptions * polish code	2 years ago
BlueRum	3eebc4dff7	[chatgpt] fix rm eval (#2829 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit	2 years ago
Frank Lee	918bc94b6b	[triton] added copyright information for flash attention (#2835 ) * [triton] added copyright information for flash attention * polish code	2 years ago
Boyuan Yao	7ea6bc7f69	[autoparallel] Patch tensor related operations meta information (#2789 ) * [autoparallel] tensor related meta information prototype * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information	2 years ago
github-actions[bot]	a5721229d9	Automated submodule synchronization (#2740 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
Haofan Wang	47ecb22387	[example] add LoRA support (#2821 ) * add lora * format	2 years ago
ver217	b6a108cb91	[chatgpt] add test checkpoint (#2797 ) * [chatgpt] add test checkpoint * [chatgpt] test checkpoint use smaller model	2 years ago
Michelle	c008d4ad0c	[NFC] polish colossalai/engine/schedule/_pipeline_schedule.py code style (#2744 )	2 years ago
mickogoin	58abde2857	Update README.md (#2791 ) Fixed typo on line 285 from "defualt" to "default"	2 years ago
Marco Rodrigues	89f0017a9c	Typo (#2826 )	2 years ago
Jiarui Fang	bf0204604f	[exmaple] add bert and albert (#2824 )	2 years ago
YuliangLiu0306	cf6409dd40	Hotfix/auto parallel zh doc (#2820 ) * [hotfix] fix autoparallel zh docs * polish * polish	2 years ago
YuliangLiu0306	2059fdd6b0	[hotfix] add copyright for solver and device mesh (#2803 ) * [hotfix] add copyright for solver and device mesh * add readme * add alpa license * polish	2 years ago
LuGY	dbd0fd1522	[CI/CD] fix nightly release CD running on forked repo (#2812 ) * [CI/CD] fix nightly release CD running on forker repo * fix misunderstanding of dispatch * remove some build condition, enable notify even when release failed	2 years ago
Boyuan Yao	8593ae1a3f	[autoparallel] rotor solver refactor (#2813 ) * [autoparallel] rotor solver refactor * [autoparallel] rotor solver refactor	2 years ago
binmakeswell	09f457479d	[doc] update OPT serving (#2804 ) * [doc] update OPT serving * [doc] update OPT serving	2 years ago
HELSON	56ddc9ca7a	[hotfix] add correct device for fake_param (#2796 )	2 years ago
ver217	a619a190df	[chatgpt] update readme about checkpoint (#2792 ) * [chatgpt] add save/load checkpoint sample code * [chatgpt] add save/load checkpoint readme * [chatgpt] refactor save/load checkpoint readme	2 years ago

... 3 4 5 6 7 ...

2249 Commits (366a035552ff62d5f3dd9750bc9d263c2aa60dbc) All Branches Search

2249 Commits (366a035552ff62d5f3dd9750bc9d263c2aa60dbc)

All Branches