ColossalAI

Commit Graph

Author	SHA1	Message	Date
Jiatong (Julius) Han	8c8a39be95	[hotfix]: Remove math.prod dependency (#2837 ) * Remove math.prod dependency * Fix style * Fix style --------- Co-authored-by: Jiatong Han <jiatong.han@u.nus.edu>	2 years ago
YuliangLiu0306	819e25d8b1	[hotfix] fix autoparallel compatibility test issues (#2754 )	2 years ago
YuliangLiu0306	0f392d7403	[autoparallel] find repeat blocks (#2854 ) * [autoparallel] find repeat blocks * polish * polish * polish	2 years ago
BlueRum	2e16f842a9	[chatgpt]support opt & gpt for rm training (#2876 )	2 years ago
junxu	c52edcf0eb	Rename class method of ZeroDDP (#2692 )	2 years ago
HELSON	6e4ac08172	[hotfix] fix chunk size can not be divided (#2867 ) * [hotfix] fix chunk size can not be divided * [hotfix] use numpy for python3.8	2 years ago
Alex_996	a4fc125c34	Fix typos (#2863 ) Fix typos, `6.7 -> 6.7b`	2 years ago
dawei-wang	55424a16a5	[doc] fix GPT tutorial (#2860 ) Fix hpcaitech/ColossalAI#2851	2 years ago
Boyuan Yao	eae77c831d	[autoparallel] Patch meta information for nodes that will not be handled by SPMD solver (#2823 ) * [autoparallel] non spmd meta information generator * [autoparallel] patch meta information for non spmd nodes	2 years ago
Boyuan Yao	c7764d3f22	[autoparallel] Patch meta information of `torch.where` (#2822 ) * [autoparallel] patch meta information of torch.where * [autoparallel] pre-commit modified	2 years ago
Boyuan Yao	fcc4097efa	[autoparallel] Patch meta information of `torch.tanh()` and `torch.nn.Dropout` (#2773 ) * [autoparallel] tanh meta information * [autoparallel] remove redundant code * [autoparallel] patch meta information of torch.nn.Dropout	2 years ago
BlueRum	34ca324b0d	[chatgpt] Support saving ckpt in examples (#2846 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit * add support of saving ckpt in examples * fix single-gpu save	2 years ago
Zheng Zeng	597914317b	[doc] fix typo in opt inference tutorial (#2849 )	2 years ago
Frank Lee	935346430f	[cli] handled version check exceptions (#2848 ) * [cli] handled version check exceptions * polish code	2 years ago
BlueRum	3eebc4dff7	[chatgpt] fix rm eval (#2829 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit	2 years ago
Frank Lee	918bc94b6b	[triton] added copyright information for flash attention (#2835 ) * [triton] added copyright information for flash attention * polish code	2 years ago
Boyuan Yao	7ea6bc7f69	[autoparallel] Patch tensor related operations meta information (#2789 ) * [autoparallel] tensor related meta information prototype * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information * [autoparallel] tensor related meta information	2 years ago
github-actions[bot]	a5721229d9	Automated submodule synchronization (#2740 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
Haofan Wang	47ecb22387	[example] add LoRA support (#2821 ) * add lora * format	2 years ago
ver217	b6a108cb91	[chatgpt] add test checkpoint (#2797 ) * [chatgpt] add test checkpoint * [chatgpt] test checkpoint use smaller model	2 years ago
Michelle	c008d4ad0c	[NFC] polish colossalai/engine/schedule/_pipeline_schedule.py code style (#2744 )	2 years ago
mickogoin	58abde2857	Update README.md (#2791 ) Fixed typo on line 285 from "defualt" to "default"	2 years ago
Marco Rodrigues	89f0017a9c	Typo (#2826 )	2 years ago
Jiarui Fang	bf0204604f	[exmaple] add bert and albert (#2824 )	2 years ago
YuliangLiu0306	cf6409dd40	Hotfix/auto parallel zh doc (#2820 ) * [hotfix] fix autoparallel zh docs * polish * polish	2 years ago
YuliangLiu0306	2059fdd6b0	[hotfix] add copyright for solver and device mesh (#2803 ) * [hotfix] add copyright for solver and device mesh * add readme * add alpa license * polish	2 years ago
LuGY	dbd0fd1522	[CI/CD] fix nightly release CD running on forked repo (#2812 ) * [CI/CD] fix nightly release CD running on forker repo * fix misunderstanding of dispatch * remove some build condition, enable notify even when release failed	2 years ago
Boyuan Yao	8593ae1a3f	[autoparallel] rotor solver refactor (#2813 ) * [autoparallel] rotor solver refactor * [autoparallel] rotor solver refactor	2 years ago
binmakeswell	09f457479d	[doc] update OPT serving (#2804 ) * [doc] update OPT serving * [doc] update OPT serving	2 years ago
HELSON	56ddc9ca7a	[hotfix] add correct device for fake_param (#2796 )	2 years ago
ver217	a619a190df	[chatgpt] update readme about checkpoint (#2792 ) * [chatgpt] add save/load checkpoint sample code * [chatgpt] add save/load checkpoint readme * [chatgpt] refactor save/load checkpoint readme	2 years ago
ver217	4ee311c026	[chatgpt] startegy add prepare method (#2766 ) * [chatgpt] startegy add prepare method * [chatgpt] refactor examples * [chatgpt] refactor strategy.prepare * [chatgpt] support save/load checkpoint * [chatgpt] fix unwrap actor * [chatgpt] fix unwrap actor	2 years ago
Boyuan Yao	a2b43e393d	[autoparallel] Patch meta information of `torch.nn.Embedding` (#2760 ) * [autoparallel] embedding metainfo * [autoparallel] fix function name in test_activation_metainfo * [autoparallel] undo changes in activation metainfo and related tests	2 years ago
Boyuan Yao	8e3f66a0d1	[zero] fix wrong import (#2777 )	2 years ago
Fazzie-Maqianli	ba84cd80b2	fix pip install colossal (#2764 )	2 years ago
Nikita Shulga	01066152f1	Don't use `torch._six` (#2775 ) * Don't use `torch._six` This is a private API which is gone after https://github.com/pytorch/pytorch/pull/94709 * Update common.py	2 years ago
ver217	a88bc828d5	[chatgpt] disable shard init for colossalai (#2767 )	2 years ago
binmakeswell	d6d6dec190	[doc] update example and OPT serving link (#2769 ) * [doc] update OPT serving link * [doc] update example and OPT serving link * [doc] update example and OPT serving link	2 years ago
Frank Lee	e376954305	[doc] add opt service doc (#2747 )	2 years ago
BlueRum	613efebc5c	[chatgpt] support colossalai strategy to train rm (#2742 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2	2 years ago
BlueRum	648183a960	[chatgpt]fix train_rm bug with lora (#2741 )	2 years ago
fastalgo	b6e3b955c3	Update README.md	2 years ago
binmakeswell	30aee9c45d	[NFC] polish code format [NFC] polish code format	2 years ago
YuliangLiu0306	1dc003c169	[autoparallel] distinguish different parallel strategies (#2699 )	2 years ago
YH	ae86a29e23	Refact method of grad store (#2687 )	2 years ago
cloudhuang	43dffdaba5	[doc] fixed a typo in GPT readme (#2736 )	2 years ago
binmakeswell	93b788b95a	Merge branch 'main' into fix/format	2 years ago
xyupeng	2fd528b9f4	[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/graph_analysis.py code style (#2737 )	2 years ago
Zirui Zhu	c9e3ee389e	[NFC] polish colossalai/context/process_group_initializer/initializer_2d.py code style (#2726 )	2 years ago
Zangwei Zheng	1819373e5c	[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/batch_norm_handler.py code style (#2728 )	2 years ago

... 2 3 4 5 6 ...

2180 Commits (31c78f2be3272a9a4062fe78eca34b3847a0c900) All Branches Search

2180 Commits (31c78f2be3272a9a4062fe78eca34b3847a0c900)

All Branches