ColossalAI

Commit Graph

Author	SHA1	Message	Date
zbian	653b0a620e	added skip_bias_add for non-tp linear	2022-11-09 15:41:08 +08:00
XYE	e83b2ce853	[NFC] polish colossalai/nn/layer/vanilla/layers.py code style (#1295 )	2022-07-13 12:08:21 +08:00
Liping233	1000a41fd5	[NFC] polish colossalai/nn/layer/vanilla/__init__.py code style (#1293 )	2022-07-13 12:08:21 +08:00
アマデウス	b8899e0905	[TP] allow layernorm without bias (#750 )	2022-04-14 11:43:56 +08:00
Liang Bowen	ec5086c49c	Refactored docstring to google style	2022-03-29 17:17:47 +08:00
Liang Bowen	7eb87f516d	flake8 style (#352 )	2022-03-11 15:50:28 +08:00
アマデウス	9ee197d0e9	moved env variables to global variables; (#215 ) added branch context; added vocab parallel layers; moved split_batch from load_batch to tensor parallel embedding layers; updated gpt model; updated unit test cases; fixed few collective communicator bugs	2022-02-15 11:31:13 +08:00
HELSON	0f8c7f9804	Fixed docstring in colossalai (#171 )	2022-01-21 10:44:30 +08:00
BoxiangW	4a3d3446b0	Update layer integration documentations (#108 ) Update the documentations of layer integration Update _log_hook.py Update _operation.py	2022-01-10 18:05:58 +08:00
HELSON	dceae85195	Added MoE parallel (#127 )	2022-01-07 15:08:36 +08:00
アマデウス	01a80cd86d	Hotfix/Colossalai layers (#92 ) * optimized 1d layer apis; reorganized nn.layer modules; fixed tests * fixed 2.5d runtime issue * reworked split batch, now called in trainer.schedule.load_batch Co-authored-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com>	2021-12-29 23:32:10 +08:00
アマデウス	0fedef4f3c	Layer integration (#83 ) * integrated parallel layers for ease of building models * integrated 2.5d layers * cleaned codes and unit tests * added log metric by step hook; updated imagenet benchmark; fixed some bugs * reworked initialization; cleaned codes Co-authored-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com>	2021-12-27 15:04:32 +08:00

12 Commits (839847b7d78bce6af5dfe58d27b5ce2c74a3619b)