_C
[setup] support pre-build and jit-build of cuda kernels ( #2374 )
2023-01-06 20:50:26 +08:00
_analyzer
[misc] update pre-commit and run all files ( #4752 )
2023-09-19 14:20:26 +08:00
amp
[npu] add npu support for gemini and zero ( #5067 )
2023-11-20 16:12:41 +08:00
auto_parallel
[npu] add npu support for gemini and zero ( #5067 )
2023-11-20 16:12:41 +08:00
autochunk
[misc] update pre-commit and run all files ( #4752 )
2023-09-19 14:20:26 +08:00
booster
[plugin]fix 3d checkpoint load when booster boost without optimizer. ( #5135 )
2023-11-30 18:37:47 +08:00
checkpoint_io
[pipeline,shardformer] Fix p2p efficiency in pipeline, allow skipping loading weight not in weight_map when `strict=False`, fix llama flash attention forward, add flop estimation by megatron in llama benchmark ( #5017 )
2023-11-16 20:15:59 +08:00
cli
[bug] Fix the version check bug in colossalai run when generating the cmd. ( #4713 )
2023-09-22 10:50:47 +08:00
cluster
[gemini] gemini support tensor parallelism. ( #4942 )
2023-11-10 10:15:16 +08:00
context
[moe] merge moe into main ( #4978 )
2023-11-02 02:21:24 +00:00
device
[npu] add npu support for hybrid plugin and llama ( #5090 )
2023-11-22 19:23:21 +08:00
fx
[misc] update pre-commit and run all files ( #4752 )
2023-09-19 14:20:26 +08:00
inference
[Hotfix] Fix model policy matching strategy in ShardFormer ( #5064 )
2023-11-22 11:19:39 +08:00
interface
[lazy] support from_pretrained ( #4801 )
2023-09-26 11:04:11 +08:00
kernel
fix thrust-transform-reduce error ( #5078 )
2023-11-21 15:09:35 +08:00
lazy
[doc] add lazy init docs ( #4808 )
2023-09-27 10:24:04 +08:00
legacy
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert ( #5088 )
2023-11-28 16:54:42 +08:00
logging
[misc] update pre-commit and run all files ( #4752 )
2023-09-19 14:20:26 +08:00
moe
[hotfix]: modify create_ep_hierarchical_group and add test ( #5032 )
2023-11-17 10:53:00 +08:00
nn
[npu] add npu support for gemini and zero ( #5067 )
2023-11-20 16:12:41 +08:00
pipeline
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert ( #5088 )
2023-11-28 16:54:42 +08:00
shardformer
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert ( #5088 )
2023-11-28 16:54:42 +08:00
tensor
[hotfix] fixed memory usage of shardformer module replacement ( #5122 )
2023-11-28 15:38:26 +08:00
testing
[npu] add npu support for hybrid plugin and llama ( #5090 )
2023-11-22 19:23:21 +08:00
utils
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert ( #5088 )
2023-11-28 16:54:42 +08:00
zero
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert ( #5088 )
2023-11-28 16:54:42 +08:00
__init__.py
[misc] update pre-commit and run all files ( #4752 )
2023-09-19 14:20:26 +08:00
initialize.py
[npu] add npu support for gemini and zero ( #5067 )
2023-11-20 16:12:41 +08:00