ColossalAI

History

Wenhao Chen e614aa34f3 [shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508 ) * feat: add `GradientCheckpointConfig` and `PipelineGradientCheckpointConfig` * feat: apply `GradientCheckpointConfig` to policy and llama_forward * feat: move `distribute_layer` and `get_stage_index` to PipelineStageManager * fix: add optional args for `distribute_layer` and `get_stage_index` * fix: fix changed API calls * test: update llama tests * style: polish `GradientCheckpointConfig` * fix: fix pipeline utils tests		2024-04-01 11:34:58 +08:00
..
custom	[test] merge old components to test to model zoo (#4945 )	2023-10-20 10:35:08 +08:00
diffusers	[misc] update pre-commit and run all files (#4752 )	2023-09-19 14:20:26 +08:00
timm	[misc] update pre-commit and run all files (#4752 )	2023-09-19 14:20:26 +08:00
torchaudio	[misc] update pre-commit and run all files (#4752 )	2023-09-19 14:20:26 +08:00
torchrec	[test] modify model supporting part of low_level_zero plugin (including correspoding docs)	2023-10-05 15:10:31 +08:00
torchvision	[misc] update pre-commit and run all files (#4752 )	2023-09-19 14:20:26 +08:00
transformers	[shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508 )	2024-04-01 11:34:58 +08:00
__init__.py	[workflow] fixed build CI (#5240 )	2024-01-10 22:34:16 +08:00
executor.py	[test] merge old components to test to model zoo (#4945 )	2023-10-20 10:35:08 +08:00
registry.py	[example]add gpt2 benchmark example script. (#5295 )	2024-03-04 16:18:13 +08:00