ColossalAI/tests/test_shardformer/test_layer
Wang Binluo dcd41d0973
Merge pull request #6071 from wangbluo/ring_attention
[Ring Attention] fix the 2d ring attn when using multiple machine
2024-10-15 15:17:21 +08:00
..
test_dist_crossentropy.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_dropout.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_embedding.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_gpt2_qkv_fused_linear_1d.py [shardformer] optimize seq parallelism (#6086) 2024-10-11 13:44:40 +08:00
test_layernorm.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_linear_1d.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_qkv_fused_linear_1d.py [shardformer] fix linear 1d row and support uneven splits for fused qkv linear (#6084) 2024-10-10 14:34:45 +08:00
test_ring_attn.py fix 2024-10-15 13:26:44 +08:00
test_sequence_parallel.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_vocab_parallel_embedding_1d.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00