40 Commits (788e07dbc5dc5acaf34e24d98238780ecf134ef2)

Author SHA1 Message Date
digger-yu 1f73609adb [CI] fix typo with tests/ etc. (#3727) 2 years ago
Frank Lee 80eba05b0a [test] refactor tests with spawn (#3452) 2 years ago
zbian e94c79f15b improved allgather & reducescatter for 3d 2 years ago
Jiarui Fang 986f8cbaa7 [inference] overlap comm and compute in Linear1D_Row when stream_chunk_num > 1 (#1876) 2 years ago
Jiarui Fang c2947dadf1 [inference] streaming Linear 1D Row inference (#1874) 2 years ago
kurisusnowdeng 0b8161fab8 updated tp layers 2 years ago
Ziheng Qin cbe9a4cb45 [NFC] polish tests/test_layers/test_3d/test_3d.py code style (#1740) 2 years ago
lucasliunju 912eb58ea0 [NFC] polish tests/test_layers/test_3d/checks_3d/common.py code style (#1733) 2 years ago
Xue Fuzhao 754aa7c81f [NFC] polish tests/test_layers/test_3d/checks_3d/check_layer_3d.py code style (#1731) 2 years ago
xyupeng ff373a11eb [NFC] polish tests/test_layers/test_sequence/checks_seq/check_layer_seq.py code style (#1723) 2 years ago
Kai Wang (Victor Kai) b38efe4e8a [NFC] polish test_2p5d/checks_2p5d/check_operation_2p5d.py code style (#1718) 2 years ago
binmakeswell f6389d0813 [NFC] polish tests/test_layers/test_2d/checks_2d/check_operation_2d.py code style (#1715) 2 years ago
Jiarui Fang 21962e1593 [embedding] rename FreqAwareEmbedding -> CachedEmbedding (#1699) 2 years ago
Jiarui Fang 38c68b5b9a [embedding] rollback for better FAW performance (#1625) 2 years ago
Jiarui Fang 504ff1d101 [embeddings] use cache_ratio instead of cuda_row_num (#1611) 2 years ago
CsRic f3403ff98e [embeddings] add already_split_along_rank flag for tablewise mode (#1584) 2 years ago
Jiarui Fang 64169f3e8f [embedding] polish parallel embedding tablewise (#1545) 2 years ago
CsRic 964123ae0f [embedding] freq_aware_embedding: add small functions for caller application (#1537) 2 years ago
CsRic 5156d5b4f8 [embedding] add tablewise sharding for FAW (#1526) 2 years ago
Jiarui Fang 9a9ef65313 [FAW] cpu caching operations (#1520) 2 years ago
Jiarui Fang af5438caa2 [FAW] refactor reorder() for CachedParamMgr (#1514) 2 years ago
CsRic 1b8fee8e9c [FAW] shrink freq_cnter size (#1509) 2 years ago
CsRic 0ed2f46131 [FAW] FAW embedding use LRU as eviction strategy intialized with dataset stats (#1494) 2 years ago
CsRic b8d0e39eaf [FAW] LFU cache for the FAW 2 years ago
Jiarui Fang cde7b8a5b8 [FAW] init an LFU implementation for FAW (#1488) 2 years ago
Geng Zhang 0aad53c62b [FCE] update interface for frequency statistics in FreqCacheEmbedding (#1462) 2 years ago
Geng Zhang 9f3eed66eb [FAW] reorganize the inheritance struct of FreqCacheEmbedding (#1448) 2 years ago
Jiarui Fang 30b4dd17c0 [FAW] export FAW in _ops (#1438) 2 years ago
Frank Lee 50ec3a7e06 [test] skip tests when not enough GPUs are detected (#1090) 2 years ago
Frank Lee 65ee6dcc20 [test] ignore 8 gpu test (#1080) 2 years ago
Frank Lee 5a1a095b92 [test] refactored with the new rerun decorator (#763) 3 years ago
Frank Lee 3601b2bad0 [test] fixed rerun_on_exception and adapted test cases (#487) 3 years ago
zbian 3dba070580 fixed padding index issue for vocab parallel embedding layers; updated 3D linear to be compatible with examples in the tutorial 3 years ago
アマデウス 9ee197d0e9 moved env variables to global variables; (#215) 3 years ago
Frank Lee e2089c5c15 adapted for sequence parallel (#163) 3 years ago
アマデウス 01a80cd86d Hotfix/Colossalai layers (#92) 3 years ago
アマデウス 0fedef4f3c Layer integration (#83) 3 years ago
Frank Lee cd9c28e055 added CI for unit testing (#69) 3 years ago
Frank Lee da01c234e1 Develop/experiments (#59) 3 years ago
zbian 404ecbdcc6 Migrated project 3 years ago