2022-02-03 07:38:00 +00:00
|
|
|
# Change Log
|
|
|
|
|
|
|
|
All notable changes to this project will be documented in this file.
|
|
|
|
|
2022-02-14 09:22:48 +00:00
|
|
|
## v0.0.2 | 2022-02
|
2022-02-03 07:38:00 +00:00
|
|
|
|
|
|
|
### Added
|
|
|
|
|
2022-03-13 14:34:34 +00:00
|
|
|
- Unified distributed layers
|
2022-02-03 07:38:00 +00:00
|
|
|
- MoE support
|
|
|
|
- DevOps tools such as github action, code review automation, etc.
|
|
|
|
- New project official website
|
|
|
|
|
|
|
|
### Changes
|
|
|
|
|
|
|
|
- refactored the APIs for usability, flexibility and modularity
|
|
|
|
- adapted PyTorch AMP for tensor parallel
|
|
|
|
- refactored utilities for tensor parallel and pipeline parallel
|
|
|
|
- Separated benchmarks and examples as independent repositories
|
|
|
|
- Updated pipeline parallelism to support non-interleaved and interleaved versions
|
2022-02-14 09:22:48 +00:00
|
|
|
- refactored installation scripts for convenience
|
2022-02-03 07:38:00 +00:00
|
|
|
|
|
|
|
### Fixed
|
|
|
|
|
|
|
|
- zero level 3 runtime error
|
|
|
|
- incorrect calculation in gradient clipping
|
|
|
|
|
|
|
|
|
|
|
|
## v0.0.1 beta | 2021-10
|
|
|
|
|
|
|
|
The first beta version of Colossal-AI. Thanks to all contributors for the effort to implement the system.
|
|
|
|
|
|
|
|
### Added
|
|
|
|
|
|
|
|
- Initial architecture of the system
|
2022-03-13 14:34:34 +00:00
|
|
|
- Features such as tensor parallelism, gradient clipping, gradient accumulation
|