ColossalAI/applications
Tong Li 4c8e85ee0d
[Coati] Train DPO using PP (#6054)
* update dpo

* remove unsupport plugin

* update msg

* update dpo

* remove unsupport plugin

* update msg

* update template

* update dataset

* add pp for dpo

* update dpo

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add dpo fn

* update dpo

* update dpo

* update dpo

* update dpo

* minor update

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update loss

* update help

* polish code

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-10-11 19:32:00 +08:00
..
Colossal-LLaMA add fused norm (#6038) 2024-08-28 17:12:51 +08:00
ColossalChat [Coati] Train DPO using PP (#6054) 2024-10-11 19:32:00 +08:00
ColossalEval [ColossalEval] support for vllm (#6056) 2024-09-18 17:09:45 +08:00
ColossalMoE [MoE/ZeRO] Moe refactor with zero refactor (#5821) 2024-06-28 14:00:08 +08:00
ColossalQA [pre-commit.ci] pre-commit autoupdate (#5572) 2024-07-01 17:16:41 +08:00
README.md [Hotfix] README link (#5966) 2024-08-08 18:04:47 +08:00

README.md

Applications

This directory contains the applications that are powered by Colossal-AI.

The list of applications include:

  • Open-Sora: Revealing Complete Model Parameters, Training Details, and Everything for Sora-like Video Generation Models
  • ColossalChat: Replication of ChatGPT with RLHF.
  • Colossal-LLaMA: Continual Pre-training and Supervisied Fine-tuning of LLaMA2 / LLaMA3.
  • ColossalEval: Evaluation Pipeline for LLMs.
  • FastFold: Optimizing AlphaFold (Biomedicine) Training and Inference on GPU Clusters.
  • ColossalQA: Document Retrieval Conversation System
  • SwiftInfer: Breaks the Length Limit of LLM Inference for Multi-Round Conversations

Please note that the Chatbot application is migrated from the original ChatGPT folder.

You can find more example code for base models and functions in the Examples directory.