ColossalAI/applications
Wenhao Chen 9d02590c9a
[chat] refactor actor class (#3968)
* refactor: separate log_probs fn from Actor forward fn

* refactor: separate generate fn from Actor class

* feat: update unwrap_model and get_base_model
* unwrap_model returns model not wrapped by Strategy
* get_base_model returns HF model for Actor, Critic and RewardModel

* feat: simplify Strategy.prepare

* style: remove get_base_model method of Actor

* perf: tokenize text in batches

* refactor: move calc_action_log_probs to utils of model

* test: update test with new forward fn

* style: rename forward fn args

* fix: do not unwrap model in save_model fn of naive strategy

* test: add gemini test for train_prompts

* fix: fix _set_default_generate_kwargs
2023-06-13 13:31:56 +08:00
..
Chat [chat] refactor actor class (#3968) 2023-06-13 13:31:56 +08:00
README.md [doc] hide diffusion in application path (#3519) 2023-04-10 17:52:24 +08:00

README.md

Applications

This directory contains the applications that are powered by Colossal-AI.

The list of applications include:

  • Chatbot
  • FastFold: Optimizing AlphaFold (Biomedicine) Training and Inference on GPU Clusters

Please note that the Chatbot application is migrated from the original ChatGPT folder.

You can find more example code for base models and functions in the Examples directory.