from locust import HttpUser, between, tag, task


class QuickstartUser(HttpUser):
    """Simulated user that load-tests the serving API's generation endpoints."""

    # Wait 1-5 seconds between consecutive tasks to mimic real user pacing.
    wait_time = between(1, 5)
@tag("online-generation")
@task(5)
def completion(self):
self.client.post("/completion", json={"prompt": "hello, who are you? ", "stream": "False"})
@tag("online-generation")
@task(5)
def completion_streaming(self):
self.client.post("/completion", json={"prompt": "hello, who are you? ", "stream": "True"})
@tag("online-chat")
@task(5)
def chat(self):
self.client.post(
"/chat",
json={
"messages": [
{"role": "system", "content": "you are a helpful assistant"},
{"role": "user", "content": "what is 1+1?"},
],
"stream": "False",
},
)
@tag("online-chat")
@task(5)
def chat_streaming(self):
self.client.post(
"/chat",
json={
"messages": [
{"role": "system", "content": "you are a helpful assistant"},
{"role": "user", "content": "what is 1+1?"},
],
"stream": "True",
},
)
    # The offline-generation tasks exist only to demonstrate usage; they are
    # never exercised in actual serving.
@tag("offline-generation")
@task(5)
def generate_streaming(self):
self.client.post("/generate", json={"prompt": "Can you help me? ", "stream": "True"})
@tag("offline-generation")
@task(5)
def generate(self):
self.client.post("/generate", json={"prompt": "Can you help me? ", "stream": "False"})
@tag("online-generation", "offline-generation")
@task
def health_check(self):
self.client.get("/ping")