From aa9ec6568e71cc0b8058d1cc09947130c0c8332f Mon Sep 17 00:00:00 2001
From: dyastremsky <58150256+dyastremsky@users.noreply.github.com>
Date: Wed, 18 Oct 2023 07:55:08 -0700
Subject: [PATCH] Update wording - add "the"

Co-authored-by: Neelay Shah
---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index b17d31a5..d99a8b4a 100644
--- a/README.md
+++ b/README.md
@@ -41,7 +41,7 @@ repo](https://github.com/triton-inference-server/backend).
 This is a Python-based backend. When using this backend, all
 requests are placed on the vLLM AsyncEngine as soon as they are
 received. Inflight batching and paged attention is handled
-by vLLM engine.
+by the vLLM engine.
 
 Where can I ask general questions about Triton and Triton backends?
 Be sure to read all the information below as well as the [general