From aa9ec6568e71cc0b8058d1cc09947130c0c8332f Mon Sep 17 00:00:00 2001
From: dyastremsky <58150256+dyastremsky@users.noreply.github.com>
Date: Wed, 18 Oct 2023 07:55:08 -0700
Subject: [PATCH] Update wording - add "the"

Co-authored-by: Neelay Shah <neelays@nvidia.com>
---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index b17d31a5..d99a8b4a 100644
--- a/README.md
+++ b/README.md
@@ -41,7 +41,7 @@ repo](https://github.com/triton-inference-server/backend).
 
 This is a Python-based backend. When using this backend, all requests are placed on the
 vLLM AsyncEngine as soon as they are received. Inflight batching and paged attention is handled
-by vLLM engine.
+by the vLLM engine.
 
 Where can I ask general questions about Triton and Triton backends?
 Be sure to read all the information below as well as the [general