Add testing for decoupled model use case #7246
Conversation
Force-pushed from 6c85d28 to d5111c1
```python
class TritonPythonModel:
    """This model launches a separate thread to handle the request from a queue. The thread is launched from
```
Please keep the number of characters in each line below 80.
@dyastremsky do you know why CodeQL didn't catch this?
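For context, a minimal, hypothetical sketch of the pattern this PR tests: a decoupled Python backend model whose `execute` enqueues requests for a worker thread, which replies through each request's response sender. The `INPUT0`/`OUTPUT0` tensor names are placeholders, not taken from this PR's model.

```python
import queue
import threading

import triton_python_backend_utils as pb_utils


class TritonPythonModel:
    """Decoupled-model sketch: a worker thread drains a queue of requests
    and sends responses via each request's response sender."""

    def initialize(self, args):
        self._queue = queue.Queue()
        self._worker = threading.Thread(target=self._process, daemon=True)
        self._worker.start()

    def execute(self, requests):
        # In decoupled mode, responses flow through response senders,
        # so execute() returns None instead of a list of responses.
        for request in requests:
            self._queue.put(request)
        return None

    def _process(self):
        while True:
            request = self._queue.get()
            in_tensor = pb_utils.get_input_tensor_by_name(request, "INPUT0")
            out_tensor = pb_utils.Tensor("OUTPUT0", in_tensor.as_numpy())
            sender = request.get_response_sender()
            sender.send(pb_utils.InferenceResponse(output_tensors=[out_tensor]))
            # Mark this request complete; no more responses will follow.
            sender.send(flags=pb_utils.TRITONSERVER_RESPONSE_COMPLETE_FINAL)
```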
```python
wait_input = pb_utils.get_input_tensor_by_name(requests[0], "WAIT").as_numpy()
time.sleep(wait_input[0] / 1000)
```
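Side note on units: `time.sleep` takes seconds, so the division by 1000 implies `WAIT` is specified in milliseconds. A hypothetical client-side construction of that input (the INT32 datatype and shape are assumptions, not taken from the diff):

```python
import numpy as np
import tritonclient.grpc as grpcclient

# "WAIT" is in milliseconds; the model divides by 1000 before time.sleep().
wait = grpcclient.InferInput("WAIT", [1], "INT32")
wait.set_data_from_numpy(np.array([500], dtype=np.int32))  # sleep for 500 ms
```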
```python
return None
```
nit: remove this
I was wondering if you mean remove `return None`? I thought we'd need to return `None` from `execute` if it's in decoupled mode.
Not returning anything is similar to returning `None`.
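The equivalence the reviewer points out is plain Python semantics: a function that falls off the end returns `None` implicitly, so in decoupled mode both forms satisfy the requirement that `execute` return `None`. A trivial sketch (the pass-through body is hypothetical):

```python
def execute(self, requests):
    for request in requests:
        ...  # responses go out via request.get_response_sender()
    return None  # equivalent: omit this line; Python returns None implicitly
```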
Closing: per the discussion, a better approach would be to handle this in the Triton core instead of PYBE. Filed ticket DLIS-6795 to track this.
Added testing for the specific use case of launching a separate thread in decoupled models. Also fixed a small CI issue.
PYBE: triton-inference-server/python_backend#358