Only store channel if use_cached_channel is true #387

Merged: 3 commits into triton-inference-server:main on Apr 17, 2024

Conversation

@HennerM (Contributor) commented Sep 4, 2023

The gRPC client exposes a parameter that disables reusing a cached channel: use_cached_channel.
However, setting it to false doesn't stop the channel (and mock) from being inserted into the internal channel cache. Because the channel is still stored in the map, it is only destroyed once it is replaced by another channel.

In our case we want the channel (and its associated connection) to be closed as soon as the client using it is destroyed, meaning we don't want to cache the channel at all. I believe this is also consistent with the documentation for use_cached_channel:

/// use_cached_channel If false, a new channel is created for each
/// new client instance. When true, re-use old channels from cache for new
/// client instances. The default value is true.
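
To make the lifetime issue concrete, here is a minimal, self-contained C++ sketch. Channel, channel_cache, and GetChannel are hypothetical stand-ins for the client's internal channel map, not the actual Triton client API; the point is the conditional insertion, which is what this PR changes:

#include <iostream>
#include <map>
#include <memory>
#include <string>

// Hypothetical stand-in for a gRPC channel; the destructor prints so the
// lifetime difference is observable.
struct Channel {
  explicit Channel(std::string url) : url_(std::move(url)) {}
  ~Channel() { std::cout << "channel to " << url_ << " destroyed\n"; }
  std::string url_;
};

// Hypothetical channel cache mirroring the map discussed above.
std::map<std::string, std::shared_ptr<Channel>> channel_cache;

std::shared_ptr<Channel> GetChannel(
    const std::string& url, bool use_cached_channel)
{
  if (use_cached_channel) {
    auto it = channel_cache.find(url);
    if (it != channel_cache.end()) {
      return it->second;  // re-use the cached channel
    }
  }
  auto channel = std::make_shared<Channel>(url);
  // The fix: only store the channel when caching was requested. Before,
  // the insertion was unconditional, so the map kept every channel alive
  // until another channel for the same URL replaced it.
  if (use_cached_channel) {
    channel_cache[url] = channel;
  }
  return channel;
}

int main()
{
  {
    auto c = GetChannel("localhost:8001", /*use_cached_channel=*/false);
  }  // last reference dropped here: channel destroyed, connection closed

  {
    auto c = GetChannel("localhost:8001", /*use_cached_channel=*/true);
  }  // cached: the map still holds a reference, so no destruction yet
  std::cout << "end of main\n";
  return 0;
}

With use_cached_channel=false the map never holds a shared_ptr, so the channel's reference count drops to zero (and the connection closes) as soon as the last client holding it is destroyed; with true, the cache keeps the channel alive until it is replaced.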

@HennerM (Contributor, Author) commented Jan 6, 2024

@Tabrizian would you mind having a look at this? We have to keep an internal fork because of this behaviour. I have already signed the CLA (for my organisation, Speechmatics).

@Tabrizian requested a review from debermudez on January 13, 2024
@Tabrizian (Member) commented

@debermudez is this something that you could help with reviewing?

@debermudez (Contributor) commented

Yea. I will make a ticket to track this and look at it next week.
On the surface, it seems straightforward enough (I say, hoping not to curse my future self).

@debermudez (Contributor) commented

@HennerM I have not forgotten about this, but I was unfortunately sidetracked. I am trying to review this today.
In the meantime, have you signed the CLA? https://github.com/triton-inference-server/server/blob/main/CONTRIBUTING.md#contributor-license-agreement-cla
We need a signed CLA to accept any code contributions.

@debermudez (Contributor) commented

I completely missed your second message. Never mind my ask.

@debermudez mentioned this pull request on Feb 2, 2024
@debermudez (Contributor) commented

@HennerM Wanted to give you a quick update. This is actively being tested via this PR: #465.
If it passes, we should be able to get this merged shortly after.

@HennerM (Contributor, Author) commented Mar 30, 2024

@debermudez Thanks, is this still being worked on?

@aancel commented Apr 8, 2024

I'm also looking forward to having the changes proposed by @HennerM integrated into this repo, as this is a need we recently had as well! 👍

@HennerM (Contributor, Author) commented Apr 10, 2024

@Tabrizian did this get lost?

@debermudez (Contributor) commented

Sorry, no, it did not get lost.
I got pulled off onto something else.
I have handed this off to a teammate: @jbkyang-nvi

@jbkyang-nvi (Contributor) commented

Hello. Looking into this now. As an FYI, this will be added to the 24.05 release instead of 24.04 since it was not prioritized for this release.

@HennerM (Contributor, Author) commented Apr 11, 2024

> Hello. Looking into this now. As an FYI, this will be added to the 24.05 release instead of 24.04 since it was not prioritized for this release.

Thanks! That's okay, we've already waited some time; one more month isn't going to make a big difference :)

@jbkyang-nvi (Contributor) commented Apr 17, 2024

Merging together with #465, tested via triton-inference-server/server#7123, once pre-commit passes.

@jbkyang-nvi merged commit 5d969a6 into triton-inference-server:main on Apr 17, 2024 (3 checks passed)
jbkyang-nvi pushed a commit that referenced this pull request Apr 17, 2024
Adding testing to verify new use_cached_channel behavior.
To be merged after #387
Server test: triton-inference-server/server#7123
---------
Co-authored-by: Markus Hennerbichler <[email protected]>
Co-authored-by: Katherine Yang <[email protected]>
Co-authored-by: Katherine Yang <[email protected]>
jbkyang-nvi added a commit to triton-inference-server/server that referenced this pull request Apr 17, 2024
debermudez pushed a commit that referenced this pull request Apr 18, 2024
debermudez added a commit that referenced this pull request Apr 18, 2024
mc-nv pushed a commit that referenced this pull request Apr 18, 2024
mc-nv pushed a commit that referenced this pull request Apr 18, 2024
@HennerM (Contributor, Author) commented Apr 20, 2024

Thanks for getting this merged!
