Map `decode_request` during dynamic batching using a threadpool #166

aniketmaurya · 2024-07-08T16:42:51Z

🚀 Feature

A default optimization that LitServe can provide users is to map the decode_request function in case of dynamic batching using a ThreadPool. This can be useful for cases like image loading which is IO based.

I did a quick test with a ResNet-152 model for image classification and observed the following throughput (Requests per second) performance gain with threadpool:

Motivation

Pitch

Alternatives

Additional context

The text was updated successfully, but these errors were encountered:

grumpyp · 2024-09-21T14:08:05Z

hi @aniketmaurya have you already thought of an implementation of this?

I'd be interested in implementing it.

aniketmaurya added enhancement New feature or request help wanted Extra attention is needed labels Jul 8, 2024

deependujha mentioned this issue Sep 23, 2024

Feat: decode request in threadpool #290

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Map `decode_request` during dynamic batching using a threadpool #166

Map `decode_request` during dynamic batching using a threadpool #166

aniketmaurya commented Jul 8, 2024 •

edited

Loading

grumpyp commented Sep 21, 2024

Map decode_request during dynamic batching using a threadpool #166

Map decode_request during dynamic batching using a threadpool #166

Comments

aniketmaurya commented Jul 8, 2024 • edited Loading

🚀 Feature

Motivation

Pitch

Alternatives

Additional context

grumpyp commented Sep 21, 2024

Map `decode_request` during dynamic batching using a threadpool #166

Map `decode_request` during dynamic batching using a threadpool #166

aniketmaurya commented Jul 8, 2024 •

edited

Loading