Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Map decode_request during dynamic batching using a threadpool #166

Open
aniketmaurya opened this issue Jul 8, 2024 · 1 comment
Open
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@aniketmaurya
Copy link
Collaborator

aniketmaurya commented Jul 8, 2024

🚀 Feature

A default optimization that LitServe can provide users is to map the decode_request function in case of dynamic batching using a ThreadPool. This can be useful for cases like image loading which is IO based.

I did a quick test with a ResNet-152 model for image classification and observed the following throughput (Requests per second) performance gain with threadpool:

image

Motivation

Pitch

Alternatives

Additional context

@aniketmaurya aniketmaurya added enhancement New feature or request help wanted Extra attention is needed labels Jul 8, 2024
@grumpyp
Copy link
Contributor

grumpyp commented Sep 21, 2024

hi @aniketmaurya have you already thought of an implementation of this?

I'd be interested in implementing it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants