
Feat: decode request in threadpool #290

Conversation

@deependujha commented Sep 23, 2024

Before submitting
  • Was this discussed/agreed via a GitHub issue? (no need for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

⚠️ How does this PR impact the user? ⚠️

Faster request decoding when the request contains binary payloads (images, audio, etc.).

GOOD:

Faster decoding in the batched loop (see the sketch below).

BAD:

May add overhead for simple requests.
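A minimal sketch of the idea (hypothetical helper name, not the exact PR diff): instead of decoding requests one by one in the batched loop, map decode_request over the batch with a thread pool.

```python
import concurrent.futures

def decode_batch(lit_api, requests, max_workers=None):
    # Hypothetical helper: decode a batch of raw requests concurrently.
    # Helps most when decode_request is I/O-bound or spends its time in
    # C extensions (e.g. base64-decoding large audio/image payloads).
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, matching the sequential loop's output
        return list(pool.map(lit_api.decode_request, requests))
```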


What does this PR do?

Fixes #166

Speed up reference here

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@deependujha (Author)
Is this what was intended, or have I got something wrong?

@aniketmaurya (Collaborator) left a comment

Thank you for opening the PR @deependujha. Looks great so far 🚀

We should give users control over whether decoding runs in a threadpool. That could be useful when decode_request is CPU-intensive or not thread-safe.

Possible API:

class LitServer:
    def __init__(..., concurrent_decode: bool = True)

cc: @lantiga
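A usage sketch of the proposed flag (illustrative only: concurrent_decode was never merged, and AudioAPI is a made-up example API):

```python
import litserve as ls

class AudioAPI(ls.LitAPI):
    def setup(self, device):
        self.model = lambda batch: [len(x) for x in batch]  # stand-in model

    def decode_request(self, request):
        # imagine CPU- or I/O-heavy work here, e.g. decoding audio bytes
        return request["audio"]

    def predict(self, batch):
        return self.model(batch)

    def encode_response(self, output):
        return {"length": output}

# concurrent_decode is the flag proposed above (hypothetical); users would
# disable it when decode_request is not thread-safe or too cheap to benefit.
server = ls.LitServer(AudioAPI(), max_batch_size=8, concurrent_decode=True)
server.run(port=8000)
```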

src/litserve/loops.py: three outdated review threads (resolved)
codecov bot commented Sep 23, 2024

Codecov Report

Attention: Patch coverage is 89.47368% with 2 lines in your changes missing coverage. Please review.

Project coverage is 95%. Comparing base (757525e) to head (be2c204).
Report is 24 commits behind head on main.

Additional details and impacted files
@@         Coverage Diff         @@
##           main   #290   +/-   ##
===================================
- Coverage    95%    95%   -0%     
===================================
  Files        19     19           
  Lines      1244   1260   +16     
===================================
+ Hits       1182   1196   +14     
- Misses       62     64    +2     

@williamFalcon (Contributor)

@deependujha nice! love the "faster" claim haha. do you have a benchmark or something showing that it is indeed faster and by how much?

@deependujha (Author) commented Sep 24, 2024

> @deependujha nice! love the "faster" claim haha. do you have a benchmark or something showing that it is indeed faster and by how much?

Hi @williamFalcon, I stated that based on what was claimed in the original issue. Testing on a 4-GPU machine does not show the same result.

Throughput stays in a similar range (300-380), averaging around 330 (using Aniket's benchmark code).

I'll try testing it with an audio model. If performance doesn't differ much, I'll close the PR.

):
    if concurrent_decode:
        executor = concurrent.futures.ThreadPoolExecutor(max_workers=os.cpu_count())
A review comment from a Collaborator on this hunk:

We can limit the number of threads to the batch size:

Suggested change:

- executor = concurrent.futures.ThreadPoolExecutor(max_workers=os.cpu_count())
+ executor = concurrent.futures.ThreadPoolExecutor(max_workers=min(max_batch_size, os.cpu_count()))
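A note on the design choice: capping the pool at min(max_batch_size, os.cpu_count()) is a sensible bound, since a batch of N requests can keep at most N decode threads busy, and threads beyond the CPU count add scheduling overhead without extra parallelism.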

@aniketmaurya added the enhancement (New feature or request) label on Sep 26, 2024.
@deependujha (Author)

Tough luck: I tested with the audio model, but there was no significant difference in performance. I'll go ahead and close the PR.

@deependujha closed this on Oct 4, 2024.
@deependujha deleted the feat/decode-request-in-threadpool branch on Oct 4, 2024 at 17:08.
@aniketmaurya (Collaborator)

@deependujha what does your decode_request logic look like? Does it do a CPU-intensive operation or I/O?
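For context on why this question matters (an illustration added here, not from the thread): under CPython's GIL, a thread pool speeds up decode_request only when the work releases the GIL, as blocking I/O and many C extensions do; pure-Python CPU work serializes.

```python
import concurrent.futures
import time

def io_bound_decode(req):
    time.sleep(0.01)  # stands in for disk/network reads; sleep releases the GIL
    return req

def cpu_bound_decode(req):
    return sum(i * i for i in range(100_000))  # pure-Python work; holds the GIL

with concurrent.futures.ThreadPoolExecutor() as pool:
    # The sleeps overlap, so the I/O-bound map gets a near-linear speedup.
    list(pool.map(io_bound_decode, range(8)))
    # These calls serialize on the GIL, so there is little to no speedup.
    list(pool.map(cpu_bound_decode, range(8)))
```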

Labels: enhancement (New feature or request)
Linked issue: Map decode_request during dynamic batching using a threadpool (#166)
3 participants