
SGLang JSON Generation from Specified Schema (Parallel Requests) #641

Closed Answered by merrymercy
velocity33 asked this question in Q&A

Yes, this should be straightforward. You can write a multithreaded client in Python with the threading module (https://docs.python.org/3/library/threading.html). Each thread sends its requests to the server, either by calling the backend directly (https://github.com/sgl-project/sglang?tab=readme-ov-file#backend-sglang-runtime-srt) or by going through the frontend (https://github.com/sgl-project/sglang?tab=readme-ov-file#json-decoding).
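As a rough illustration of the backend route, here is a minimal sketch of such a client. It uses `concurrent.futures.ThreadPoolExecutor` (a thin layer over `threading`) instead of managing threads by hand. Several things here are assumptions, not details from this thread: the server address and port, the example schema, the helper names (`build_payload`, `post_json`, `generate_many`), and the use of a `json_schema` entry in `sampling_params`. Check the request format against the README for your SGLang version; some versions may expect a `regex` constraint instead.

```python
import json
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Assumption: an SGLang server is reachable at this address; adjust as needed.
SERVER_URL = "http://localhost:30000/generate"

# Hypothetical schema, used only for illustration.
SCHEMA = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "population": {"type": "integer"},
    },
    "required": ["name", "population"],
}


def build_payload(prompt, schema):
    """Build one request body for a single prompt."""
    return {
        "text": prompt,
        "sampling_params": {
            "temperature": 0,
            "max_new_tokens": 128,
            # Assumption: the server accepts a JSON schema passed as a string.
            "json_schema": json.dumps(schema),
        },
    }


def post_json(payload, url=SERVER_URL, timeout=60):
    """POST a JSON body and return the decoded JSON response."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def generate_many(prompts, schema, send=post_json, max_workers=8):
    """Send one request per prompt from a pool of worker threads.

    Results come back in the same order as `prompts`.
    """
    payloads = [build_payload(prompt, schema) for prompt in prompts]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(send, payloads))


if __name__ == "__main__":
    prompts = [
        "Describe Paris as JSON.",
        "Describe Tokyo as JSON.",
        "Describe Lima as JSON.",
    ]
    for result in generate_many(prompts, SCHEMA):
        print(result)
```

The `send` parameter is there so the transport can be swapped out, for example for a stub when no server is running. Because the threads spend most of their time waiting on the network, plain threads are usually enough for this kind of client.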

Answer selected by merrymercy