Skip to content

How to get very long KV cached #102

Closed Answered by merrymercy
pj-ml asked this question in Q&A
Discussion options

You must be logged in to vote

@pj-ml Thanks for your interest. SGLang runtime is just designed for this and can greatly accelerate your workloads.
Please see #106 (comment)

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by merrymercy
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants