[Misc]: Ask for the roadmap of async output processing support for speculative decoding #10387

Lin-Qingyang-Alec · 2024-11-16T10:14:25Z

When will speculative decoding support async output processing? It is a really important feature.


Lin-Qingyang-Alec added the misc label Nov 16, 2024
