LLM request cancellations are not reliably propagated to Cody Gateway, so token inference continues until the maximum token limit is reached. This significantly increases load on our inference providers, raising both latency and spend. By the latest estimate, about 2/3 of inferred tokens are "overhead" tokens.
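A minimal sketch of the missing behavior: when a client aborts its request, the gateway should forward that cancellation to the in-flight upstream inference request so the provider stops generating tokens. The helper name `forwardAbort` and the wiring below are illustrative assumptions, not Cody Gateway's actual API.

```typescript
// Hypothetical sketch: wire a client request's abort signal to the
// controller of the corresponding upstream inference request, so that
// cancelling the client call also cancels token generation upstream.
function forwardAbort(clientSignal: AbortSignal, upstream: AbortController): void {
    if (clientSignal.aborted) {
        // Client already cancelled before we started the upstream call.
        upstream.abort()
        return
    }
    // Propagate a future client cancellation to the upstream request.
    clientSignal.addEventListener('abort', () => upstream.abort(), { once: true })
}

// Example wiring (endpoint URL is a placeholder):
// const upstream = new AbortController()
// forwardAbort(clientRequestSignal, upstream)
// await fetch('https://inference-provider.example/v1/complete', {
//     method: 'POST',
//     signal: upstream.signal,
// })
```

With this wiring in place, an aborted client request tears down the upstream stream immediately instead of letting inference run to the token limit.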
See the Google Sheet with the pricing breakdown and approximate potential savings.
This issue is marked as stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed automatically in 5 days.