

Chat/Commands: Increase output token limit for PLG #3797

Merged · 15 commits merged from tr/increase-output-tokens into main on Apr 17, 2024

Conversation


@umpox umpox commented Apr 15, 2024

Description

closes #3648
closes #3795
closes https://github.com/sourcegraph/cody-issues/issues/14
closes #3779
closes #3786
closes #3784
closes #3721
closes #3571
closes #3472
closes #3536

This PR:

  • Increases the output token limit for Sourcegraph.com (PLG) Cody usage from 1,000 to 4,000 tokens
  • Removes a redundant reduction in context size, which adds an extra 1,000 tokens to our context window. The reduction is only necessary on certain enterprise models where the input and output lengths are computed together.
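The accounting change above can be sketched as follows. The constant and helper names here are illustrative assumptions, not the actual Cody source:

```typescript
// Hedged sketch of the budget change: a token-to-character heuristic and
// the before/after input budgets. All names and numbers are illustrative.
const CHARS_PER_TOKEN = 4;

function tokensToChars(tokens: number): number {
    return tokens * CHARS_PER_TOKEN;
}

// Before: 1,000 output tokens were reserved, and that reservation was
// subtracted from the input budget even when input and output limits
// are independent.
const OLD_ANSWER_TOKENS = 1000;
const oldInputChars = (maxInputChars: number): number =>
    maxInputChars - tokensToChars(OLD_ANSWER_TOKENS);

// After: 4,000 output tokens for PLG, and the input budget is used in
// full for models with separate input/output limits.
const newInputChars = (maxInputChars: number): number => maxInputChars;

console.log(oldInputChars(28000)); // 24000
console.log(newInputChars(28000)); // 28000
```

With a 28,000-character input budget, the old code effectively gave away 4,000 characters of context that only combined-window models actually need reserved.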

Note

This PR requires that https://github.com/sourcegraph/sourcegraph/pull/61872 is merged first

Test plan

  1. Run https://github.com/sourcegraph/sourcegraph/pull/61872 locally
  2. Trigger long context responses (e.g. "Show me 300 lines of typescript code")
  3. Expect that everything functions as normal

Comment on lines +603 to +604
// Minus the character limit reserved for the answer token
this.chatModel.maxInputChars - tokensToChars(ANSWER_TOKENS)
Contributor Author

This logic only makes sense for older GPT models where the input+output limit is calculated together.


For enterprise, we will need to address this separately.
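The distinction the comment describes can be sketched like this; the types and numbers are illustrative, not the real model configs:

```typescript
// Hedged sketch: a shared context window (older GPT-style models) vs.
// independent input/output limits. Names are assumptions for illustration.
interface ModelLimits {
    contextWindow: number;   // total tokens available for input
    separateOutput?: number; // set when the model has an independent output limit
}

function maxInputTokens(limits: ModelLimits, answerTokens: number): number {
    // When input and output share one window, room must be reserved
    // for the answer; otherwise the full window is usable as input.
    if (limits.separateOutput === undefined) {
        return limits.contextWindow - answerTokens;
    }
    return limits.contextWindow;
}

console.log(maxInputTokens({ contextWindow: 8192 }, 1000)); // 7192
console.log(maxInputTokens({ contextWindow: 8192, separateOutput: 4000 }, 1000)); // 8192
```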

@umpox umpox changed the title Chat/Commands: Increase output token limit Chat/Commands: Increase output token limit for PLG Apr 15, 2024
@umpox umpox marked this pull request as ready for review April 15, 2024 14:23
@umpox umpox requested review from chrsmith, abeatrix and a team April 15, 2024 14:31
Comment on lines 97 to 111
maxToken: 1800,
maxInputToken: 1800,
Contributor

unrelated but it seems like we can use the "standard" 7k now :) https://fireworks.ai/models/fireworks/mixtral-8x22b-instruct-preview

Contributor

@philipp-spiess philipp-spiess left a comment

LGTM :)

/**
* @deprecated Use `inputTokens` instead.
*/
tokens?: number
Contributor

We could just remove it since this was not included in any release yet?

Contributor Author

Yep, I can do that. My only concern is that this might break any internal stuff we've configured, but I guess it's still fresh, so that's not much of a worry.
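If the deprecated field were kept for a transition period, a small migration shim along these lines could keep older internal configs working. The field names follow the quoted snippet; the fallback default is an assumption:

```typescript
// Hedged sketch of reading a token config with a deprecated field.
// `tokens` is the old name; `inputTokens` is the replacement.
interface ModelTokenConfig {
    /** @deprecated Use `inputTokens` instead. */
    tokens?: number;
    inputTokens?: number;
    outputTokens?: number;
}

function resolveInputTokens(config: ModelTokenConfig, fallback = 7000): number {
    // Prefer the new field, fall back to the deprecated one, then a default.
    return config.inputTokens ?? config.tokens ?? fallback;
}

console.log(resolveInputTokens({ inputTokens: 4000 })); // 4000
console.log(resolveInputTokens({ tokens: 1800 }));      // 1800
console.log(resolveInputTokens({}));                    // 7000
```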


‼️ Hey @sourcegraph/cody-security, please review this PR carefully as it introduces usage of unsafe_ functions or abuses PromptString.

@umpox
Contributor Author

umpox commented Apr 16, 2024

@sourcegraph/cody-security Guessing this was triggered because of the merge commit? Maybe we should exclude those?

@philipp-spiess
Contributor

@umpox Ah no this is because we look at the files and not just the modified ranges within those files. Shoots. This actually needs a much more sophisticated approach (we need to know which ranges were modified and only emit linter issues in those places).

Hmmm maybe as a temporary workaround we can extract the unsafe_ calls in "hot" paths (like in SimpleChatPanelProvider.ts in this case) into a separate module that we don't need to touch so often? cc @sourcegraph/cody-security

I will research how other linters do that. Sorry for the churn @umpox and please ignore it for now in this PR 🥺 🙏
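The range-aware approach described above (only emitting linter issues inside modified ranges) could look roughly like this sketch; the types and names are hypothetical, not the actual linter code:

```typescript
// Hedged sketch: filter lint findings down to ranges a PR actually touched.
// `Range` uses inclusive 1-based line numbers, as a diff hunk would report.
interface Range {
    start: number;
    end: number;
}

function inModifiedRange(line: number, modified: Range[]): boolean {
    return modified.some(r => line >= r.start && line <= r.end);
}

function filterFindings<T extends { line: number }>(
    findings: T[],
    modified: Range[]
): T[] {
    // Drop findings that fall outside every modified hunk, so pre-existing
    // unsafe_ usages in an untouched part of the file do not re-trigger.
    return findings.filter(f => inModifiedRange(f.line, modified));
}
```

In practice the modified ranges would come from parsing the PR's diff hunks, which is the "more sophisticated approach" the comment alludes to.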


@umpox
Contributor Author

umpox commented Apr 16, 2024

:D @philipp-spiess No probs

@philipp-spiess
Contributor

@umpox fix incoming: #3810

@umpox umpox force-pushed the tr/increase-output-tokens branch from 6138f91 to 9b8a4af on April 17, 2024 09:07
@umpox umpox merged commit 4b97c0d into main Apr 17, 2024
35 checks passed
@umpox umpox deleted the tr/increase-output-tokens branch April 17, 2024 09:33