use max_completion_tokens for o1 models, and max_tokens for the rest #573

nikochiko · 2024-12-24T15:26:04Z

Q/A checklist

If you add new dependencies, did you update the lock file?

poetry lock --no-update

Run tests

ulimit -n unlimited && ./scripts/run-tests.sh

Do a self code review of the changes - Read the diff at least twice.
Carefully think about the stuff that might break because of this change - this sounds obvious but it's easy to forget to do "Go to references" on each function you're changing and see if it's used in a way you didn't expect.
The relevant pages still run when you press submit
The API for those pages still work (API tab)
The public API interface doesn't change if you didn't want it to (check API tab > docs page)
Do your UI changes (if applicable) look acceptable on mobile?
Ensure you have not regressed the import time unless you have a good reason to do so.
You can visualize this using tuna:

python3 -X importtime -c 'import server' 2> out.log && tuna out.log

To measure import time for a specific library:

$ time python -c 'import pandas'

________________________________________________________
Executed in    1.15 secs    fish           external
   usr time    2.22 secs   86.00 micros    2.22 secs
   sys time    0.72 secs  613.00 micros    0.72 secs

To reduce import times, import libraries that take a long time inside the functions that use them instead of at the top of the file:

def my_function():
    import pandas as pd
    ...

Legal Boilerplate

Look, I get it. The entity doing business as “Gooey.AI” and/or “Dara.network” was incorporated in the State of Delaware in 2020 as Dara Network Inc. and is gonna need some rights from me in order to utilize my contributions in this PR. So here's the deal: I retain all rights, title and interest in and to my contributions, and by keeping this boilerplate intact I confirm that Dara Network Inc can use, modify, copy, and redistribute my contributions, under its choice of terms.

greptile-apps

PR Summary

Added logic to handle max_tokens parameter differently between OpenAI's o1 models and other models in the language model integration.

Modified run_openai_chat() to use max_tokens for o1 models (o1-preview, o1-mini) and max_completion_tokens for all other models
Added conversion of system messages to user messages specifically for o1 models
Set max_tokens=NOT_GIVEN for o1 models and max_completion_tokens=NOT_GIVEN for other models to avoid parameter conflicts
Updated model specifications in LargeLanguageModels enum to include o1 model variants with appropriate context windows

_{💡 (2/5) Greptile learns from your feedback when you react with 👍/👎!}

_{1 file(s) reviewed, no comment(s)}
_{Edit PR Review Bot Settings | Greptile}

use max_completion_tokens for o1 models, and max_tokens for the rest

b36524f

greptile-apps bot reviewed Dec 24, 2024

View reviewed changes

devxpy approved these changes Dec 24, 2024

View reviewed changes

nikochiko merged commit 9668df9 into master Dec 24, 2024
5 of 6 checks passed

nikochiko deleted the max_tokens_and_max_completion_tokens branch December 24, 2024 16:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use max_completion_tokens for o1 models, and max_tokens for the rest #573

use max_completion_tokens for o1 models, and max_tokens for the rest #573

nikochiko commented Dec 24, 2024

greptile-apps bot left a comment

use max_completion_tokens for o1 models, and max_tokens for the rest #573

use max_completion_tokens for o1 models, and max_tokens for the rest #573

Conversation

nikochiko commented Dec 24, 2024

Q/A checklist

Legal Boilerplate

greptile-apps bot left a comment

Choose a reason for hiding this comment

PR Summary