Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Openrouter Provider preferences #286

Open
Munsio opened this issue Jul 25, 2024 · 2 comments
Open

Openrouter Provider preferences #286

Munsio opened this issue Jul 25, 2024 · 2 comments
Assignees
Labels
bug Something isn't working enhancement New feature or request
Milestone

Comments

@Munsio
Copy link
Contributor

Munsio commented Jul 25, 2024

It seems that Openrouter now has a few provider which provide a lower quantization size than others, we need to ensure with our calls to openrouter that we are not going to mix those with multiple requests.

Providers with different quantization:
https://openrouter.ai/models/meta-llama/llama-3.1-8b-instruct/status

@Munsio Munsio added bug Something isn't working enhancement New feature or request labels Jul 25, 2024
@Munsio Munsio added this to the v0.6.0 milestone Jul 25, 2024
@Munsio Munsio self-assigned this Jul 25, 2024
@Munsio
Copy link
Contributor Author

Munsio commented Jul 25, 2024

Provider Routing for Openrouter:
https://openrouter.ai/docs/provider-routing

@Munsio
Copy link
Contributor Author

Munsio commented Jul 25, 2024

This is a derivation of the OpenAI-API which means we need to gate the feature for OpenRouter by either forking the openAI api library we use or switch the API for OpenRouter completely.

Also what we maybe should do is creating our own API for OpenAI by generating everything from the documentation itself, with that we can at least add such custom parts as the original API library does not provide the necessary Public methods to do that.

@bauersimon bauersimon modified the milestones: v0.6.0, v0.7.0 Jul 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants