It seems that OpenRouter now has a few providers that serve some models at a lower quantization than others. We need to ensure that our calls to OpenRouter do not mix those quantizations across multiple requests.
OpenRouter's API is a derivative of the OpenAI API, which means we need to gate this feature for OpenRouter by either forking the OpenAI API library we use or switching to a dedicated OpenRouter API entirely.
Maybe we should also generate our own OpenAI client from the API documentation itself. That way we could at least add such custom parts ourselves, since the original API library does not expose the public methods needed to do that.
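For reference, OpenRouter documents a provider-routing extension in the request body that includes a `quantizations` filter, which is exactly the kind of OpenAI-incompatible field the stock client library cannot express. A minimal sketch of building such a request payload (field names `provider`, `quantizations`, and `allow_fallbacks` are taken from OpenRouter's provider-routing docs as I understand them; treat the exact shape as an assumption to verify):

```python
def build_openrouter_payload(model: str, messages: list[dict],
                             quantizations: list[str]) -> dict:
    """Build a chat-completions body pinned to specific quantizations.

    The `provider` object is an OpenRouter-specific routing extension,
    not part of the upstream OpenAI API, so the regular OpenAI client
    library has no public way to set it.
    """
    return {
        "model": model,
        "messages": messages,
        "provider": {
            # Only consider providers serving the model at one of these
            # quantization levels, so repeated requests stay consistent.
            "quantizations": quantizations,
            # Don't silently fall back to providers outside the filter.
            "allow_fallbacks": False,
        },
    }


payload = build_openrouter_payload(
    "meta-llama/llama-3.1-8b-instruct",
    [{"role": "user", "content": "Hello"}],
    ["fp16", "bf16"],
)
print(payload["provider"]["quantizations"])
```

This payload would then be POSTed to OpenRouter's chat-completions endpoint; the point is that gating has to happen at the request-body level, which is why forking the library or generating our own client comes up at all.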
Providers serving different quantizations:
https://openrouter.ai/models/meta-llama/llama-3.1-8b-instruct/status