Ollama support for LLM backend #97
Somewhat related to #65

In the past I've used Ollama for local inference for LLMs. Would it be useful to add this support to the library? I'd be happy to work on this to add support for using an Ollama API endpoint for the LLM.

Comments
Can Ollama use APIs? There is a PR open for the API part.
@andimarafioti I think the PR is only for the OpenAI API. I was just suggesting offering Ollama API support. In particular, it's possible to run an Ollama server (either locally or on a slightly better machine) and then query it using the Ollama REST API. The changes/additions would be similar to the open PR #81, but with a class for Ollama API support and for querying Ollama endpoints.
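To make the suggestion concrete, here is a minimal sketch of what such a class might look like. The handler name, constructor arguments, and default model name are assumptions for illustration (the project's actual interface would be shaped like the handler in PR #81); the `/api/chat` endpoint and default port 11434 come from Ollama's documented REST API.

```python
import requests

# Ollama's REST API listens on port 11434 by default; assumed local here.
OLLAMA_URL = "http://localhost:11434"


class OllamaLanguageModelHandler:
    """Hypothetical handler that sends chat turns to an Ollama server.

    The class name and interface are illustrative only, sketched after
    the OpenAI-API handler proposed in PR #81.
    """

    def __init__(self, model_name="llama3.1", base_url=OLLAMA_URL):
        # model_name is an assumed default; any model pulled into the
        # Ollama server can be used.
        self.model_name = model_name
        self.base_url = base_url

    def generate(self, messages):
        # POST /api/chat is part of Ollama's documented REST API.
        # stream=False returns a single JSON object instead of chunks.
        response = requests.post(
            f"{self.base_url}/api/chat",
            json={
                "model": self.model_name,
                "messages": messages,
                "stream": False,
            },
            timeout=60,
        )
        response.raise_for_status()
        return response.json()["message"]["content"]


if __name__ == "__main__":
    handler = OllamaLanguageModelHandler()
    print(handler.generate([{"role": "user", "content": "Hello!"}]))
```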
This would be great, since some people already run Ollama. Like me :)
Happy to work on this at some point this week. There's added functionality in Ollama as of last week to run any GGUF model from the HF Hub too: https://huggingface.co/docs/hub/en/ollama
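For context, the linked doc describes pulling any public GGUF repository with `ollama run hf.co/{username}/{repository}`; the same hf.co reference can then be used as the model name over the REST API. A small sketch of that, assuming a local server; the repository below is a placeholder example, not a specific recommendation:

```python
import requests

# Per the linked HF Hub docs, any public GGUF repo can be pulled with:
#   ollama run hf.co/{username}/{repository}
# Once pulled, the same hf.co reference works as a model name in the REST API.
response = requests.post(
    "http://localhost:11434/api/generate",  # assumed default Ollama address
    json={
        # Hypothetical example repository, used only for illustration.
        "model": "hf.co/bartowski/Llama-3.2-1B-Instruct-GGUF",
        "prompt": "Say hello in one sentence.",
        "stream": False,
    },
    timeout=120,
)
response.raise_for_status()
print(response.json()["response"])
```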