WIP draft of generate() outside of chat.answer() #432
base: main
Conversation
…e to ingest messages, if appropriate.
This PR needs to coordinate with #438 - wherever the final pattern lands there, I need to integrate the generate() and answer() (_stream()) ingest points. This is also modulo however we expect to pass the prompt str or messages, etc., through the pre-process stage. Will make this a point of conversation in our Monday meeting.
@pmeier - I'd already built out most of the assistant examples, so I just did them all in this pass - but we don't have to discuss them all for the first round; let's just pick _anthropic and _openai and see how those are looking specifically. In the interest of helping this merge quickly, I'll also pull exl2 out into another PR that's just about integrating it as an assistant on its own.
@pmeier - seems like at this point it makes more sense to rebase the changes onto the latest corpus-dev unless that's causing trouble somewhere - if that works I'll make that PR and then close this one, pointing to the new version.
@dillonroach I pushed 8e120c6. The two main changes I made are:

- Fix the return types of `.generate()` for all assistants. It now returns an async iterator of the full data the API returns. Since this function is aimed as a general building block for users, we shouldn't assume they only want the text. However, we can make this assumption for `.answer()`. Meaning, this function calls `.generate()` and extracts the text.
- The `_render_prompt` and corresponding functions treated incoming messages as dictionaries. `ragna.core.Message` is a proper object and thus key access will not work. Furthermore, `message.role` is the `ragna.core.MessageRole` enum and cannot be used directly. Instead, you need to use `message.role.value` if you want to have the string.
With my commit, CI should be green. Note that I base my review on the latest commit, i.e. including my cleanup. Thus, please make sure to review what I have pushed, because the comments otherwise might make no sense.
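For illustration, a minimal sketch of that split - the class name, the chunk dicts, and the method signatures below are assumptions made for the example, not ragna's actual interfaces:

```python
from collections.abc import AsyncIterator

from ragna.core import Message


class DemoAssistant:
    """Illustrative only; not the actual ragna assistant interface."""

    async def generate(self, messages: list[Message]) -> AsyncIterator[dict]:
        # Building block for users: yield the full data the API returns,
        # not just the text. Two fake chunks stand in for a real API call.
        for chunk in (
            {"text": "Hello", "finish_reason": None},
            {"text": " world", "finish_reason": "stop"},
        ):
            yield chunk

    async def answer(self, messages: list[Message]) -> AsyncIterator[str]:
        # answer() may assume the caller only wants the text, so it calls
        # generate() and extracts the text from each chunk.
        async for chunk in self.generate(messages):
            yield chunk["text"]

    def _render_prompt(self, messages: list[Message]) -> list[dict[str, str]]:
        # Message is a proper object, so key access like message["role"]
        # fails. Use attribute access, and message.role.value for the string.
        return [
            {"role": message.role.value, "content": message.content}
            for message in messages
        ]
```

The point is only that generate() stays a general-purpose building block yielding whatever the API returns, while answer() layers the text extraction on top.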
"preamble_override": self._make_preamble(), | ||
"message": prompt, | ||
"preamble_override": system_prompt, | ||
"message": self._render_prompt(prompt), |
While the message indeed can only be a single string here, the endpoint has a `chat_history` parameter, and that one takes the previous messages, similar to all other assistants. I would let `_render_prompt` return a tuple of the chat history and the current user message, e.g.

chat_history, message = self._render_prompt(prompt)
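A rough sketch of that shape, assuming `Message` objects with `role`/`content` attributes and a `SYSTEM` member on `MessageRole`; the standalone function name and the dict keys in the history entries are illustrative:

```python
from ragna.core import Message, MessageRole


def render_prompt(messages: list[Message]) -> tuple[list[dict[str, str]], str]:
    """Sketch of the suggested _render_prompt body for this assistant."""
    # Drop system messages (the preamble is passed separately) and split
    # the rest into prior history plus the current user prompt.
    *history, current = [m for m in messages if m.role is not MessageRole.SYSTEM]
    chat_history = [
        {"role": m.role.value, "message": m.content} for m in history
    ]
    return chat_history, current.content
```

The assistant could then unpack it as in the suggestion above, passing chat_history and the current message to the endpoint separately.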
Conflicted on this one - it seems like this would fit specifically with the pre-process pass; otherwise it puts this one specific assistant ahead of the others in terms of capabilities. It certainly doesn't hurt anything, so I'm happy to do it here, but I could also see us first deciding what a pre-process stage looks like for all assistants and implementing it in one go.
Takes the format from chat.answer and duplicates it, without sources, as chat.generate, which then gets implemented in Assistant as well. Got a bit turned around with the async aspect in the stream=False case, so there is room for cleanup. The exl2 assistant is currently instantiating a number of vars it likely should be taking from Chat instead (e.g. stream: bool).
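For reference, a toy sketch of the answer()/generate() duplication described here - the constructor, the attribute names, the Message construction, and the assistant call signatures are assumptions, not the WIP code itself:

```python
from ragna.core import Message, MessageRole


class ChatSketch:
    """Toy stand-in for the WIP Chat changes; names are illustrative."""

    def __init__(self, assistant):
        self._assistant = assistant
        self._messages: list[Message] = []

    async def answer(self, prompt: str) -> str:
        # answer() keeps the existing flow: retrieve sources (stubbed here),
        # feed the conversation to the assistant, and return extracted text.
        sources = self._retrieve_sources(prompt)
        self._messages.append(
            Message(content=prompt, role=MessageRole.USER, sources=sources)
        )
        chunks = [c async for c in self._assistant.answer(self._messages)]
        return "".join(chunks)

    async def generate(self, prompt: str) -> list[dict]:
        # generate() duplicates the same shape without source retrieval and
        # returns the raw data the assistant's API yields.
        self._messages.append(Message(content=prompt, role=MessageRole.USER))
        return [c async for c in self._assistant.generate(self._messages)]

    def _retrieve_sources(self, prompt: str) -> list:
        return []  # placeholder for the real source storage lookup
```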