feat: add caching if prompt request fails #148

Open
wants to merge 18 commits into main
Conversation

Matthieu-OD (Collaborator) commented Nov 14, 2024

Add Prompt Caching Functionality

Changes

  • Implemented a prompt caching system using a separate class SharedCachePrompt

Implementation Details

  • Cache keys are generated in three formats:
    • id
    • name
    • tuple(name, version)
  • Caching logic is implemented in both the sync and async get_prompt methods
  • On each successful get_prompt call, the prompt is cached under all three key formats: id, name, and tuple(name, version)
  • Added a fallback to cached prompts when API calls fail, with warning logs
  • Keep in mind that this cache is in-memory, so it is cleared each time the application restarts (see the sketch below)
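A minimal sketch of what this could look like (the SharedCachePrompt class name and the (id, name, version) lookup come from this PR; the put/get method names and internals here are illustrative assumptions, not the merged code):

import logging
from threading import Lock
from typing import Any, Dict, Optional, Tuple, Union

logger = logging.getLogger(__name__)

# A cache key is either an id, a name, or a (name, version) pair
Key = Union[str, Tuple[str, int]]

class SharedCachePrompt:
    """In-memory prompt cache, keyed by id, name, and (name, version)."""

    def __init__(self) -> None:
        self._cache: Dict[Key, Any] = {}
        self._lock = Lock()  # shared by the sync and async get_prompt paths

    def put(self, prompt: Any) -> None:
        # Store the prompt under all three key formats so a later
        # lookup by id, by name, or by (name, version) can all hit
        with self._lock:
            self._cache[prompt.id] = prompt
            self._cache[prompt.name] = prompt
            self._cache[(prompt.name, prompt.version)] = prompt

    def get(
        self,
        id: Optional[str] = None,
        name: Optional[str] = None,
        version: Optional[int] = None,
    ) -> Optional[Any]:
        # Most specific key first: id, then (name, version), then name
        with self._lock:
            if id is not None:
                return self._cache.get(id)
            if name is not None and version is not None:
                return self._cache.get((name, version))
            if name is not None:
                return self._cache.get(name)
            return None

The fallback described above would then look roughly like this inside get_prompt (fetch_prompt_from_api is a hypothetical stand-in for the actual API call):

try:
    prompt = fetch_prompt_from_api(id=id, name=name, version=version)
    cache.put(prompt)  # refresh all three keys on every successful fetch
    return prompt
except Exception:
    cached = cache.get(id, name, version)
    if cached is not None:
        logger.warning("Prompt API call failed; falling back to cached prompt.")
        return cached
    raise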

Test Commands

Test Code
from fastapi import FastAPI, HTTPException
from typing import Optional
from literalai import LiteralClient

app = FastAPI()
client = LiteralClient(
    url="http://localhost:3000", api_key="my-initial-api-key"
)

@app.get("/prompt")
async def get_prompt(
    id: Optional[str] = None,
    name: Optional[str] = None,
    version: Optional[int] = None,
    should_fail: bool = False  # For testing failure scenarios
):
    # Inspect the current cache keys (internal attribute, for testing only)
    print(client.api._prompt_cache.keys())
    try:
        if should_fail:
            # Temporarily break the API key to force the API call to fail,
            # so get_prompt has to fall back to the cache
            original_key = client.api.api_key
            client.api.api_key = "invalid_key"
            try:
                prompt_from_cache = client.api.get_prompt(
                    id=id, name=name, version=version
                )
                return {
                    "success": True,
                    "prompt_id": prompt_from_cache.id,
                    "cache_hit": True
                }
            finally:
                # Always restore the valid key, even if the lookup raised
                client.api.api_key = original_key
        else:
            prompt = client.api.get_prompt(id=id, name=name, version=version)
            return {
                "success": True,
                "prompt_id": prompt.id,
                "cache_hit": False
            }
    except Exception as e:
        raise HTTPException(status_code=500, detail=str(e))

Install dependencies

pip install fastapi uvicorn literalai

Run test server

uvicorn test_cache:app --reload

Test normal flow

curl "http://localhost:8000/prompt?name=example_prompt"

Test cache with failure

First call to populate cache

curl "http://localhost:8000/prompt?name=example_prompt"

Second call with simulated failure

curl "http://localhost:8000/prompt?name=example_prompt&should_fail=true"

Test with ID

curl "http://localhost:8000/prompt?id=prompt_123"

Test with name and version

curl "http://localhost:8000/prompt?name=example_prompt&version=1"

@Matthieu-OD Matthieu-OD self-assigned this Nov 14, 2024
@Matthieu-OD Matthieu-OD force-pushed the matt/eng-2115-add-client-caching-for-prompts branch from 6136af7 to 8774a00 on November 14, 2024 10:07
@Matthieu-OD Matthieu-OD marked this pull request as ready for review November 14, 2024 10:24
@Matthieu-OD Matthieu-OD marked this pull request as draft November 14, 2024 13:23
@Matthieu-OD Matthieu-OD marked this pull request as ready for review November 14, 2024 14:15
@Matthieu-OD Matthieu-OD force-pushed the matt/eng-2115-add-client-caching-for-prompts branch from ae555e3 to 5bcdce4 on November 14, 2024 14:44
raise ValueError("Either the `id` or the `name` must be provided.")

sync_api = LiteralAPI(self.api_key, self.url)
cached_prompt = self.prompt_cache.get(id, name, version)
timeout = 1 if cached_prompt else None
Contributor:
you could move the cache logic in the get_prompt_helper to avoid duplicating it for the sync/async versions.
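
For illustration, the shared helper could resolve the validation, cache lookup, and timeout once for both code paths. A rough sketch (get_prompt_helper is named in the review; this signature and body are assumptions based on the diff above):

def get_prompt_helper(cache, id=None, name=None, version=None):
    # Validation and cache lookup shared by sync and async get_prompt
    if id is None and name is None:
        raise ValueError("Either the `id` or the `name` must be provided.")
    cached_prompt = cache.get(id, name, version)
    # Use a short timeout when a cached fallback is available,
    # and no explicit timeout otherwise
    timeout = 1 if cached_prompt else None
    return cached_prompt, timeout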

@@ -212,7 +265,7 @@ def raise_error(error):
self.graphql_endpoint,
json={"query": query, "variables": variables},
headers=self.headers,
timeout=10,
timeout=timeout,
Contributor:
I don't see the modification for the async version of make_gql_call

Matthieu-OD (Collaborator, Author) commented Nov 15, 2024:
It is on lines 1519 and 1532 of the same file.

willydouhard (Contributor) left a comment:
Looks promising! I would love to see a test for it.

@Matthieu-OD Matthieu-OD force-pushed the matt/eng-2115-add-client-caching-for-prompts branch 7 times, most recently from 2f9628c to 3433ee1 on November 18, 2024 13:44
@Matthieu-OD Matthieu-OD force-pushed the matt/eng-2115-add-client-caching-for-prompts branch 2 times, most recently from 7ffb0b3 to 896e303 on November 18, 2024 13:54
@Matthieu-OD Matthieu-OD force-pushed the matt/eng-2115-add-client-caching-for-prompts branch from 896e303 to 3730581 on November 18, 2024 13:58
Matthieu-OD (Collaborator, Author) commented:
After discussing with @clementsirieix, I'm changing the implementation to avoid an overly specific or complex solution. I'm also increasing the timeout from 1 to 2 seconds.

@Matthieu-OD Matthieu-OD force-pushed the matt/eng-2115-add-client-caching-for-prompts branch from 0bd9044 to 85c72d1 on November 20, 2024 09:52