-
Notifications
You must be signed in to change notification settings - Fork 919
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[computer use] Adjusting 'Only send N most recent images' does not seem to work #118
Comments
it seems to be that
It is only the case where the For example, in the case of "open terminal", the third request will have 3 total images (screenshot, move cursor, and click). If i set A few options for routes to resolution:
I'd vote for option 2. |
I think I can hazard an answer at this one. If the new HTTP json payload (convo) it's just the previous one with a message added to the end, Claude can cache the previous one, load that into the LLM-state, and just run the diff-tokens thru. Therefore, if you are constantly keeping the last 5 images, you're fucking the cache-mechanism, as EVERY new convo you send to Claude fails this criterion (there's a change somewhere in the middle where the 6-th-to-last image used to be, now nuked). So removing images invalidates the cache, ergo you gotta strike a balance. |
@p-i- not sure I understand... In my (admittedly limited/mvp-level) usages, I really only need to send the latest image. I certainly don't need to send the first image (the same generic screen everyone sends on their first request) more than the first request. I imagine I only ever would need to send the latest image, or maybe two if the request is checking for a change. |
@bmacer hmm now that's an interesting idea. Calculate the diff between the current and last image. And send the diff together with the new image. intrrdastingk. |
When I set "Only send N most recent images" to 1, the requests continue to include all previous images.
Simple example is "open terminal":
The issue is in the three
data
values.The three data values are for:
The expectation is that only the last will be included, while the others will be redacted in some way.
Short video:
https://github.com/user-attachments/assets/cc47139e-7bc4-450a-817f-4f74cf1161f8
The text was updated successfully, but these errors were encountered: