Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VisualWebArena agent #6

Open
wants to merge 11 commits into
base: dev
Choose a base branch
from
Open

VisualWebArena agent #6

wants to merge 11 commits into from

Conversation

gasse
Copy link
Collaborator

@gasse gasse commented Aug 1, 2024

No description provided.

@gasse gasse requested a review from ThibaultLSDC August 1, 2024 15:13
@ThibaultLSDC
Copy link
Collaborator

ThibaultLSDC commented Aug 15, 2024

I have updated generic_agent to accept goal_images. it alters slightly the original behavior when using the screenshot as input.
original behavior:
System message
----text: system_prompt
Human message
---- text: prompt
---- image: screenshot

New behavior:
System message
---- text: system_prompt
Human message
---- text: prompt
---- text: IMAGE (1) - screenshot input # that would appear even without goal_image
---- image: screenshot
---- text: image (2) - image input # if there is a goal_image
---- image: goal_image 1
....

@ThibaultLSDC ThibaultLSDC changed the base branch from main to dev October 15, 2024 18:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants