Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a USER PROXY agent #54

Open
daveshap opened this issue Sep 28, 2024 · 1 comment
Open

Create a USER PROXY agent #54

daveshap opened this issue Sep 28, 2024 · 1 comment

Comments

@daveshap
Copy link
Owner

daveshap commented Sep 28, 2024

Overview

This agent will serve as a stand-in for the user, providing critical feedback and verification to the EXPERT/SOLVER agent

General requirements:

  1. Evaluate the EXPERT/SOLVER as to whether it is correct, stuck, failed, etc. (this is very hard)
  2. Provide that feedback to the EXPERT/SOLVER or create an exit code
  3. Exit codes should include options such as SUCCESS, FAILURE, CLARIFICATION, etc.

Feedback

Generally, less is more, with small critiques, encouragement, and steering. For instance:

  • "Close, but not quite"
  • "You're almost there"

Although sometimes more steering and specificity is helpful, such as:

  • "This is not quite coherent"
  • "Check the word length"
  • "Try a slightly different approach"

Exit Codes

  1. SUCCESS - if the user proxy determines that the EXPERT/SOLVER is correct and successful, it simply outputs SUCCESS
  2. FAILURE - if the user proxy determines the model is incapable of solving the problem, or confidently believes the wrong answer, FAILURE
  3. CLARIFICATION - if the user proxy detects that there is inadequate information or resources, it can exit, asking for clarification (e.g. a message sent back to the user to provide more context)
@daveshap
Copy link
Owner Author

See this original conversation where a human played the role of user, which we need to approximate automatically: #10

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant