v0.3: Claude 3, Sandboxed Python #238
ianarawjo announced in Announcements
- Adds new Anthropic Claude 3 models.
- messages API for Claude 2.1+ models.
- system message parameter in Claude settings.

Adds browser-sandboxed Python with pyodide
You can now run Python in a safe sandbox entirely in the browser, provided you do not need to import third-party libraries.
The web-hosted version at chainforge.ai/play now has Python evaluators unlocked.
The local version of ChainForge includes a toggle to turn sandboxing on or off.
If you turn sandboxing off, you go back to the previous Python evaluator, executed on your local machine through the Flask backend. In the non-sandboxed eval node you can import any libraries available in your Python environment.
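As a rough illustration of an evaluator that fits the sandbox constraint, the sketch below uses only the standard library, so it should not need anything beyond what ships with pyodide. It assumes the evaluator is a function named evaluate that receives an object with a text attribute; FakeResponse is a hypothetical stand-in for that object, included only so the snippet runs on its own.

```python
import json
from dataclasses import dataclass


@dataclass
class FakeResponse:
    """Hypothetical stand-in for the response object passed to an evaluator."""
    text: str


def evaluate(response):
    """Return True if the response text parses as a JSON object."""
    try:
        return isinstance(json.loads(response.text), dict)
    except json.JSONDecodeError:
        # Not valid JSON at all, so the check fails.
        return False


if __name__ == "__main__":
    print(evaluate(FakeResponse('{"answer": 42}')))
    print(evaluate(FakeResponse("no JSON here")))
```

An evaluator that imports a third-party package (for example, one installed in your local environment) would instead need sandboxing turned off.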
Why sandboxing?
The benefit of sandboxing is that ChainForge can now be used to safely execute Python code generated by LLMs, using eval() or exec() in your evaluation function. This was possible before, but unsafe, since the generated code ran directly on your local machine. Benchmarks that do not rely on third-party libraries, like HumanEval at a pass@1 rate, could be run within ChainForge entirely in the web browser (if anyone wants to set this up, let me know!).
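A minimal sketch of what such an evaluation function might look like: it runs the LLM's code with exec(), then runs a few assertions against it, and reports whether everything passed. The names passes_tests and TEST_CODE, the add task, and the SimpleNamespace stand-in for the response object are all hypothetical and chosen for illustration. This pattern is only reasonable inside the sandbox; outside it, exec() on model output runs with your own privileges.

```python
from types import SimpleNamespace

# Hypothetical unit tests for a task asking the model to write add(a, b).
TEST_CODE = """
assert add(2, 3) == 5
assert add(-1, 1) == 0
"""


def passes_tests(generated_code: str, test_code: str) -> bool:
    """Run generated code, then the tests, in one fresh namespace."""
    namespace = {}
    try:
        exec(generated_code, namespace)  # defines the model's functions
        exec(test_code, namespace)       # raises AssertionError on failure
    except Exception:
        # Syntax errors, runtime errors, and failed assertions all count as a fail.
        return False
    return True


def evaluate(response):
    """Score one LLM response: True if its code passes every test."""
    return passes_tests(response.text, TEST_CODE)


if __name__ == "__main__":
    good = SimpleNamespace(text="def add(a, b):\n    return a + b\n")
    bad = SimpleNamespace(text="def add(a, b):\n    return a - b\n")
    results = [evaluate(good), evaluate(bad)]
    # With one sample per problem, pass@1 is the fraction of samples that pass.
    print(sum(results) / len(results))
```

This sketch does not guard against infinite loops in the generated code; a real benchmark setup would likely need some form of timeout as well.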
This discussion was created from the release v0.3: Claude 3, Sandboxed Python.