Conversation with gpt-4o (OpenAI Vision Model)

This example will show

In this example,

you can have a conversation with OpenAI vision models.
you can show gpt-4o with your drawings or web ui designs and look for its suggestions.
you can share your pictures with gpt-4o and ask for its comments,

Just input your image url (both local and web URLs are supported) and talk with gpt-4o.

Background

In May 13, 2024, OpenAI released their new model, gpt-4o, which is a large multimodal model that can process both text and multimodal data.

The following models are tested in this example. For other models, some modifications may be needed.

You need to satisfy the following requirements to run this example.

Install the latest version of AgentScope by

git clone https://github.com/modelscope/agentscope.git
cd agentscope
pip install -e .

First fill your OpenAI API key in conversation_with_gpt-4o.py, then execute the following command to run the conversation with gpt-4o.

python conversation_with_gpt-4o.py