# Sending image as a URL
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
  model="gpt-4o-mini",
  messages=[
    {
      "role": "user",
      "content": [
        {"type": "text", "text": "What’s in this image?"},
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg",
          },
        },
      ],
    }
  ],
  max_tokens=300,
)

print(response.choices[0].message.content)

Choice(finish_reason='stop', index=0, logprobs=None, message=ChatCompletionMessage(content='The image depicts a serene natural scene with a wooden boardwalk extending through a lush green field. The boardwalk creates a straight path and invites a walk through the tall grasses surrounding it. The field is abundant with greenery, possibly indicating spring or summer season. The sky is clear with a few scattered, wispy clouds, with the sunlight casting a warm glow on the landscape, enhancing the vivid colors of the flora. This could be a natural reserve, park, or a wetland area where boardwalks are commonly built to facilitate access without disturbing the natural environment.', role='assistant', function_call=None, tool_calls=None))

# Sending image as a base64 encoded string
from openai import OpenAI
import base64

client = OpenAI()

image_path = "data/cappadocia.jpeg"

# Function to encode the image
with open(image_path, "rb") as image_file:
    base64_image = base64.b64encode(image_file.read()).decode('utf-8')

response = client.chat.completions.create(
  model="gpt-4o-mini",
  messages=[
    {
      "role": "user",
      "content": [
        {"type": "text", "text": "What’s in this image?"},
        {
          "type": "image_url",
          "image_url": {
            "url": f"data:image/jpeg;base64,{base64_image}"
          },
        },
      ],
    }
  ],
  max_tokens=300,
)

print(response.choices[0].message.content)

This image shows a beautiful snowy landscape with unique geological formations. These formations, characterized by their rugged, rocky outcrops and peaks, suggest that this might be a region with a history of volcanic activity, often resulting in such stark and impressive natural features. The snow adds a contrasting layer to the otherwise dry and eroded rocks, highlighting the natural beauty of the place. There are no visible human figures in this photograph, keeping the focus on the natural environment. The clear blue sky suggests it is a sunny day, and the distant mountain in the background adds depth to the scene, underscoring the wild and expansive topography of the area. This kind of terrain is often found in regions known for their historic and geological significance.

# Multiple image inputs

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
  model="gpt-4o-mini",
  messages=[
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What are in these images? Is there any difference between them?",
        },
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg",
          },
        },
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg",
          },
        },
      ],
    }
  ],
  max_tokens=300,
)

print(response.choices[0].message.content)

The images you've provided seem to be identical. Both showcase a wooden boardwalk extending through a lush, green wetland or grassland. The sky is blue with some wispy clouds, and there is a variety of green vegetation on either side of the path. There doesn't appear to be any discernible difference between the two images; they seem to be two copies of the same photo.

from openai import OpenAI

client = OpenAI()

# High fidelity
response = client.chat.completions.create(
  model="gpt-4o-mini",
  messages=[
    {
      "role": "user",
      "content": [
        {"type": "text", "text": "What’s in this image?"},
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg",
            "detail": "high"
          },
        },
      ],
    }
  ],
  max_tokens=300,
)
print("*** High Fidelity ***")
print(response.choices[0].message.content)
print(f"Prompt Tokens: {response.usage.prompt_tokens}, Completion Tokens: {response.usage.completion_tokens}, Total Tokens: {response.usage.total_tokens}")

# Low fidelity
response = client.chat.completions.create(
  model="gpt-4o-mini",
  messages=[
    {
      "role": "user",
      "content": [
        {"type": "text", "text": "What’s in this image?"},
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg",
            "detail": "low"
          },
        },
      ],
    }
  ],
  max_tokens=300,
)

print("*** Low Fidelity ***")
print(response.choices[0].message.content)
print(f"Prompt Tokens: {response.usage.prompt_tokens}, Completion Tokens: {response.usage.completion_tokens}, Total Tokens: {response.usage.total_tokens}")

*** High Fidelity ***
The image shows a wooden boardwalk traversing through a lush green field with tall grass on either side. The sky is a beautiful blue with some scattered white clouds. It appears to be a sunny day, and the scene is tranquil, possibly a nature reserve or park where boardwalks are installed to allow people to enjoy the landscape without disturbing the natural environment. The image has a sense of depth, leading the observer's eye along the boardwalk towards the horizon.
Prompt Tokens: 1118, Completion Tokens: 93, Total Tokens: 1211
*** Low Fidelity ***
The image depicts a serene natural landscape. It features a wooden boardwalk or path that meanders through a lush green meadow filled with tall grass or reeds. The path invites one to walk through and enjoy the surrounding nature. The sky overhead is a bright blue with scattered white clouds, suggesting a pleasant day with good weather. The scene conveys a sense of tranquility and the beauty of a natural, untouched environment.
Prompt Tokens: 98, Completion Tokens: 85, Total Tokens: 183

Vision (Understanding Images)¶

Introduction¶

Low or high fidelity¶

Managing the images¶