Image Understanding with RAG Cookbook #1838

robtinn · 2025-05-13T09:07:29Z

Summary

This PR adds a new cookbook to the multimodal examples in this repo. The new OpenAI image understanding capabilities have unlocked new use cases for systems that analyse multimodal data, and many real-world datasets are multimodal. The cookbook covers creating a model that uses both text and image context to answer questions about a synthetic dataset, with evals to compare performance across different models.

Motivation

Why are these changes necessary? How do they improve the cookbook?

There has been a lot of recent interest in the new OpenAI image understanding capabilities, and this notebook explores how to use them as part of a RAG workflow.
In addition, many teams are still using the Chat Completions API even though a lot of new functionality is available in the Responses API. This cookbook focuses on the Responses API, in particular its File Search capabilities with image analysis context.
I am also helping organise content for a hackathon around customer service, and this notebook covers many of the use cases behind popular hackathon projects. I hope to share this cookbook as a starting template for the hackathon.
Finally, many of the existing Evals API examples are out of date or incomplete, so this cookbook also includes a simple end-to-end example of the Evals API.

TODO: add authors in authors.yaml

robtinn added 4 commits May 12, 2025 08:55

Draft image understanding cookbook

26b16ae

updated image understanding vector stores

d68a124

Image understanding tidy up prompts

d34d56c

Small comments to image_understanding notebook

419d442

robtinn requested a review from lspacagna-oai May 13, 2025 09:07

Image understanding updating authors

1a1f046

lspacagna-oai approved these changes May 17, 2025

Merge main and update registry

a05c098

robtinn merged commit 024c433 into main May 19, 2025
1 check passed

robtinn deleted the rtinn/image_understanding branch May 19, 2025 08:34
