Understand any image in detail with GPT-4o vision. Get rich descriptions, semantic tags, and contextual analysis from photos, screenshots, diagrams, and more โ in one API call.
The IteraTools Image Description API uses GPT-4o vision to produce detailed, contextual descriptions of any image. Pass a public image URL or base64-encoded image data, and receive a rich natural-language description plus semantic tags. Falls back to Claude 3 Haiku automatically if GPT-4o is unavailable. Perfect for AI agents that need to reason about visual content, build alt-text pipelines, classify images, or understand user-uploaded media.
| Param | Type | Required | Description |
|---|---|---|---|
| image | string | Yes | Public HTTPS URL or base64 data URL (data:image/jpeg;base64,...). Max 10MB. |
| prompt | string | No | Custom question or instruction. Default: "Describe this image in detail. List objects, people, text, colors, and context." |
Adds all 51 IteraTools tools to your MCP client โ including image_describe.
$0.008 per request. Pay with API credits (no subscription). Get your API key โ