OpenAI Launches GPT Image 1.5: Faster, Smarter Image Generation

OpenAI has launched GPT Image 1.5, the latest update to its image generation model. The company promises markedly better instruction following, more precise image editing, and a 4x increase in generation speed. The new model is available to all ChatGPT users and through the API, and it arrives at a critical juncture in the ongoing AI race.

The release of GPT Image 1.5 is seen as a direct response to recent advances from Google. After Google unveiled Gemini 3 and its image generator Nano Banana Pro, both of which have dominated AI leaderboards, OpenAI CEO Sam Altman reportedly issued an internal "code red." That urgency accelerated the development and release of GPT Image 1.5, which follows OpenAI's previous image model, GPT Image 1, launched in April. The company had previously signaled plans for a new image generator in January, but current market dynamics appear to have prompted an earlier rollout.

Enhanced Editing and Creative Control

GPT Image 1.5 arrives as AI image and video generation tools are moving from experimental prototypes to production-ready products. Mirroring capabilities found in Google's Nano Banana Pro, ChatGPT Images now offers advanced post-production features. Users can expect more granular editing controls that maintain visual consistency across multiple edits, including facial likeness, lighting, composition, and color tone. For iterative image creation, that is a significant step forward.

A common frustration with existing generative AI image tools is their tendency to reinterpret the entire image when only a specific change is requested, such as altering a facial expression or adjusting the lighting. The result is often a loss of consistency between versions. GPT Image 1.5 aims to solve this problem, so that users can iterate on a design and reach the desired outcome without losing elements they had already settled on.

A Revamped Creative Studio Experience

Beyond the core generation and editing enhancements, OpenAI is also revamping the user interface for ChatGPT Images. A dedicated entry point within the ChatGPT sidebar will now function more like a creative studio. Fidji Simo, OpenAI's CEO of applications, highlighted in a recent blog post that the new viewing and editing screens are designed to simplify the process of bringing visions to life. Users can more easily create images that align with their specific ideas, or draw inspiration from trending prompts and pre-set filters.

Visual Integration Across ChatGPT

OpenAI is not stopping at image generation; the company also wants to enrich the overall ChatGPT experience with more visual elements. The roadmap includes showing more visuals in response to search queries, with clear source attribution. This is expected to be particularly useful for practical tasks such as converting measurements or quickly checking sports scores. Simo emphasized that when visuals convey information more effectively than text, ChatGPT will use them. The stated goal is to bridge the gap between a user's imagination and their ability to realize it, and to make other tools readily accessible when a quick answer or the next step calls for one.
