ChatGPT's New Image Engine Handles Text, Complex Requests

OpenAI rolled out an upgraded image generation system for ChatGPT on Tuesday, claiming it represents a meaningful jump in text rendering and the ability to process more intricate creative instructions.

The new tool, called ChatGPT Images 2.0, supports multiple aspect ratios and arrives in two flavors: a standard version available to all users, and a "thinking" mode reserved for paid subscribers. The thinking mode incorporates built-in reasoning that helps the system work through more complex requests, though the tradeoff is longer wait times for image generation.

Developers will also gain access to the new models through an API, expanding the tool beyond the ChatGPT interface itself.

OpenAI product manager Adele Li told reporters the company expects the release to spark another round of viral image moments, much as Studio Ghibli-style outputs did when they took off across social media. "We believe that we are going to have another moment here," Li said.

But the company is pitching the upgrade as more than just a meme machine. OpenAI argues the model handles professional design work effectively, positioning it as a serious asset for creating advertisements, posters, and mockups. "It's not just a tool for making beautiful pictures," Li said. "It's a creative assistant."

The timing reflects a crowded AI image market where technical leads evaporate quickly. Google grabbed headlines last year with its Nano Banana model, while OpenAI itself scored viral wins with earlier releases. The competitive pressure to keep shipping improvements and capturing user attention shows no signs of slowing.

Author James Rodriguez: "The real story isn't whether OpenAI nails another viral moment, it's whether they can convince professionals that this tool is essential for their actual work."
