OpenAI Unveils Upgraded ChatGPT Images 2.0 with Enhanced Support for Non-Latin Scripts

OpenAI has launched its latest ChatGPT Images 2.0 model, marking a significant update to its image creation capabilities.

Just over a year since introducing direct image and design generation within the ChatGPT interface, OpenAI is rolling out ChatGPT Images 2.0. The company highlights this version as a major advancement in image generation technology, especially in adhering precisely to user prompts, producing intricate text elements, and accurately positioning and connecting items within compositions. For the first time, OpenAI has integrated reasoning functions into an image model, enabling features such as online searches and output validation. These enhancements, per OpenAI, result in a more dependable system particularly suited for applications requiring precision, uniformity, and seamless visual integration.

The firm reports substantial efforts to improve the model's comprehension and depiction of non-Latin scripts, achieving notable progress in managing languages like Japanese, Korean, Chinese, Hindi, and Bengali. Additionally, Images 2.0 excels at capturing the unique traits of various visual scripts more authentically. OpenAI notes that this boosts its applicability in areas such as prototyping video games and developing storyboards. Beyond these upgrades, the model offers greater adaptability in image proportions, supporting formats up to 3:1 in width or 1:3 in height. It can create visuals at resolutions reaching 2K and generate as many as eight variations simultaneously.

In a pre-release demonstration, the author tested Images 2.0 with an initial request for a pixel-art rendition of a tortoiseshell cat styled after the third-generation Pokémon games. This prompt served as a challenging evaluation since artificial intelligence often falters with pixelated graphics, and the Game Boy Advance-era Pokémon visuals are distinctly recognizable, leaving little room for vague interpretations. The resulting image demonstrated strong fidelity to the specified aesthetic. Next, the model was instructed to transform this into a transparent PNG file. For a final trial, it produced a four-panel manga sequence depicting the cat relaxing on a pleasant day beside a serene urban waterway.

Among these experiments, the transparent PNG conversion required the longest processing time, and the final version showed minor variations from the original output, somewhat straying from the exact instructions. Nevertheless, it successfully delivered a valid transparent format, a task that proves difficult for many competing image generators. As broader user feedback emerges, clearer insights will surface regarding its performance against rivals like Google’s Nano Banana 2 and potential areas for further refinement by OpenAI.

ChatGPT Images 2.0 is now accessible to every ChatGPT user, encompassing the Free and Go plans. Those on Plus and Pro levels benefit from superior generation options. OpenAI is also integrating the model into its API platform and the recently refreshed Codex development tool, which gained native image creation features just a week ago. This launch comes shortly after Anthropic entered the visual creation space with its dedicated design tool.