OpenAI's New Image Generator Has a Secret Price Floor. Here's the Test That Shows It.
On Tuesday, independent researcher Simon Willison published a simple experiment alongside GPT Image 2's launch. He asked the model to draw a raccoon holding a ham radio. At standard quality settings, the image came back with no visible raccoon. He switched to high-quality mode, paid roughly 40 cents per image, and the raccoon appeared. The gap between those two outputs is the real number in this release.
The benchmark number is 99 percent typography accuracy. It is real, and it comes from OpenAI's own benchmarks. But it applies to the highest-quality output mode, the one that costs roughly 15 times more than the base tier. Standard quality is the degraded tier. The raccoon did not appear there.
That price-quality cliff is the story behind the press release numbers. OpenAI is retiring DALL-E 2 and DALL-E 3 on May 12, according to documentation reviewed by AI industry tracker Apiyi. The consumer API that has sat behind Notion AI, Canva, and a range of products you have used without knowing DALL-E was underneath is going dark in three weeks. Developers who built on it face a migration deadline and a pricing structure that does not match the headline figure.
GPT Image 2 is genuinely faster and better at text rendering than its predecessor, by multiple independent accounts. The model ships in two versions, Instant and Thinking, with resolution up to 2K at standard output. A designer can now ask the model to render a product mockup with a specific tagline and get something that actually reads. That is a real capability jump from a year ago, when GPT Image 1 could not reliably produce legible text without inventing menu items like "enchuita" or "churiros," as TechCrunch reported. OpenAI says the model hits 99 percent accuracy on standard typography benchmarks and generates images nearly twice as fast as GPT Image 1, corroborated by Startup Fortune. These figures come from OpenAI's announcement; no independent benchmark verification has been published at standard quality settings.
The architecture behind it is undisclosed. OpenAI declined to say whether GPT Image 2 is autoregressive or diffusion-based, a gap that matters for how the model generalizes and whether its capabilities transfer to video generation, which the company has not addressed.
The pricing structure reveals what the benchmark obscures. OpenAI's official API pricing lists image output at $30 per million tokens. Per-image breakdowns show standard quality 1024x1024 images at $0.009 to $0.013 each. High-quality 4K mode runs roughly 40 cents per image through a separate output token billing mechanism. "Production-ready" has a price floor the headline number does not show.
The competitive landscape is not waiting. Midjourney has the creative community. Ideogram owns typographic design specifically. Imagen 4 leads on text rendering in some independent comparisons. Flux 2 leads on photorealism. GPT Image 2 is arriving into a market that has been filling the gaps OpenAI is now trying to close.
What the DALL-E retirement signals is a company that has decided the consumer API was a legacy experiment and is moving fully upstream into ChatGPT as the distribution layer. The Codex enterprise push announced alongside GPT Image 2 is consistent with this direction: OpenAI is selling platforms with ChatGPT as the front door, not individual developer tools.
For developers who built on DALL-E, the May 12 deadline is a forcing function on a decision they may not have been ready to make. For the rest of the market, the test that matters is the one nobody has published yet: independent benchmark verification of the 99 percent typography claim at standard settings, not high-output mode. That is where the raccoon should reappear, and that is where we will find out whether the headline number is a capability or a marketing tier.