ChatGPT’s Image Generator Is Free - Here’s What You Can (And Can’t) Do With It

On April 1,Sam Altman announced on Xthat ChatGPT’s new image generator will now be accessible to free users. It was no April Fool’s prank; the image generator, which was launched on March 25 for paid users and sucked hundreds of millions of usersinto a Ghibli-inspired AI image frenzythat melted OpenAI’s servers, is now available to everybody. The downside is that if you don’t have a paid OpenAI plan, you’re limited to only three image generations a day.

The initial launch at the end of March was fairly quiet, butOpenAI has always had big plans. After all, the ‘o’ in ChatGPT 4o stands for ‘omni,’ the Latin word for ‘every.’ It could originally handle text, speech, and video, and now 4o can add image generation to its capabilities. If you haven’t had the chance to use ChatGPT’s image generation yet (and you are not as horrified by the existence of AI art asStudio Ghibli’s Hayao Miyazaki was in 2016), you might want to give it a go. Although be warned, while many of the early outage issues have been resolved, the model is still having slowness and capacity issues, as everyone, it seems, is hard at work turning their pet pictures into Ponyo homages. Or, in my case, getting AI to create a collection of SlashGear-related images in the style of a children’s book.

Two AI generated images of a woman standing by a blackboard, produced to look like 1950s book illustrations. The one on the left is produced by DALLE-E and the one on the right is produced by ChatGPT 4.0. Only the right-hand image has successfully included the name of this article: “ChatGPT’s Image Generator Is Free - Here’s What You Can (And Can’t) Do With It”. The DALL-E image contains many of the words but there are duplications, missed words, and garbled letters.

How is ChatGPT different from other image generators?

So why was this new release such big news? There are already plenty of other AI image generators on the market. The list includes OpenAI’s own DALL-E, which hasn’t yet been switched off but is now being referred to as its “legacy image generation model.”

In a nutshell, ChatGPT’s new image generator is really, really impressive. OpenAI has taken massive strides towards making AI art that doesn’t look like AI art. Significantly, it can handle incorporating text into images, which is something that AI image generators have previously struggled with. It feels obvious in retrospect that text was going to be the next big step in AI art generation. ChatGPT has always been good with words; it just needed to marry those capabilities with image generation.

Screenshot of a tweet from Sam Altman dated 27 March that reads “it’s super fun seeing people love images in chatgpt.  but our GPUs are melting. we are going to temporarily introduce some rate limits while we work on making it more efficient. hopefully won’t be long! chatgpt free tier will get 3 generations per day soon.” Altman’s profile image is an AI-generated picture of himself in the style of Studio Ghibli.

Now, you can use ChatGPT to make storyboards, comics, and infographics. You can give it some text and ask it to incorporate it into a prompted image, like the picture on the right above. All the words are there, they’re spelled correctly, and contain real letters from the alphabet! I gave the same prompt to DALL-E to produce the image on the left, and while it understood the assignment, it couldn’t quite manage to get the words right.

It also has improved character consistency, so it can refer back to pictures it has previously created and draw them in different styles and situations, and it has a neat new transparency feature for creating PNGs you can turn into stickers. And it’s still being developed.Sam Altman tweetedon April 2, “Y’all are not ready for images v2…”

What are its limitations?

The only limitation of the free version compared to the paid one is the number of images you’re able to produce per day. The biggest annoyance at the moment for both the free and paid versions is how slow it is. When I tested it, it took about three minutes to produce each image, and that’s if it works at all. There have been numerous outages, and OpenAI temporarily put limits on users' requests because,as Altman said, “our GPUs are melting.”

Even when it’s all running smoothly, creating images in ChatGPT is more time-consuming than the feature-rich dashboards ofother image generators like Midjourneyor Leonardo. Those models also allow you to create multiple images at a time, whereas ChatGPT limits you to one image per request. There are also no automatic editing options. You need to ask ChatGPT to make changes. While OpenAI boasts about consistency in the new feature, I found that asking it to change and remove things from a previous image while keeping most of it the same is a bit hit-and-miss.

The biggest problem with ChatGPT’s image generator, however, isn’t its limitations. It’s how good it is. Everything we thought we knew abouthow to spot an AI image, like poorly drawn hands and nonsense text, has gone out the window. It’s too soon to say what the long-term effects of this AI advancement are going to be. Will there be even moreAI slop saturating social media channels? Are illustrators out of work now? What about the seriousenvironmental implications of AI-generated images? The last few weeks have shown that AI development isn’t slowing down, the art is getting better, and it’s going to get increasingly difficult to distinguish it from human-created stuff.