Tech

Midjourney version 5 improves almost every aspect of AI-generated images

[ad_1]

Getting better: With all the recent news revolving around ChatGPT and other large language models, it’s easy to forget that their cousins—AI image generators—are still improving. One may have figured out how to render eyes and hands without making the subject look like something from a nightmare. however, the results still creep some people out.

Earlier this week, research lab Midjourney released a beta for version 5 of its self-named AI-imaging software. According to its announcement via Twitter, the latest version adds higher picture quality, more “diverse” results, a more expansive range of styles, seamless textures, and much more.

Users have already posted hundreds of stunning results, and feelings about the improvements are mixed. Most are impressed because imaging AI has struggled to produce aspects like shadows, reflections, eyes, and hands. Below is an image we created with OpenAI’s Dall-E as an example of where the machine has trouble.

The composition is somewhat off, and the general feel is cartoonish. The lighting is all wrong. The eyes and hands are badly deformed. The legs are fouled with artifacts, as are the popcorn container and the seat next to the subject. This result is one of four with similar problems to varying degrees.

Version 5 of Midjourney seems to have improved in this respect, at least from the examples others have shared. The results from simple prompts border on the uncanny valley—realistic enough to pass as professional photos in many cases, but still with that odd quality you can’t quite place. While incredibly realistic, many have described the images as creepy.

Our own Kishalaya Kundu said, “I’m more afraid than impressed, to be honest,” after viewing a series of nearly flawless Midjourney V5 photos. The fear being that one could fairly easily create a fake image and pass it off as genuine.

Creep factor aside, compared to V4, Midjourney V5 has dramatically improved quality. Graphic designer Julie Wieland has used Midjourney V4 (released last November) for some time and says that version 5 has “incredibly realistic” skin textures. The lighting effects are also much better, including reflections, glare, and shadows. Perhaps most importantly, the AI generates hands and eyes that appear natural most of the time.

“Eyes are almost perfect and not wonky anymore,” Wieland told Ars Technica. “Hands are correct most of the time, with five fingers instead of 7-10 on one hand. MJ v5 currently feels to me like finally getting glasses after ignoring bad eyesight for a little bit too long. Suddenly you see everything in 4k; it feels weirdly overwhelming but also amazing.”

Midjourney also improved the native resolution from 512x512px to 1024x1024px. The increase aligns it with Dall-E. However, Version 4 could supersample to double the native resolution. It’s not unreasonable to expect V5 to use the same technique to produce 2048×2048 images, but that is for an update further down the road.

The bottom line is MidJourney only hit the AI scene one year ago. Many (not all) of these images flooding Twitter feeds this week are untouched. Previously, Weiland used a combination of techniques to improve Midjourney 4’s visual quality, including “outpainting” with Dall-E and touchups in Photoshop. Version 5 promises less post-generation editing and perhaps photo-perfect images sooner than we can imagine. This prospect is indeed both exciting and frightening.



[ad_2]

Source link