
Create AI images 30 times faster: Dall-E 3 and Stable Diffusion left behind

30 times faster, but also good: DMD. (Image: github/tianweiy)
A team at MIT has shortened the multi-stage process behind well-known AI image generators. This not only cuts the time it takes to produce a finished image; the required computing power and energy consumption drop at the same rate.

The magic of Dall-E or Stable Diffusion should be familiar by now: from a brief description of the scene, its content and perhaps one or two stylistic notes, a more or less realistic picture emerges. Fortunately, the result can usually still be recognized as AI-generated, but it serves its purpose: nobody has to actually put a dog on a surfboard or a fox in an astronaut suit, because the desired image is just a few clicks away.

Behind the scenes, however, this is a computationally intensive process: the algorithm runs through numerous iterations before it finally arrives at the desired image. Researchers at MIT have now succeeded in dispensing with these intermediate steps; instead, the described scene is generated in exactly one step.
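To illustrate the difference, here is a minimal, purely illustrative Python/PyTorch sketch (not the MIT code): a conventional diffusion sampler calls its network dozens of times, while a distilled one-step generator calls it exactly once. DummyDenoiser and both sampling functions are placeholders for a real text-to-image model such as Stable Diffusion's denoising network.

```python
# Illustrative sketch only - not the published DMD implementation.
import torch
import torch.nn as nn

class DummyDenoiser(nn.Module):
    """Stand-in network that predicts a slightly 'cleaner' image."""
    def __init__(self, channels=3):
        super().__init__()
        self.net = nn.Conv2d(channels, channels, kernel_size=3, padding=1)

    def forward(self, x, t):
        return self.net(x)

def sample_iteratively(model, steps=50, size=(1, 3, 64, 64)):
    """Classic diffusion inference: start from noise, denoise many times."""
    x = torch.randn(size)
    for t in reversed(range(steps)):   # e.g. 50 network evaluations
        x = x - 0.02 * model(x, t)
    return x

def sample_one_step(model, size=(1, 3, 64, 64)):
    """Distilled inference in the spirit of DMD: a single network call."""
    x = torch.randn(size)
    return model(x, 0)                 # one evaluation instead of dozens

model = DummyDenoiser()
img_slow = sample_iteratively(model)   # cost grows with the number of steps
img_fast = sample_one_step(model)      # roughly constant, much cheaper
```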

The method, called "Distribution Matching Distillation" (DMD), achieves a comparable result while significantly reducing the required computing power and waiting time, and it consumes correspondingly less energy.

To put it more figuratively: the images used for training are broken down into coarser regions, which determines the approximate composition for a given subject. In addition, the probability of various image elements is analyzed so that a coherent scene emerges at the end.
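The actual DMD training is more involved (it matches the distributions of real and generated images rather than individual outputs), but a heavily simplified sketch of the underlying distillation idea, reusing the dummy network from above, might look like the following. All names here are illustrative assumptions, not code from the paper.

```python
# Heavily simplified distillation sketch - an assumption for illustration,
# not the MIT team's training code. A one-step "student" learns to reproduce
# what the slow, multi-step "teacher" produces from the same starting noise.
teacher = DummyDenoiser()   # pretrained multi-step model (kept frozen)
student = DummyDenoiser()   # fast one-step generator (being trained)
optimizer = torch.optim.Adam(student.parameters(), lr=1e-4)

for iteration in range(1000):
    noise = torch.randn(8, 3, 64, 64)      # shared starting noise

    # Teacher: full iterative denoising, no gradients needed.
    x = noise.clone()
    with torch.no_grad():
        for t in reversed(range(50)):
            x = x - 0.02 * teacher(x, t)
    target = x

    # Student: answers in a single step, then is nudged toward the teacher.
    prediction = student(noise, 0)
    loss = nn.functional.mse_loss(prediction, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```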

Ultimately, detail and complexity are reduced, so the image generator simply becomes faster: instead of 2 to 3 seconds per image, the same hardware needs around 100 milliseconds, roughly a thirtieth of the time.
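The quoted numbers line up with the headline figure:

```python
# Rough arithmetic behind the "about 30 times faster" claim in the article.
seconds_per_image_before = 3.0   # ~2-3 s per image with the iterative model
seconds_per_image_after = 0.1    # ~100 ms per image with DMD
print(seconds_per_image_before / seconds_per_image_after)  # -> 30.0
```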

If you look closely at the images, the reduction in detail is clearly visible: backgrounds are slightly blurred, and image elements can repeat. In individual cases the conventionally generated motifs still look noticeably better, and the DMD results are even easier to recognize as the work of an artificial intelligence, or at least as artificial. As noted above, that can be counted as another positive effect.

In addition to the fox astronaut, many other examples from the DMD model can be found on the project page (tianweiy.github.io).

Noticeably fewer details on the right do not make the image any less convincing. (Screenshot: tianweiy.github.io)

Mario Petzold, 2024-03-25 (Update: 2024-03-25)