GLIGEN, the Tool to Rule All AI Image Generators

Author(s): Ignacio de Gregorio

Originally published on Towards AI.

The Grounding Problem Solved?

The tool all artists have been waiting for is here.

GLIGEN is a new framework from Microsoft, the University of Wisconsin-Madison, and Columbia University that gives you far more control over image generation: besides describing what you want in text, the current standard, you can draw bounding boxes and provide other visual cues.

And the best thing of all is that it is designed to be reusable on any open-source image generator, making it clear that if 2023 was the year of AI promise, 2024 is the year of real value.

What’s more, you can try it today.

Anyone who has fiddled with AI image generators like Stable Diffusion, DALL-E, or Midjourney has quickly run into the same wall:

Lack of control.

In other words, because you can only state what you want through text, there’s a limit to…

Read the full blog for free on Medium.

