Random Generative AI Art for Endless Creativity
Last Updated on November 3, 2024 by Editorial Team
Author(s): Nicolas MARTIN
Originally published on Towards AI.
One of the greatest features of many Artificial Intelligence algorithms is its randomness. This randomness gives a more natural approach to answering different needs and can be applied to a new layer: creating random prompts for limitless creativity. I will present different ways to benefit from this randomness, from a specific field or set of words to full random generations.
Foucaultβs Pendulum
I was always fascinated by Foucaultβs Pendulum, a book by Umberto Ecco. This book has extraordinary intellectual and inspiring stories that make readers think about many ideas. One of the most interesting ones I remember is when the main protagonists try to create a story from scratch and lack inspiration initially. The solution was simple but brilliant: write random words on small pieces of paper, mix them in a bag, and select a few of them randomly to make a sentence. Here is where all the magic happens! The brain tries to create meaning between words, explore new possibilities, and hence create innovative ideas. They repeat the same process several times, creating different innovative scenarios thanks to the magic of randomness. Eventually, they pick their best one and create the scenario for a great story.
We will use this process to generate random art and explore different solutions to cover all possibilities of the randomness advantage.
Do we need to know programming or any technical skills to make random generative art? Not at all! Some basic knowledge of prompting is necessary. Here are five different solutionsβ¦
1. Experience-based
Internet is full of valid and complete prompts to create stunning images, why not use them with AI? Therefore, we need to use Microsoftβs Co-Pilot or Googleβs Perplexity because they get their knowledge directly from the Internet.
For instance: βGet 5 great prompts for image generation from different sources on the Internet.β
Here is the result I get:
- A futuristic cityscape at sunset: Imagine a sprawling metropolis with towering skyscrapers, flying cars, and neon lights reflecting off glass buildings. The sky is painted in hues of orange and pink as the sun sets behind the horizon.
- A serene forest with mythical creatures: Picture a dense forest with tall, ancient trees. Among the foliage, you can see unicorns grazing, fairies fluttering around, and a gentle stream flowing through the scene, illuminated by soft, magical light.
- A steampunk-inspired airship: Visualize a massive airship floating above a Victorian-era city. The ship is adorned with brass gears, steam engines, and intricate metalwork. Below, the city is bustling with people in period attire, and the sky is filled with other airships and hot air balloons.
- A cozy, rustic cabin in the mountains: Envision a small wooden cabin nestled in a snowy mountain landscape. Smoke rises from the chimney, and warm light glows from the windows. Surrounding the cabin are tall pine trees, and in the distance, you can see the majestic peaks of the mountains.
- A bustling alien marketplace: Imagine a vibrant market on a distant planet, filled with various alien species. Stalls are overflowing with exotic goods, strange fruits, and futuristic gadgets. The atmosphere is lively, with colorful banners and lights strung overhead, and the sky is a deep shade of purple with multiple moons visible.
If I take any of those prompts, the result is already great. We can generate them with Co-Pilot, Meta.ai, Leonardo, or Xβs Grok.
For example, I try the first prompt using Grok (based on Flux):
But we want something more creative using randomness.
To do that, we ask the chatbot: βCreate 3 random prompts based on the previous ones.β Once we have the result, we copy/paste it in the next query to see the result.
I get the following renderings:
Those examples might not be as beautiful as some non-random ones but they are much more creative:
- An underwater world with trees and licorn creatures doesnβt exist.
- A castle with an external library is also something new.
- A floating alien marketplace in a purple world is not something that everyone could imagine.
Those are interesting examples as creative and successful prompts inspire them.
We can go a step further using a similar approach for a specific themeβ¦
2. Theme-based
Some projects or challenges require a high creativity with a specific theme like trains or blue or robots.
Getting creativity with randomness is possible by applying the previous solution, just adding the desired theme.
First, ask: βGenerate 5 very different prompts about trains to generate images.β
Here is the result:
Here are five distinct prompts about trains that can be used to generate different kinds of images:
1. A futuristic high-speed bullet train traveling through a neon-lit city at night. The trainβs sleek silver body reflects the vibrant lights of the towering skyscrapers, with glowing billboards and flying cars visible in the background. The scene is fast-paced, emphasizing advanced technology and a utopian atmosphere.
2. An old steam locomotive crossing a rickety wooden bridge over a misty, deep canyon at dawn. The trainβs thick black smoke swirls into the early morning sky as the sunβs rays begin to pierce through the fog. The setting is a lush, green mountainous region, giving the scene a sense of adventure and historical nostalgia.
3. A luxurious vintage train interior from the early 20th century, with polished wood paneling, velvet upholstery, and golden chandeliers. The atmosphere is opulent, with passengers dressed in elegant period clothing, sipping tea, and enjoying the grandeur of travel in a bygone era.
4. A post-apocalyptic train station, where a rusty, decaying train sits on the tracks. The sky is overcast, and the environment is filled with abandoned buildings, overgrown vegetation, and makeshift shelters. Survivors are seen scavenging supplies around the station, hinting at the collapse of society, with a somber, dystopian tone.
5. A magical train flying through the sky, above the clouds, surrounded by stars and a glowing moon. The train has an ethereal, steampunk design with gears and glowing runes. Wisps of light trail behind the train as it glides through the sky, creating a whimsical, dreamlike atmosphere full of wonder and fantasy.
Each previous prompt creates beautiful pictures, but we want something more challenging to imagine.
So, we ask: βCreate 3 prompts with random words from the previous prompts.β
And here are the results with each prompt below:
As we can see, the results are creative, but even if they have a random factor, they are based on classical prompts.
What if we bring this randomness to the next level?
3. Parameter-based
In general, image prompting follows more or less the same structure:
- The main focus. It could be a person, an animal, an object, a landscape, a concept. It should reflect what is on the center of the picture.
- The details. Those are extra, but information about the main focus. Does it has an action? Is there some specific things on it? What is its color?
- The background or the environment. Every picture can be created with only one main focus, but the AI will suggest any environment. Thatβs why it is important to specify what background or environment we want. I could be a forest, a city, a couch shop, etc.
- The style. It is the form or the tone of our picture. Do I want a traditional, an Asian, or a cyberpunk style?
- The emotion. Many options are possible: Dramatic, happy, sad, chill, etc.
- The camera features like the focus mode, or the type of camera (any known model).
- The camera angle. Low angle, high angle, wide angle, etc.
- The camera shot. Full shot, close-up, half-body shot.
If I ask ChatGPT or Grok a list for each parameter βcreate a list of 6 different examples for each feature about image promptingβ, here is the result:
Main Focus:
- A wizard casting a spell β Emphasizes a character in action.
- A vintage car β Focuses on an object with historical significance.
- A coral reef β Highlights a natural underwater landscape.
- A futuristic drone β Centers on a piece of technology.
- A philosophical debate β Represents an abstract concept or activity.
- A phoenix rising from ashes β Symbolizes rebirth or a mythical creature.
Details:
- The wizard has a glowing blue staff β Adds color and magical elements.
- The vintage car is rusted but with shiny chrome bumpers β Specific textures and conditions.
- The coral reef is teeming with neon-colored fish β Adds vibrancy and life.
- The drone has four rotors and a camera with a red blinking light β Details on the droneβs design.
- The debate takes place in an ancient Greek forum β Setting the scene for the concept.
- The phoenix is red and gold, with flames that donβt burn the surrounding flora β Specific colors and magical properties.
Background or Environment:
- In a misty forest at dawn β Sets a natural, serene environment.
- On a bustling city street in the 1950s β Places the car in a specific time period.
- In the deep ocean with faint light from above β Creates depth and a sense of isolation.
- Hovering above a smart city skyline at night β Suggests a futuristic setting.
- Under the shade of an old oak tree in a sunlit field β Peaceful, rural environment.
- Against a backdrop of a volcanic eruption β Adds drama and danger.
Style:
- Oil painting β Traditional art style.
- Manga β Japanese comic style for dramatic expressions.
- Steampunk β Combines Victorian era and mechanical elements.
- Watercolor β Soft, flowing, and translucent effects.
- Pixel art β Retro video game aesthetic.
- Art Deco β Geometric shapes, bold outlines, 1920s flair.
Emotion:
- Mysterious β The wizardβs spell casting feels like uncovering ancient secrets.
- Nostalgic β The car evokes a sense of past eras.
- Serene β The coral reef suggests tranquility and beauty.
- Innovative β The drone imparts a feeling of technological advancement.
- Intense β The debate conveys intellectual rigor and passion.
- Awe-inspiring β The phoenixβs rebirth is majestic and inspiring.
Camera Features:
- Macro lens for intricate details of the wizardβs staff.
- Vintage film camera for the car to enhance the old-time feel.
- Wide-angle lens to capture the expanse of the coral reef.
- High-speed camera to catch the drone in motion without blur.
- Tilt-shift photography for the debate to focus on the philosophers.
- Long exposure to capture the movement of the flames and the phoenix.
Camera Angle:
- Eye-level to engage directly with the wizard.
- Low angle looking up at the vintage car to give it grandeur.
- Top-down view to show the layout of the coral reef ecosystem.
- Birdβs eye view for the drone, emphasizing its surveillance capability.
- Side angle to capture the dynamic of the debate.
- From below to emphasize the phoenixβs ascension.
Camera Shot:
- Close-up on the wizardβs face showing concentration.
- Full shot of the car to see it in entirety with the environment.
- Extreme wide shot to show the vastness of the ocean around the reef.
- Medium shot of the drone to see it in relation to its surroundings.
- Two-shot of the philosophers engaged in discussion.
- Over-the-shoulder shot capturing the phoenix from behind as it rises, showing its perspective.
Although we can extend the list of main focus objects, we will use this list to create random prompts thanks to the query: βNow, pick single random examples of each feature to generate 3 random prompts.β
Here are the results we get:
4. Semi-Random
After several trials, I wanted to get random suggestions about things thanks to ChatGPT, Claude, or Grok. Unfortunately, some words came back, such as βserenadeβ, donβt ask me why. Consequently, current chatbots cannot suggest total random words yet, but we can try alternative solutions to reach this objective.
One of the best one, is to ask to an available service such as the Random Word Generator, get a dozen random words, and ask a chatbot this:
βGenerate a prompt with 5 random words in this list: gainful puzzled morning kindhearted pin influence zonked ground time real wretched identifyβ
Here are the results, repeating the process 3 times:
I find the results interesting, and the results are very different if we ask the same prompt again. This is probably due to the fuzzy view that those prompts suggest.
This is not completely random, as the chatbot selects words to extract some meaning, but this is a good approach for creativity. You can reduce the suggested words or increase them, but the fewer choices the AI has, the more effort it will make to create meaning.
5. Crazy Prompts
Now, letβs explore the most creative side of AI using crazy prompts.
To achieve that, we create 5 random words randomly with the Random Word Generator, and ask the chatbot: βGenerate a crazy image prompt with those key words: historical fire illegal petite steerβ, then apply the same logic twice. Here are the results:
As we can see, the results are even more creative than before. This is due to the word βcrazyβ in the prompt to let the chatbot create anything without common sense and to limit the request to 5 random words. Of course, we can use more random words or mix Crazy Prompts with Semi-Random or Parameter-Based solutions (e.g., a crazy prompt with a random camera angle). Last tip: This applies to music, stories, videos, etc. The sky is the limit!
Here are random AI-generated pictures brought to life thanks to Hailuo AI:
Conclusion
Randomness is one of the main functions used in Artificial Intelligence that allows more creativity within acceptable limits, including in image generation. That explains why AI-generated images are well-balanced and always different with the same prompt. However, we can go a step further in randomness by creating random prompts thanks to chatbots. Those new prompts can be adjusted to be more or less random, allowing for highly creative outputs.
Do you want other random generative AI art examples? Check my account on X.
Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming aΒ sponsor.
Published via Towards AI