Google Whisk is a new way to create AI visuals using image prompts – here's how to try it

A promotional image for Google Whisk, an experimental AI image generator
(Image credit: Google)
  • Google Whisk uses images as inputs instead of text-based prompts
  • It's built on Google’s Imagen 3 generative AI model
  • The experimental tool is free to try for users in the US

Google’s new AI tool makes it easier to create and remix your visual concepts. Instead of asking you to describe what’s in your mind’s eye, Whisk lets you input three image prompts: one for subject, one for scene and one for style. Whisk takes care of the rest, making it a more intuitive way to experiment with different ideas.

While most of the best AI image generators require you to write a detailed prompt, Whisk handles that behind the scenes. When you drop pictures into the web-based Whisk interface as inspiration, Google’s Gemini model automatically analyzes them and writes a detailed caption for each. These captions are then fed into the Imagen 3 model to create a matching image.

For example, you could drop in an image of a car as the subject and a photo of a rural landscape for the scene. You could then add a watercolor as the style to see what Whisk creates. Hit the button and you’ll get a pair of images based on your inputs.
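
For the curious, the flow described above (three images captioned, the captions combined into one prompt, the prompt handed to an image model) can be sketched roughly in Python. This is only an illustration under stated assumptions: Google has not published how Whisk merges the captions, the names `WhiskInputs`, `caption_image` and `build_prompt` are hypothetical, and the captions are canned strings rather than output from Gemini. No real Gemini or Imagen API is called.

```python
from dataclasses import dataclass


@dataclass
class WhiskInputs:
    """Captions for the three image slots: subject, scene and style."""
    subject: str
    scene: str
    style: str


def caption_image(image_path: str) -> str:
    """Hypothetical stand-in for the captioning step.

    In Whisk, Gemini writes the caption. Here we look up a canned
    string so the sketch runs without any network access.
    """
    canned = {
        "car.jpg": "a red vintage car",
        "landscape.jpg": "a rural landscape with rolling hills",
        "watercolor.jpg": "a loose watercolor painting",
    }
    return canned[image_path]


def build_prompt(inputs: WhiskInputs, extra_details: str = "") -> str:
    """Combine the three captions into one text prompt.

    The template is an assumption made for illustration, not
    Google's actual prompt format.
    """
    prompt = (
        f"{inputs.subject}, set in {inputs.scene}, "
        f"rendered in the style of {inputs.style}"
    )
    if extra_details:
        # Optional text tweaks, like the ones the interface lets you add.
        prompt += f". {extra_details}"
    return prompt


inputs = WhiskInputs(
    subject=caption_image("car.jpg"),
    scene=caption_image("landscape.jpg"),
    style=caption_image("watercolor.jpg"),
)
prompt = build_prompt(inputs)
tweaked = build_prompt(inputs, "Golden hour lighting")
print(prompt)
print(tweaked)
```

In a real pipeline, the final string would be what gets passed to the image model. Because the image model only ever sees text, any detail the caption leaves out cannot make it into the result.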

From here, it’s easy to remix the images. The interface allows you to specify additional text-based details to tweak the outcomes. You can also easily drop in different source images or roll the dice if you’re in need of inspiration. New results appear in pairs in the feed, making it an intuitive way to ideate. You can also choose to refine images by revealing the text prompt and adding more details.

Whisk it up

Video: Introducing Whisk: Prompt Less, Play More | Google Labs (YouTube)

While Whisk is designed to eliminate the need for text-based prompts, Google includes the option to refine the written prompts because results won’t always match up to the source material.

In a blog post about the experimental tool, Google explains that Whisk “captures your subject’s essence, not an exact replica.” It’s only as effective as Gemini’s analysis of the images you submit. While that analysis is generally very impressive, it can’t get inside your mind: you might expect Whisk to pull out one detail from an image, only for it to focus on another.

The post explains further: “Since Whisk extracts only a few key characteristics from your image, it might generate images that differ from your expectations. For example, the generated subject might have a different height, weight, hairstyle or skin tone. We understand these features may be crucial for your project and Whisk may miss the mark, so we let you view and edit the underlying prompts at any time.”

Even with these shortcomings, Whisk is an interesting application of Google’s existing AI tools. The underlying generative models are the same as if you were chatting with Gemini via its text interface. By relying on image inputs, though, Whisk is a more accessible and intuitive way for visual creators to play with their ideas.

Based on early feedback from digital creatives, Google refers to Whisk as “a new type of creative tool” which is intended for “rapid visual exploration, not pixel-perfect edits.”

How to try Google Whisk

Google Whisk is currently only available to users in the US. If you’re based there, you can try it out via your web browser at labs.google/whisk.

The experimental tool is completely free to play with. Data from your experience with Whisk will be fed back to Google to help refine and develop future AI products.

Christian Rowlands
TechRadar contributor

Formerly News Editor at Stuff, Chris now writes about tech from his tropical office. Sidetracked by sustainable stuff, he’s also keen on cameras, classic cars and any gear that gets better with age.
