ChatGPT can now look at pictures and tell you a bedtime story in five different voices

Robot talking on the phone
(Image credit: Bas Nastassia via Shutterstock)

ChatGPT can now hear, see and speak, opening up a whole new world of possibilities for how we interact with AI chatbots. The new capabilities unlock the ability to have a voice conversation with ChatGPT, or physically show the bot what you’re talking about. 

According to the official OpenAI blog post, you’ll soon be able to show the bot pictures of a landmark while on holiday and have a conversation about the history behind the structure. You could also send the bot a photo of your fridge contents and have it whip up a potential recipe.  

The new features will be rolling out to ChatGPT Plus and Enterprise users first over the next few weeks. Voice is coming to iOS and Android apps, and images will be available across platforms. As with most ChatGPT features, users who aren’t subscribed to the Plus platform will likely see the features a little later. 

ChatGPT talks back

The blog post notes that you’ll now be able to engage in back-and-forth conversations with your AI assistant on the go via the phone app. From what we can tell it would be a similar experience to how you’d speak to Siri or Amazon Alexa

The video example on the blog post (below) shows off a stylish user interface with a voice asking ChatGPT to tell a bedtime story, with the user interrupting every so often to ask questions.

Regardless of how you might feel about the technology it’s still very impressive. We’ll have to wait to see if real conversations match up with the seamless example in the video, but if they do, Siri and Amazon Alexa have a lot to be worried about. If I can access a talkative, intelligent chatbot like ChatGPT, which looks at pictures and can go into depth about topics without pause, why would I ever use any other virtual assistants? 

If you’re a Plus subscriber, head over to Settings, click ‘New Features’ on the mobile app and opt into voice conversations. You’ll be able to choose your favorite voice out of five different options: Sky, Cove, Ember, Breeze and Juniper, and you can listen to each one over on the official site.

Sight for sore eyes

ChatGPT can also now look at more than one image as well. You can show graphs that need analyzing, get help with homework or just show a rough draft of work you’d like feedback on, but can’t be bothered to type out. 

If you want it to focus on something specific in the photo, you can use the new drawing tool within the ChatGPT app and circle exactly what you want the bot to concentrate on. 

While this is scarily impressive for a generative AI chatbot, there are concerns that immediately spring to mind upon hearing about the new features.

OpenAI does acknowledge these concerns at the bottom of the announcement, stating that with new features come new challenges, including hallucinations - basically an incorrect response given by an AI bot but delivered with confidence - and the possibility of the voice capabilities that impersonate public figures or commit fraud. 

In order to combat this, OpenAI states that Voice Chat was created with real voice actors, and the image input feature was tested with rosh domains in extremism and scientific proficiency, to “align key features for responsible usage”.  

We’re so incredibly buzzed to try out the new features, especially the ability to chat directly to ChatGPT and probe its mind. We’re also keen to see how this will ripple down to other products like Bing AI, Google Bard and even Meta’s budding AI project. As ChatGPT is an AI trailblazer, introducing new features like this will mean everyone else will have to catch up.

You might also like...

TOPICS
Muskaan Saxena
Computing Staff Writer

Muskaan is TechRadar’s UK-based Computing writer. She has always been a passionate writer and has had her creative work published in several literary journals and magazines. Her debut into the writing world was a poem published in The Times of Zambia, on the subject of sunflowers and the insignificance of human existence in comparison. Growing up in Zambia, Muskaan was fascinated with technology, especially computers, and she's joined TechRadar to write about the latest GPUs, laptops and recently anything AI related. If you've got questions, moral concerns or just an interest in anything ChatGPT or general AI, you're in the right place. Muskaan also somehow managed to install a game on her work MacBook's Touch Bar, without the IT department finding out (yet).

Read more
ChatGPT WhatsApp
ChatGPT on WhatsApp can now see, hear, and remember your conversations from elsewhere
Using Advanced Voice Mode in ChatGPT to learn how to make coffee with a French Press.
ChatGPT adds eyes to its voice with new screen and video sharing feature
Using ChatGPT for desktop on a Mac with XCode.
ChatGPT's Mac app gets a glowup with new coding and notetaking features
Woman with multiple personalities
ChatGPT is getting powerful new custom personalities – and they could change how you use the AI chatbot
ChatGPT Advanced Voice mode on a smartphone.
OpenAI is rolling out exciting new features for all ChatGPT users, and I can't wait
ChatGPT on a phone
What is ChatGPT: everything you should know about the AI chatbot
Latest in Artificial Intelligence
Deep Resarch
I test AI agents for a living and these are the 5 reasons you should let tools like ChatGPT Deep Research get things done for you
ChatGPT vs. Manus
I compared Manus AI to ChatGPT – now I understand why everyone is calling it the next DeepSeek
Two business men playing chess in the office.
It turns out ChatGPT o1 and DeepSeek-R1 cheat at chess if they’re losing, which makes me wonder if I should I should trust AI with anything
Google Gemini Calendar
Gemini is coming to Google Calendar, here’s how it will work and how to try it now
Netflix
Netflix tried to fix 80s sitcom A Different World with AI but it gave us a different nightmare
Pictory
What is Pictory: Everything we know about this business-focussed AI video generator
Latest in News
Vision Pro Metallica
Apple Vision Pro goes off to never never land with Metallica concert footage
Mufasa is joined by another lion, a monkey and a bird in this promotional image
Mufasa: The Lion King prowls onto Disney+ as it finally gets a streaming release date
An American flag flying outside the US Capitol building against a blue sky
Sean Plankey selected as CISA director by President Trump
An Nvidia GeForce RTX 4060 on a table with its retail packaging
Nvidia RTX 5060 GPU spotted in Acer gaming PC, suggesting rumors of imminent launch are correct – and that it’ll run with only 8GB of video RAM
Indiana Jones talking to a friend in a university setting with a jaunty smile on his face
New leak claims Indiana Jones and the Great Circle PS5 release will come in April
A close up of the limited edition vinyl turntable wrist watch from AndoAndoAndo
This limited-edition timepiece turns the iconic Technics SL-1200 turntable into a watch, and I want one