Microsoft's latest breakthrough can understand speech as well as a human

Microsoft announced today that it cracked the code on voice recognition, claiming its new system can recognize spoken words as accurately as if you'd heard it yourself.

Engineers at Microsoft Artificial Intelligence and Research report that their automated voice recognition system has reached human parity - meaning it only makes as many mistakes recognizing words in a normal conversation as the average human being.

In tests, the system only reached a word error rate of 5.9%, a figure the research team claims is on par, if not better, than professional transcript writers - making for a historic first in the development of artificial intelligence.

“Even five years ago, I wouldn’t have thought we could have achieved this. I just wouldn’t have thought it would be possible,” said Harry Shum, executive vice president of Microsoft Artificial Intelligence and Research.

Can you read me, Cortana?

History-making aside, Microsoft knows the everyday potential of this new tech, noting the breakthrough's "broad implications for consumer and business products." 

One such example was the Xbox One which can already use voice commands to turn itself on, launch games, and more. 

Shum also specifically mentions the company's Siri-like, Cortana, saying that human-like voice recognition would make "a truly intelligent assistant" possible.

We've got to go Deeper

Google's DeepMind project also made major strides in artificial intelligence this year with its AlphaGo robot, which beat some of humanity's greatest go champions - a game oft considered impossible for robots to grasp due to its near-infinite strategic possibilities.

Last month, DeepMind engineers also developed a new way to make robotic speech inflect like a human's, using a system of waveforms called WaveNet to make robots sound more human.

On the smaller end of the scale, Google is also working to cut down on the Uncanny Valley aspects of its Google Assistant, with researchers - and also some reported comedy writers - working to give its consumer AI a less monotone voice and even a sense of humor. 

The robo-uprising is imminent, but not immediate 

While historic, Microsoft's voice recognition system is still prone to the same things that trip up humans - things like accents, vocal impairments, and distracting background noise.

Microsoft also adds that voice recognition is not the same as understanding speech. Transcribing a conversation into words and distilling them into meaning are two different things, though Microsoft's next goal will be to improve its system so that it not only picks up speech, but can get it.

"Transcription is different from understanding; understanding is a different story,"  told Xuedong Huang, Microsoft's Chief Speech Scientist to us earlier this year. "To understand the message, the subtlety of what's being said – that's a long way off." 

Until then, we're plenty excited knowing that soon we won't have to talk, to, computers, like, this, any, more.

TOPICS
Parker Wilhelm
Parker Wilhelm is a freelance writer for TechRadar. He likes to tinker in Photoshop and talk people's ears off about Persona 4.
Latest in Tech
Apple iPhone 16e
Which affordable phone wins the mid-range race: the iPhone 16e, Nothing 3a, or Samsung Galaxy A56? Our latest podcast tells all
The Apple MacBook Air next to the Dyson Supersonic R and new AMD GPU
ICYMI: the week's 7 biggest tech stories from the best tech at MWC to Apple's new iPads and MacBooks
A triptych image featuring the Bose Solo Soundbar 2, Nothing Phone 3a Pro and the Panasonic Lumix S1R II.
5 trailblazing tech reviews of the week: Nothing's stylish, affordable flagship and why you should buy AMD's new graphics card over Nvidia's
The best tech of MWC 2025 examples, including the Nothing Phone 3a Pro, the Nubia Flip 2, and the Lenovo Solar PC
Best of MWC 2025: the 10 top tech launches we tried on the show floor
Toy Fair 2025 Primal Hatch
The 7 best toys we saw at Toy Fair 2025, from a Lego boat to a hatching, robotic dinosaur
ICYMI
ICYMI: the 7 biggest tech stories of the week, from a next-gen Alexa to the new iPhone 16e
Latest in News
Nicole Kidman wears a blue blouse with her arms crossed.
Netflix might be renewing The Perfect Couple and Beauty in Black for season 2, but I don’t get why when it’s canceled shows with poorer ratings
The Russo brothers posing for a photograph and Herman carrying a Volkswagen camper van in The Electric State
'We're optimists': AI enthusiasts Joe and Anthony Russo defend its use in movies and TV shows, but admit there are 'very real dangers' around its application
UK Prime Minister Sir Kier Starmer
UK PM says AI should soon replace civil servants
Xbox Copilot in Minecraft
Microsoft confirms Copilot can be tested by Xbox Insiders next month and shares new details about how the AI sidekick will enhance the player experience: 'It has to be about gameplay, it has to be personalized to you'
Eight Samsung TVs mounted to the wall showing different basketball games
Samsung is offering you 8 new TVs in one bundle for March Madness, in case you want to watch all games at once like a Bond villain’s lair
Image depicting hands typing on a keyboard, with phishing hooks holding files, passwords and credit cards.
Microsoft warns about a new phishing campaign impersonating Booking.com