CustomerZone360 NEWS

Free eNews Subscription

AI Is Driving Greater Accuracy in Advanced Speech Recognition

By Tracey E. Schelmetic May 10, 2023

“I’m sorry, I didn’t quite get that.”

Most of us have had an experience like this with speech-recognition solutions deployed for customer support or other public-facing applications. In many cases, this has led to low public opinion of the technology, which has often made us wonder if we’re speaking properly. (There’s nothing like being gaslit by a machine.)

The good news is that there is evidence that AI is driving greater accuracy in advanced speech recognition (ASR) technology. A new report from 3Play Media, a media accessibility company, found that the accuracy of ASR technology has improved measurably since the company’s last evaluation in 2022. As ASR improves, it's important to understand which engine is best for different use cases. Some nuances to consider include performance on different error types, transcription styles, formatting, and industry-specific content.

Accuracy is the key component in captioning for several reasons, most importantly ensuring that individuals who are deaf or hard of hearing and rely on captions as an accommodation receive information that fully depicts the original content. For captions to be accessible and legally compliant, they need to be 99 percent accurate, the industry requirement for accessibility. While there was improvement across industry leaders, the study found that even the best engines performed well below 99 percent accuracy, indicating a continued need for human revision.

The report, titled “The 2023 State of ASR,” measured accuracy against two measurements, Word Error Rate (WER) and Formatted Error Rate (FER). While WER is used as the standard measure of transcription accuracy, FER considers formatting, sound effects, grammar, and punctuation and is a better representation of the experienced accuracy of captioning. Accuracy in FER is harder to achieve, and even the best-tested engines were only 82 percent accurate, whereas the best-tested engines in WER were 93 percent accurate.

“The advances in AI we’ve seen across industries have also had an impact on ASR,” said Chris Antunes, co-CEO and cofounder of 3Play Media. “Longtime industry leader Speechmatics and newer entrants AssemblyAI and Whisper performed at the top of the pack, with each excelling in different areas. This proves that not all engines are created equal -- the training material and models matter -  and that there is room at the top for multiple engines to specialize in different use cases.”

Edited by Greg Tavarez
Get stories like this delivered straight to your inbox. [Free eNews Subscription]

CustomerZone360 Contributor

Related Articles

TELUS International Study Highlights the Importance of Voice Engagement for Customers

By: Tracey E. Schelmetic    4/24/2024

A recent study completed by TELUS International highlighted the importance of a future in which individuals can engage with data - including customer …

Read More

Nimble and PhoneBurner Partner for a Solution to Improve Outbound Calling

By: Tracey E. Schelmetic    4/23/2024

CRM solutions provider Nimble recently unveiled an integration with PhoneBurner. The partnership blends PhoneBurner's outbound calling with Nimble's p…

Read More

Calabrio Offers a New Solution to Quality Monitor and Analyze Chatbots

By: Tracey E. Schelmetic    4/23/2024

Workforce performance solutions provider Calabrio recently announced a new suite of Bot Analytics tools for Quality Management (QM). Bot Analytics giv…

Read More

CUSTOMER Magazine Announces Winners of the 2024 Voice Technology Excellence Awards

By: TMCnet News    4/19/2024

The CUSTOMER Voice Technology Excellence Awards recognize vendors that are emerging as the true leaders in an evolving and growing Voice Technology ma…

Read More

Eventide Steps into Call Centers with NexLog DX-Series Recorders

By: Greg Tavarez    4/17/2024

Eventide Communications, a provider of public safety recording solutions, extended its reach into the call center market with the NexLog DX-Series rec…

Read More