Google Overhauls Cloud Speech-to-Text API, Now More ‘Business Friendly’

6 years ago

Google has released an overhaul of its Cloud Speech-to-Text API, designed to make the technology more business friendly.

According to a new entry on Google’s Cloud Platform Blog, Google Cloud Speech-to-Text now supports a selection of pre-built models, automatic punctuation, recognition metadata, and standard service level agreement (SLA). The new API promises a reduction in word errors around 54 percent across all of Google’s tests, but in some areas the results are actually far better than that.

“Access to quality speech transcription technology opens up a world of possibilities for companies that want to connect with and learn from their users,” writes Google product manager Dan Aharon. The update takes advantage of Google’s latest research around machine learning technology, he said.

According to the blog post, Google’s Cloud Speech-to-Text APU now supports:

A selection of pre-built models for improved transcription accuracy from phone calls and video
Automatic punctuation, to improve readability of transcribed long-form audio
A new mechanism (recognition metadata) to tag and group your transcription workloads, and provide feedback to the Google team
A standard service level agreement (SLA) with a commitment to 99.9% availability

The company introduced the Google Cloud Speech API in May 2016, and in 2017 the company added several new features including word-level timestamps and support for long-form audio files up to three hours long.

Google said Cloud Speech-to-Text is available now priced at $0.006 USD per 15 seconds for all models, except for the video model, which is twice as expensive at $0.012 USD per 15 seconds.

P.S. Help support us and independent media here: Buy us a beer, Buy us a coffee, or use our Amazon link to shop.

Other articles in the category: News

Apple Ramps Up Talks with OpenAI for iPhone Features: Report

Apple has revived discussions with OpenAI to potentially incorporate the its generative AI technology into upcoming iPhone features this year, claims sources speaking to Bloomberg’s Mark Gurman. These talks focus on adding OpenAI's technology into iOS 18, which we will see previewed at WWDC in June. Sources indicated that the discussions are in early stages....

John Quintet

23 hours ago

Google Launches AI Essentials Course; Free Version for Teachers

Google has introduced a new course called AI Essentials, aimed at breaking down how to use artificial intelligence for professionals across various industries. Upon completion of the course, you’ll get a certificate that you can add to your resumé. Taught by AI experts actively working at Google, this course offers practical, hands-on training in using...

John Quintet

1 day ago

Apple Deepens China Ties Amid Global Supply Chain Shifts

Apple is strategically aligning itself with China while simultaneously expanding its manufacturing footprint in Southeast Asia and India.

Usman Qureshi

2 days ago