Google Cloud: AI Speech-to-Text with Python 3
Learn to implement AI-powered speech-to-text features using Google Cloud services and Python 3, focusing on practical applications.
Welcome! My name is Bruce Bookman and I’m a subject matter expert in Conversational AI at Google. In this course, I will show you how to incorporate Google’s powerful Speech-to-Text Artificial Intelligence models into a Python program.
Google Speech-to-Text converts audio to text by applying neural network models through an easy-to-use API. You will start by exploring the main use cases for Speech-to-Text (STT) and getting an overview of the API.
You will then run demo code that calls the API to transcribe an audio file. You will walk through the code line by line, so you understand what each part does.
In the following chapters, you will focus on recognition configuration, speech adaptation, and the different models used for speech recognition. Lastly, you will learn about word error rate and how to measure transcription accuracy.
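To give a sense of the accuracy measurement mentioned above: word error rate (WER) is commonly defined as the number of word substitutions, deletions, and insertions needed to turn the transcript into the reference text, divided by the number of words in the reference. The sketch below is a minimal illustration under that definition and is not taken from the course materials; the function name `word_error_rate` and the simple lowercase-and-split normalization are assumptions made for this example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference words.

    Assumption: words are compared after lowercasing and splitting on
    whitespace; real evaluations often normalize punctuation and numbers too.
    """
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # Word-level edit distance, keeping only the previous row of the table.
    previous = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current.append(
                min(
                    previous[j] + 1,                      # deletion
                    current[j - 1] + 1,                   # insertion
                    previous[j - 1] + substitution_cost,  # match or substitution
                )
            )
        previous = current

    return previous[-1] / len(ref_words)
```

For example, comparing the reference "turn on the lights" with the transcript "turn the light" gives one deletion and one substitution over four reference words, so the function returns 0.5. Note that WER can exceed 1.0 when the transcript contains many inserted words.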
By the end of this course, you will be able to integrate STT into your own Python projects, and you will have a great new skill for your resume.