Spoken Language Processing in Python
Learn how to load, transform, and transcribe speech from raw audio files in Python.
Learn Speech Recognition and Spoken Language Processing in Python
We learn to speak far before we learn to read. Even in the digital age, our main method of communication is speech. Spoken Language Processing in Python will help you load, transform, and transcribe audio files. You’ll start by seeing what raw audio looks like in Python, and move on to exploring popular libraries and working through an example business use case.
Use Python SpeechRecognition and PyDub to Transcribe Audio Files
Python has a number of popular libraries that help you to process spoken language. SpeechRecognition offers you an easy way to integrate with speech-to-text APIs, while PyDub helps you to programmatically alter audio file attributes to get them ready for transcription. Each of these libraries is covered in an in-depth chapter, offering you the opportunity to put theory into practice to cement your knowledge.
Practice Speech Transcription with an In-Course Project
The final chapter in this course offers you the opportunity to put everything you’ve learned together by building a speech processing proof of concept for a fictional technology company. You’ll build a system that transcribes phone call audio to text and then performs sentiment analysis to review customer support phone calls.
By the end of this course, you’ll have both the knowledge and hands-on experience to put your learning into practice within your job or personal projects.
There are no reviews yet.