Introduction to audio classification with PyTorch
In this Learn module, you learn how to do audio classification with PyTorch. You’ll understand more about audio data features and how to transform the sound signals into a visual representation called spectrograms. Then you’ll build the model by using computer vision on the spectrogram images. That’s right, you can turn audio into an image format, and then do computer vision to classify the word spoken!
Learn the basics features of audio data., Learn how to transform sound signals to a visual image format by using spectrograms., Build a speech classification model that can recognize sounds or spoken words by using convolutional neural networks (CNNs).
Prerequisites
Basic Python knowledge.
Basic knowledge about how to use Jupyter Notebooks.
Basic understanding of CNNs. The “Introduction to Computer Vision with PyTorch” module in this learning path is a good place to start.
There are no reviews yet.