Speech Analytics, Sound Analytics in TorchAudio

The landscape of DataScience is changing everyday. In the last few years we have seen numerous number of research and advancement in NLP and Computer Vision field. But there is a field which is still unexplored and has a lot of potential , the field is — SPEECH.

In the last tutorial we have learned about:

What is a Sound wave?

2. The basic properties of sound wave

3. Feature Extraction from sound wave

4. Pre-Processing a sound wave

In this tutorial we would be looking into the practical application of it in Python. The two most popular libraries to help you in your journey are:

Librosa
TorchAudio

TorchAudio — It is a PyTorch domain library consisting of I/O, popular datasets and common audio transformations that can bring new speed and efficiency to your PyTorch projects. It is one of the powerful speech modulation software created by Facebook.

#speech-analytics #speech-recognition #machine-learning-ai #pytorch #speech #ai

medium.com

Speech Analytics, Sound Analytics in TorchAudio