In this post, I will show you how to extract speech from a video recording and convert it into a text document. This is a simple machine learning project that will help you understand some basics of Google’s speech recognition library. Speech recognition is a popular topic in machine learning, and it is being used in more and more fields. For example, many of the subtitles we see on Netflix shows or YouTube videos are generated by machines using artificial intelligence. Other great examples of speech recognizers are personal voice assistants such as Google’s Home Mini, Amazon Alexa, and Apple’s Siri.

Table of Contents:

  • Getting Started
  • Step 1: Import Libraries
  • Step 2: Video to Audio Conversion
  • Step 3: Speech Recognition
  • Final Step: Exporting Result

Getting Started

As the title suggests, we will need a video recording for this project. It can even be a recording of yourself speaking to the camera. Using a library called _MoviePy_, we will extract the audio from the video recording. In the next step, we will convert that audio file into text using Google’s speech recognition library. If you are ready, let’s get started by installing the libraries!
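To make the plan concrete, here is a minimal sketch of the whole pipeline, not the exact code from the steps that follow. It assumes the `moviepy` and `SpeechRecognition` packages are installed from PyPI (for example with `pip install moviepy SpeechRecognition`). The function names, the `.wav` naming rule, and the `transcript.txt` default are hypothetical choices made for illustration.

```python
from pathlib import Path


def audio_path_for(video_path: str) -> str:
    """Return a .wav path next to the video (hypothetical naming rule)."""
    return str(Path(video_path).with_suffix(".wav"))


def extract_audio(video_path: str) -> str:
    """Write the video's audio track to a WAV file and return its path."""
    # Imported lazily so the path helper works without MoviePy installed.
    try:
        from moviepy import VideoFileClip  # MoviePy 2.x layout
    except ImportError:
        from moviepy.editor import VideoFileClip  # MoviePy 1.x layout

    wav_path = audio_path_for(video_path)
    clip = VideoFileClip(video_path)
    try:
        if clip.audio is None:
            raise ValueError(f"{video_path} has no audio track")
        clip.audio.write_audiofile(wav_path)
    finally:
        clip.close()  # release the underlying file handles
    return wav_path


def transcribe(wav_path: str) -> str:
    """Send the audio to Google's web speech API and return the text."""
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.AudioFile(wav_path) as source:
        audio = recognizer.record(source)  # read the entire file
    # Needs an internet connection; raises sr.UnknownValueError
    # when the speech cannot be understood.
    return recognizer.recognize_google(audio)


def video_to_transcript(video_path: str, out_path: str = "transcript.txt") -> str:
    """Run both steps and save the recognized text to out_path."""
    text = transcribe(extract_audio(video_path))
    Path(out_path).write_text(text, encoding="utf-8")
    return text
```

Calling `video_to_transcript("my_recording.mp4")`, where the file name is a placeholder for your own recording, would write the recognized text to `transcript.txt`. Sending the whole file in one request is likely to work best for short clips, so longer recordings may need to be split into smaller pieces first.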

#data-science #speech-recognition #machine-learning #artificial-intelligence #python

Extracting Speech from Video using Python