Speech to text using AWS Transcribe, S3, Lambda, and output notifications using SNS and CloudWatch Events.

Speech to text is the process of converting audio to text. For computer software and programs, audio files are near impossible to be used for visualization/analysis or to harvest data out of it in a meaningful way. Therefore there is the need of converting these audio files to text before they can be used for analysis and harvest data. Although speech to text seems very simple in the current technology world it involves many linguistic models and algorithms to provide a near 100% accuracy.

Currently, there are many tools created by software providers who have created their own models and algorithms to provide this speech-to-text as a service. In this article, I am going to go through one such kind of service provided by AWS for speech to text named AWS Transcribe.

AWS Transcribe

AWS Transcribe is the speech-to-text solution provided by Amazon Web Services which has renowned to be very quick and have high accuracy.AWS Transcribe under the hood uses a deep learning process names ASR (automatic speech recognition) to convert the audio to text quickly and more accurately. It also has a separate service inside named Amazon Transcribe Medical to be used for medical documentation applications as well.

AWS Transcribe has a well-created API where programs can automate transcribe jobs of converting audio files to text files. But once you start a transcribing job since it may take time depending on the file, AWS Transcribe will not send the output in the same request as a response. Since that we need to either poll continuously to check whether the transcribe job is completed or we are going to need to have some kind of event triggers to identify the status of the job. In this article, we are going to explore two kinds of such event triggers that will make it be possible to automate transcribe from start to end. Below is a brief overview of what we are going to accomplish in this article.

#lambda #s3 #aws #speech-recognition #aws transcribe

Speech to Text using AWS Transcribe, S3 and Lambda
1.85 GEEK