No matter your political leaning, the first 2020 Presidential Election Debate was a dumpster fire. It was hard for a human to understand what was being shouted over each other. How will AWS Transcribe do? Let’s set it up and take a look.

The Data

A basic Google search found the audio clip. I had to try a couple until I landed on one that didn’t have news commentary.

https://wgnradio.com/news/audio-first-2020-presidential-debate-between-president-donald-trump-and-former-vice-president-joe-biden/

The Setup

I already have an AWS account and subscribed to Amazon Transcribe a month ago. All I had to do is click Create Job. I chose the General model and asked for a multi-speaker breakdown with a total of three speakers.

Image for post

Transcribe job setup — screenshot by the author a few optional settings then hit Create — screenshot by the author.

The audio split job ran longer than the regular job. Makes sense. Considering the file was 90 minutes long, 20 minutes run time is reasonable.

Image for post

#nlp #aws #data-science #women-in-tech #election-2020

AWS Transcribes the Presidential Debate
1.15 GEEK