In this article will show you how to build your own virtual assistant using the Python Programming Language! A virtual assistant is an application that can understand voice commands and complete tasks for a user. Google’s assistant and Amazon’s Alexa are good examples of virtual assistants.
I will start by stating what I want this program to do. This program will run specific commands once a wake word is given by the users voice. This program will then execute those commands audibly back to the user.
For example, the user may say ‘hey computer, what time is it?’ or ‘okay computer, what is today’s date ?’ and the virtual assistant will respond accordingly.
Now, let’s write a short description of what this program is going to do in comments.
# Description: This is a virtual assistant program that will greet
# you with a random greeting, get the date, time, and
# information on a person.
We need to install a few packages pyadio
SpeechRecognition
gTTS
and wikipedia
.
pip install pyaudio
pip install SpeechRecognition
pip install gTTS
pip install wikipedia
Next import some of the libraries that will be used within this program. We will use the warnings library to ignore the warnings
we get with this program. The library speech_recognition
will be used to recognize speech :).
The os
library will allow us to interact with the Operating System. The gtts
library will help us convert text to speech. The wikipedia
library will allow us to get information about a person from Wikipedia. The datetime
library will allow us to get the current date and time. The calendar
library will allow us to get the day of the week, and the random
library will be used for randomization.
# Import the libraries
import speech_recognition as sr
import os
from gtts import gTTS
import datetime
import warnings
import calendar
import random
import wikipedia
We will ignore any warning messages that may be given during the execution of this program.
# Ignore any warning messages
warnings.filterwarnings('ignore')
We will start creating helpful functions that will make the code look clean and execute certain commands.
First we need a function that can take in audio (a voice command) and recognize the speech, then return that speech as a string (text). Let’s call this function recordAudio().
# Record audio and return it as a string
def recordAudio():
# Record the audio
r = sr.Recognizer()
with sr.Microphone() as source:
print('Say something!')
audio = r.listen(source)
# Speech recognition using Google's Speech Recognition
data = ''
try:
data = r.recognize_google(audio)
print('You said: ' + data)
except sr.UnknownValueError:
print('Google Speech Recognition could not understand')
except sr.RequestError as e:
print('Request error from Google Speech Recognition')
return data
Perfect ! We now have a function to record the audio. Let’s create a function for the program to respond back to the user audibly and call it assistantResponse(). The function will take in a string (text) and convert it to audio. I will also have this function print the text to the screen for testing purposes.
# Function to get the virtual assistant response
def assistantResponse(text):
print(text)
# Convert the text to speech
myobj = gTTS(text=text, lang='en', slow=False)
# Save the converted audio to a file
myobj.save('assistant_response.mp3')
# Play the converted file
os.system('start assistant_response.mp3')
Next, we will create a function to take in some text and check if the wake word was given in that text. For this program I created two wake words “okay computer” and “hey computer”, similar to the Google assistant wake word “hey Google” and “okay Google”. If the wake word was detected from the text then the function will return True otherwise it will return False.
# A function to check for wake word(s)
def wakeWord(text):
WAKE_WORDS = ['hey computer', 'okay computer']
text = text.lower() # Convert the text to all lower case words
# Check to see if the users command/text contains a wake word
for phrase in WAKE_WORDS:
if phrase in text:
return True
# If the wake word was not found return false
return False
Now, I will create a function to return today’s date as a string. For example it will return “Today is Monday October the 21st” if today was indeed Monday October 21st . Let’s call this function getDate().
def getDate():
now = datetime.datetime.now()
my_date = datetime.datetime.today()
weekday = calendar.day_name[my_date.weekday()]# e.g. Monday
monthNum = now.month
dayNum = now.day
month_names = ['January', 'February', 'March', 'April', 'May',
'June', 'July', 'August', 'September', 'October', 'November',
'December']
ordinalNumbers = ['1st', '2nd', '3rd', '4th', '5th', '6th',
'7th', '8th', '9th', '10th', '11th', '12th',
'13th', '14th', '15th', '16th', '17th',
'18th', '19th', '20th', '21st', '22nd',
'23rd', '24th', '25th', '26th', '27th',
'28th', '29th', '30th', '31st']
return 'Today is ' + weekday + ' ' + month_names[monthNum - 1] + ' the ' + ordinalNumbers[dayNum - 1] + '.'
Next, we will create a function that takes in text and returns a random greeting response as text to the user, if the user said a greeting input like ‘hello’ or ‘hi’ for example the function will return some random greeting like ‘howdy’. This function will be called greeting().
# Function to return a random greeting response
def greeting(text):
# Greeting Inputs
GREETING_INPUTS = ['hi', 'hey', 'hola', 'greetings', 'wassup', 'hello']
# Greeting Response back to the user
GREETING_RESPONSES = ['howdy', 'whats good', 'hello', 'hey there']
# If the users input is a greeting, then return random response
for word in text.split():
if word.lower() in GREETING_INPUTS:
return random.choice(GREETING_RESPONSES) + '.'
# If no greeting was detected then return an empty string
return ''
Last but not least we will create a function to get a person’s first and last name from text after detecting the key command, ‘who is’. Once we detect the word ‘who’ followed by the word ‘is’ then we will return the next two words as a single string (the next two words should be the person’s first name followed by that person’s last name).
For example if the user says ‘Who is LeBron James ?’ , then the function will take in that text and simply return ‘LeBron James’ . We will use this function later on to get a two sentence summary about the person from Wikipedia. This function will be called getPerson().
# Function to get a person first and last name
def getPerson(text):
wordList = text.split()# Split the text into a list of words
for i in range(0, len(wordList)):
if i + 3 <= len(wordList) - 1 and wordList[i].lower() == 'who' and wordList[i + 1].lower() == 'is':
return wordList[i + 2] + ' ' + wordList[i + 3]
All of the most popular virtual assistants (Google Assistant, Amazon Alexa, & Apples Siri) continuously listen to your conversation and waits to execute commands only after hearing the wake word like ‘Okay Google’, ‘Alexa’ , or ‘Hey Siri’.
This means we will need to have the program consistently listening for the wake word, so we will need a continuous loop that runs forever recording the audio.
Once the wake word is said, then we will check to see if the user said a greeting, ‘date’, ‘time’, or ‘who is’ , and have the computer respond accordingly by audio.
while True:
# Record the audio
text = recordAudio()
response = '' #Empty response string
# Checking for the wake word/phrase
if (wakeWord(text) == True):
# Check for greetings by the user
response = response + greeting(text)
# Check to see if the user said date
if ('date' in text):
get_date = getDate()
response = response + ' ' + get_date
# Check to see if the user said time
if('time' in text):
now = datetime.datetime.now()
meridiem = ''
if now.hour >= 12:
meridiem = 'p.m' #Post Meridiem (PM)
hour = now.hour - 12
else:
meridiem = 'a.m'#Ante Meridiem (AM)
hour = now.hour
# Convert minute into a proper string
if now.minute < 10:
minute = '0'+str(now.minute)
else:
minute = str(now.minute)
response = response + ' '+ 'It is '+ str(hour)+ ':'+minute+' '+meridiem+' .'
# Check to see if the user said 'who is'
if ('who is' in text):
person = getPerson(text)
wiki = wikipedia.summary(person, sentences=2)
response = response + ' ' + wiki
# Assistant Audio Response
assistantResponse(response)
we are done creating this program !
Thanks for reading !
Originally published by randerson112358 at medium
#python #programming