How To Build A Virtual Assistant Using Python

In this article will show you how to build your own virtual assistant using the Python Programming Language! A virtual assistant is an application that can understand voice commands and complete tasks for a user. Google’s assistant and Amazon’s Alexa are good examples of virtual assistants.

I will start by stating what I want this program to do. This program will run specific commands once a wake word is given by the users voice. This program will then execute those commands audibly back to the user.

The commands to be executed are as follows:

Say a random greeting back to the user, if the user used a greeting word.
Get the date for the user
Get the time for the user
Get information about a person to the user
(NOTE: We can use Wikipedia for this)

For example, the user may say ‘hey computer, what time is it?’ or ‘okay computer, what is today’s date ?’ and the virtual assistant will respond accordingly.
Now, let’s write a short description of what this program is going to do in comments.

# Description: This is a virtual assistant program that will greet 
#              you with a random greeting, get the date, time, and
#              information on a person.

We need to install a few packages pyadio SpeechRecognition gTTSand wikipedia .

pip install pyaudio
pip install SpeechRecognition 
pip install gTTS
pip install wikipedia

Import The Libraries & Packages

Next import some of the libraries that will be used within this program. We will use the warnings library to ignore the warnings we get with this program. The library speech_recognition will be used to recognize speech :).

The os library will allow us to interact with the Operating System. The gtts library will help us convert text to speech. The wikipedia library will allow us to get information about a person from Wikipedia. The datetime library will allow us to get the current date and time. The calendar library will allow us to get the day of the week, and the random library will be used for randomization.

# Import the libraries
import speech_recognition as sr
import os
from gtts import gTTS
import datetime
import warnings
import calendar
import random
import wikipedia

We will ignore any warning messages that may be given during the execution of this program.


# Ignore any warning messages
warnings.filterwarnings('ignore')

Create Helpful Functions

We will start creating helpful functions that will make the code look clean and execute certain commands.
First we need a function that can take in audio (a voice command) and recognize the speech, then return that speech as a string (text). Let’s call this function recordAudio().

# Record audio and return it as a string
def recordAudio():    
    # Record the audio
    r = sr.Recognizer()
    with sr.Microphone() as source:  
       print('Say something!')
       audio = r.listen(source)
    
    # Speech recognition using Google's Speech Recognition
    data = ''
    try:
        data = r.recognize_google(audio)
        print('You said: ' + data)
    except sr.UnknownValueError:
        print('Google Speech Recognition could not understand')
    except sr.RequestError as e:
        print('Request error from Google Speech Recognition')
     return data

Perfect ! We now have a function to record the audio. Let’s create a function for the program to respond back to the user audibly and call it assistantResponse(). The function will take in a string (text) and convert it to audio. I will also have this function print the text to the screen for testing purposes.


# Function to get the virtual assistant response
def assistantResponse(text):
    print(text)
    # Convert the text to speech
    myobj = gTTS(text=text, lang='en', slow=False)
    
    # Save the converted audio to a file    
    myobj.save('assistant_response.mp3')
    # Play the converted file
    os.system('start assistant_response.mp3')

Next, we will create a function to take in some text and check if the wake word was given in that text. For this program I created two wake words “okay computer” and “hey computer”, similar to the Google assistant wake word “hey Google” and “okay Google”. If the wake word was detected from the text then the function will return True otherwise it will return False.


# A function to check for wake word(s)
def wakeWord(text):
    WAKE_WORDS = ['hey computer', 'okay computer'] 
    text = text.lower()  # Convert the text to all lower case words
  # Check to see if the users command/text contains a wake word    
    for phrase in WAKE_WORDS:
        if phrase in text:
            return True
  # If the wake word was not found return false
    return False

Now, I will create a function to return today’s date as a string. For example it will return “Today is Monday October the 21st” if today was indeed Monday October 21st . Let’s call this function getDate().


def getDate():
    
    now = datetime.datetime.now()
    my_date = datetime.datetime.today()
    weekday = calendar.day_name[my_date.weekday()]# e.g. Monday
    monthNum = now.month
    dayNum = now.day
    month_names = ['January', 'February', 'March', 'April', 'May',
       'June', 'July', 'August', 'September', 'October', 'November',   
       'December']
    ordinalNumbers = ['1st', '2nd', '3rd', '4th', '5th', '6th', 
                      '7th', '8th', '9th', '10th', '11th', '12th',                      
                      '13th', '14th', '15th', '16th', '17th', 
                      '18th', '19th', '20th', '21st', '22nd', 
                      '23rd', '24th', '25th', '26th', '27th', 
                      '28th', '29th', '30th', '31st']
   
return 'Today is ' + weekday + ' ' + month_names[monthNum - 1] + ' the ' + ordinalNumbers[dayNum - 1] + '.'

Next, we will create a function that takes in text and returns a random greeting response as text to the user, if the user said a greeting input like ‘hello’ or ‘hi’ for example the function will return some random greeting like ‘howdy’. This function will be called greeting().


# Function to return a random greeting response
def greeting(text):
    # Greeting Inputs
    GREETING_INPUTS = ['hi', 'hey', 'hola', 'greetings', 'wassup', 'hello']
     
		 # Greeting Response back to the user
    GREETING_RESPONSES = ['howdy', 'whats good', 'hello', 'hey there']
    
		# If the users input is a greeting, then return random response
    for word in text.split():
        if word.lower() in GREETING_INPUTS:
            return random.choice(GREETING_RESPONSES) + '.'
    
		# If no greeting was detected then return an empty string
    return ''

Last but not least we will create a function to get a person’s first and last name from text after detecting the key command, ‘who is’. Once we detect the word ‘who’ followed by the word ‘is’ then we will return the next two words as a single string (the next two words should be the person’s first name followed by that person’s last name).
For example if the user says ‘Who is LeBron James ?’ , then the function will take in that text and simply return ‘LeBron James’ . We will use this function later on to get a two sentence summary about the person from Wikipedia. This function will be called getPerson().


# Function to get a person first and last name
def getPerson(text):
 wordList = text.split()# Split the text into a list of words     
 
 for i in range(0, len(wordList)):
   if i + 3 <= len(wordList) - 1 and wordList[i].lower() == 'who' and wordList[i + 1].lower() == 'is':
            return wordList[i + 2] + ' ' + wordList[i + 3]

Create the Main Program

All of the most popular virtual assistants (Google Assistant, Amazon Alexa, & Apples Siri) continuously listen to your conversation and waits to execute commands only after hearing the wake word like ‘Okay Google’, ‘Alexa’ , or ‘Hey Siri’.
This means we will need to have the program consistently listening for the wake word, so we will need a continuous loop that runs forever recording the audio.
Once the wake word is said, then we will check to see if the user said a greeting, ‘date’, ‘time’, or ‘who is’ , and have the computer respond accordingly by audio.


while True:
    # Record the audio
    text = recordAudio()
    response = '' #Empty response string
     
    # Checking for the wake word/phrase
    if (wakeWord(text) == True):
         # Check for greetings by the user
         response = response + greeting(text)
         # Check to see if the user said date
        if ('date' in text):
            get_date = getDate()
            response = response + ' ' + get_date
         # Check to see if the user said time
        if('time' in text):
            now = datetime.datetime.now()
            meridiem = ''
            if now.hour >= 12:
                meridiem = 'p.m' #Post Meridiem (PM)
                hour = now.hour - 12
            else:
                meridiem = 'a.m'#Ante Meridiem (AM)
                hour = now.hour
           # Convert minute into a proper string
              if now.minute < 10:
                minute = '0'+str(now.minute)
              else:
                minute = str(now.minute)
             response = response + ' '+ 'It is '+ str(hour)+ ':'+minute+' '+meridiem+' .'
                
        # Check to see if the user said 'who is'
        if ('who is' in text):
            person = getPerson(text)
            wiki = wikipedia.summary(person, sentences=2)            
            response = response + ' ' + wiki
       
       # Assistant Audio Response
       assistantResponse(response)

we are done creating this program !

Thanks for reading !

Originally published by randerson112358 at medium

#python #programming

The commands to be executed are as follows:

Import The Libraries & Packages

Create Helpful Functions

Create the Main Program

youtube.com

How To Build A Virtual Assistant Using Python