How to Build a Voice-controlled Virtual Assistant (IVR) in Python Using Flask and Plivo

How to Build a Voice-controlled Virtual Assistant (IVR) in Python Using Flask and Plivo

A virtual assistant can help your business if you have clients who call your phone number. Interactive voice response (IVR) helps you to automate call reception by routing callers to the most appropriate department or the agent most qualified to meet their needs. Among its many advantages, IVR can provide increased operational efficiency, a stronger brand image, and better customer insights.

A voice-controlled virtual assistant is one step ahead of the legacy Touch-Tone/DTMF controlled one because of the flexibility it allows end-users. They can just speak into their phone’s microphone to provide input to control the call.

Building a voice-controlled virtual assistant using Plivo’s automatic speech recognition (ASR) feature in Python using Flask is simple. This guide shows you how to set up a voice-controlled IVR phone tree to a Plivo number and manage the call flow when the call reaches the Plivo voice platform. To see how to do this, we’ll build a Flask application to receive an incoming call and use the GetInput XML element to capture speech input and implement a simple IVR phone system.

Prerequisites

Before you get started, you’ll need:

  • A Plivo account — sign up for one for free if you don’t have one already.
  • A voice-enabled Plivo phone number if you want to receive incoming calls. To search for and buy a number, go to Phone Numbers > Buy Numbers on the Plivo console.
Buy a New Plivo Number
  • Flask and Plivo Python packages — run pip install plivo flask to install them.
  • ngrok — a utility that exposes your local development server to the internet over secure tunnels.

How it works

Receive Speech Inputs

Create a Flask application to create a voice-controlled virtual assistant

Once you’ve installed Flask and the Plivo Python SDK, create a simple Flask application to handle incoming calls on a Plivo number. To handle an incoming call, you need to return an XML document from the URL configured as the Answer URL in the application assigned to the Plivo number. The Python SDK can manage the XML document generation, and you can use the GetInput XML element to capture speech inputs and implement a simple IVR phone system. Use this code:

from flask import Flask, Response, url_for
from plivo import plivoxml

# Welcome message - firstbranch
welcome_message = "Welcome to the demo app, Say Sales to talk to our 
Sales representative. Say Support to talk to our Support representative"

# This is the message that Plivo reads when the caller does nothing at all
noinput_message = "Sorry, I didn't catch that. Please hangup and try again later."

# This is the message that Plivo reads when the caller inputs a wrong digit.
wronginput_message = "Sorry, it's a wrong input."

app = Flask(__name__)

@app.route('/virtual_assistant/', methods = ['GET', 'POST'])
def virtual_assistant():
    element = plivoxml.ResponseElement()
    response = element.add(plivoxml.GetInputElement()
        .set_action(url_for('firstbranch', _external=True))
        .set_method('POST').set_input_type('speech')
        .set_interim_speech_results_callback(url_for('firstbranch', _external=True))
        .set_interim_speech_results_callback_method('POST')
        .set_redirect(True)
        .add_speak(content = welcome_message))
    response.add(plivoxml.SpeakElement(noinput_message))

    return Response(response.to_string(), mimetype = 'text/xml')

@app.route('/virtual_assistant/firstbranch/', methods = ['GET', 'POST'])
def firstbranch():
    response = plivoxml.ResponseElement()
    speech = request.values.get('Speech')
    from_number = request.values.get('From')
    print("Speech input is:"+str(speech))

    if speech == "sales":
        response = plivoxml.ResponseElement()
        response.add(
            plivoxml.DialElement()
            .add(plivoxml.NumberElement('+14156667777')))

    elif speech == "support":
        response = plivoxml.ResponseElement()
        response.add(
            plivoxml.DialElement()
            .add(plivoxml.NumberElement('+14156667778')))
    else:
        response.add_speak(wronginput_message)
    
    return Response(response.to_string(), mimetype = 'text/xml')

if __name__ == '__main__':
    app.run(host = '0.0.0.0', debug = True)

Test the code locally

Save the code in any file — we named the file virtual_assistant.py. To run the code on the server, go to the folder where the file resides and use the command

$ python virtual_assistant.py

You should see your basic server app in action on http://localhost:5000/virtual_assistant/

Expose the local server to the internet using ngrok

Once you see the application working locally, the next step is to connect the application to the internet to return the XML document to process the incoming call. For that, we recommend using ngrok, which exposes local servers behind NATs and firewalls to the public internet over secure tunnels.

Install it and run ngrok on the command line, specifying the port that hosts the application on which you want to receive calls (5000 in this case, as our local Flask application is running there):

$ ./ngrok http 5000

Ngrok will display a forwarding link that you can use as a webhook to access your local server over the public network.

Ngrok CLI

Test the link by opening the ngrok URL(http://fd3a77b913ed.ngrok.io/virtual_assistant/) in a browser or HTTPie to check the XML response from the ngrok URL.

XML document with GetDigits XML element

Connect the Flask application to a Plivo number

The final step is to configure the application as a Plivo voice application and assign it to a Plivo number on which you want to activate the voice-controlled virtual assistant.

Go to the Plivo console and navigate to Voice > Applications > XML, then click on the Add New Application button in the upper right.

Provide a friendly name for the application — we used “App-Virtual-Assistant” — and configure the ngrok URL http://fd3a77b913ed.ngrok.io/virtual_assistant/ as the Answer URL. Select the HTTP verb as POST, then click Create Application.

Create Plivo App for voice-controlled IVR MVC app

Now go to Phone Numbers > Your Numbers and click on the number to which you want to assign the application. From the Plivo Application drop-down, choose the voice application you just created. Finally, click Update Number.

Assign Virtual-Assistant Plivo App

Test the application

Make a phone call to the Plivo number you selected. You should see that the VirtualAssistant Flask application automatically routes the call to the Sales and Support departments based on the speech inputs received on the call.

And that’s how simple it is to set up a voice-controlled virtual assistant on a Plivo number and handle it using XML documents using Plivo’s Python SDK and an Flask application. You can implement other use cases on the Plivo Voice platform, such as phone system IVR, call forwarding, and number masking, as your business requires.

Haven’t tried Plivo yet? Getting started is easy and only takes five minutes. Sign up today.

The State of Marketing in 2024

HubSpot's Annual Inbound Marketing Trends Report

Frequently asked questions

No items found.
footer bg

Subscribe to Our Newsletter

Get monthly product and feature updates, the latest industry news, and more!

Thank you icon
Thank you!
Thank you for subscribing
Oops! Something went wrong while submitting the form.