image
Enabling Speech

Contents

Start course
Difficulty
Intermediate
Duration
22m
Students
1362
Ratings
4.9/5
Description

The course starts by providing a quick overview of some of Azure’s AI services and what you're going to build.

You'll then be shown:

  • How to create a searchable knowledge base using the question answering service
  • How to create a bot and connect it to the knowledge base
  • How to create a web chat app and connect it to the bot
  • How to enable spoken input and output in the chatbot

Learning Objectives

  • Describe Azure AI services
  • Create a knowledge base using the question answering service
  • Create a bot and connect it to the knowledge base
  • Create a web chat app and connect it to the bot
  • Enable spoken input and output

Intended Audience

  • Azure AI engineers

Prerequisites

  • Experience with Azure
  • Experience with writing code (not mandatory)

Resources

The GitHub repository for this course is at https://github.com/cloudacademy/azure-chatbot.

Transcript

The last step to accomplish our goal is to get our chatbot to support speech input and output. First, I’ll show you a simple way to add speech, although it’s not suitable for production use, and then I’ll explain how you’d use a more robust method.

The code I’m going to use for a speech-enabled version of the web chat is much longer than the other one, so I put it in the GitHub repository for this course. By the way, I didn’t write this code. It’s a sample from Microsoft. Download speech.html from the repository. Then go back into the App Service Editor, right-click in the files area, and select “Upload Files”. Select the speech.html file.

This code only works on some browsers, such as Chrome and Edge. In one of the supported browsers, paste the app’s URL in the address bar. Then go to the Web Chat channel configuration and copy the first secret key. Now go back to the address bar in the browser, add “?s=” and then paste the secret key. You wouldn’t want to use this method in production because you’d have to reveal this key to all of your users, which would be a security problem.

First, let’s make sure it’s connected to the bot. Type “run bot offline” again. Good, it’s working. Now let’s try speech input. Let’s ask it something else, like how to remove a bot. When you click the microphone icon, your browser might ask you if you want to allow it to use your microphone. If it does, then click “Allow”. Great, it read the answer too.

Now, to add speech to your bot in a production-ready way, here’s what you’d need to do. First, to make sure it will work on all browsers, create an instance of the Speech Service, which is part of Cognitive Services. Then create a Direct Line Speech channel in your bot and connect it to the Speech service you created. Next, make sure that your App Service has web sockets enabled so that the bot can communicate with the Direct Line Speech channel. You also need to select “Enable Streaming Endpoint” in your bot’s configuration.

Finally, you need to add code to your bot that uses the Speech SDK to do speech recognition and speech synthesis. The code will have to authenticate with the Speech service first. To avoid having to expose the secret key for the Speech service, this code would need to retrieve a token from the Speech service. The token would expire after 10 minutes, so a hacker couldn’t use it to access your speech service after that.

It takes quite a while to set all of this up, and it’s outside the scope of this course, so I won’t be demonstrating it. If you’re interested in learning more, you can read the tutorial at this URL.

And that’s it for building our chatbot. Please give this course a rating, and if you have any questions or comments, please let us know. Thanks!

 

About the Author
Students
216081
Courses
98
Learning Paths
164

Guy launched his first training website in 1995 and he's been helping people learn IT technologies ever since. He has been a sysadmin, instructor, sales engineer, IT manager, and entrepreneur. In his most recent venture, he founded and led a cloud-based training infrastructure company that provided virtual labs for some of the largest software vendors in the world. Guy’s passion is making complex technology easy to understand. His activities outside of work have included riding an elephant and skydiving (although not at the same time).