Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

LinkedIn Learning

Microsoft Cognitive Services for Developers: 2 Speech

via LinkedIn Learning

Overview

Learn how to use Microsoft Cognitive Services to embed AI in your applications. This course covers the speech APIs for text translation, text to speech, and speaker recognition.

Microsoft Cognitive Services opens up artificial intelligence to every developer. Cognitive Services gives us access to powerful AI algorithms that allow our apps to hear, speak, identify, and interpret information using natural methods of communication. This course explores the speech APIs, which are responsible for text translation, text to speech, and speaker recognition. Instructor Sahil Malik shows you how to configure Cognitive Services and use the speech capabilities to translate text, convert text to speech and speech to text, and identify and authenticate multiple speakers. Along the way, he covers various APIs, including Translate Text, Bing Speech, and Speaker Recognition.

Syllabus

Introduction
  • Welcome
  • What you should know
  • How to use the exercise files
1. The Basics
  • Introduction
  • Set up a Node.js project
  • Add TypeScript debugging support
  • Add helper code and debugging
2. Translation Text API
  • Set up a project
  • Set up Translate Text API in Azure and update a project
  • Write code for getting supported languages
  • Get supported languages running and refactor the code
  • Get supported languages using an access token
  • Get supported languages using access token
  • Get language names
  • Write code for translating between langauges
  • Translations between multiple languages
  • Break apart longer sentences
  • Translate lots of content using TranslateArray
  • Get languages for Speak
  • Performing Text-to-Speech
3. Bing Speech API
  • Set up a project and register Bing Speech in Azure
  • Speech to Text
  • Get an access token for Text-to-Speech
  • Bing Speech Text-to-Speech
4. Speaker Recognition API
  • The theory behind speaker recognition
  • Register speaker recognition API in Azure and update the project
  • Write business objects
  • Craft up the structure of identification helper class
  • Write code to create and delete identification profiles
  • Write code for enrollment
  • Write code for speaker identification
  • Enrolling the first user
  • Enrolling the second user
  • Identifying speakers
  • Delete identification profiles
Conclusion
  • Next steps

Taught by

Sahil Malik

Related Courses

Reviews

Start your review of Microsoft Cognitive Services for Developers: 2 Speech

Never Stop Learning!

Get personalized course recommendations, track subjects and courses with reminders, and more.

Sign up for free