Microsoft Cognitive Services for Developers: 2 Speech
Overview
Learn how to use Microsoft Cognitive Services to embed AI in your applications. This course covers the speech APIs for text translation, text-to-speech, and speaker recognition.
Microsoft Cognitive Services opens up artificial intelligence to every developer. Cognitive Services gives us access to powerful AI algorithms that allow our apps to hear, speak, identify, and interpret information using natural methods of communication. This course explores the speech APIs, which are responsible for text translation, text-to-speech, and speaker recognition. Instructor Sahil Malik shows you how to configure Cognitive Services and use the speech capabilities to translate text, convert text to speech and speech to text, and identify and authenticate multiple speakers. Along the way, he covers various APIs, including Translate Text, Bing Speech, and Speaker Recognition.
Syllabus
Introduction
- Welcome
- What you should know
- How to use the exercise files
- Introduction
- Set up a Node.js project
- Add TypeScript debugging support
- Add helper code and debugging
- Set up a project
- Set up Translate Text API in Azure and update a project
- Write code for getting supported languages
- Get supported languages running and refactor the code
- Get supported languages using an access token
- Get supported languages using access token
- Get language names
- Write code for translating between languages
- Translations between multiple languages
- Break apart longer sentences
- Translate lots of content using TranslateArray
- Get languages for Speak
- Performing Text-to-Speech
- Set up a project and register Bing Speech in Azure
- Speech to Text
- Get an access token for Text-to-Speech
- Bing Speech Text-to-Speech
- The theory behind speaker recognition
- Register speaker recognition API in Azure and update the project
- Write business objects
- Craft the structure of the identification helper class
- Write code to create and delete identification profiles
- Write code for enrollment
- Write code for speaker identification
- Enrolling the first user
- Enrolling the second user
- Identifying speakers
- Delete identification profiles
- Next steps
Taught by
Sahil Malik
Related Courses
- Microsoft Cognitive Services for Developers: 3 Language
- Learning Microsoft Cognitive Services for Developers
- Microsoft Cognitive Services for Developers: 1 Vision
- Microsoft Cognitive Services for Developers: 4 Search
- Microsoft Graph for Developers
- Developing AI Speech Apps with Microsoft Cognitive Services