Speaker Recognition | By Award Winning Textbook Author

Audio processing, feature extraction, speaker recognition, deep learning, with coding examples

This course is an introduction to speaker recognition techniques.

What you’ll learn

  • Basic concepts and core algorithms in speaker recognition.
  • Audio processing and acoustics.
  • Machine learning and deep learning basics.
  • Coding practice and toolkits for audio and speech.

Course Content

  • Introduction to this course –> 4 lectures • 9min.
  • The History of Voice Identity Techniques –> 4 lectures • 29min.
  • Fundamental of Audio Processing –> 6 lectures • 48min.
  • Acoustic Feature Extraction –> 4 lectures • 36min.
  • Fundamentals of Speaker Recognition –> 7 lectures • 54min.
  • Early Speaker Recognition Approaches –> 9 lectures • 57min.
  • Deep Learning Basics –> 7 lectures • 52min.
  • Speaker Recognition with Deep Learning –> 7 lectures • 57min.
  • Data Processing in Speaker Recognition –> 7 lectures • 55min.

Speaker Recognition | By Award Winning Textbook Author


This course is an introduction to speaker recognition techniques.


Speaker recognition lies in the intersection of audio processing, biometrics, and machine learning, and has various applications. You can find the application of speaker recognition on your smart phones, smart home devices, and various commercial services.


In this course, we will start with an introduction to the history of speaker recognition techniques, to see how it evolved from simple human efforts to modern deep learning based intelligent systems.


We will cover the basics of acoustics, perception, audio processing, signal processing, and feature extraction, so you don’t need a background in these domains. We will also have an introduction of popular machine learning approaches, such as Gaussian mixture models, support vector machines, factor analysis, and neural networks.


We will focus on how to build speaker recognition systems based on acoustic features and machine learning models, with an emphasis on modern speaker recognition with deep learning, such as the different options for inference logic, loss function, and neural network topologies.


We will also talk about data processing techniques such as data cleansing, data augmentation, and data fusion.


If you are a college student interested in AI or signal processing, or an engineer or product manager working with related technologies, then this course is definitely for you!

Get Tutorial