Voice biometrics #
Identification and voice recognition are issues that have a number of practical applications in automation, authentication and security. It is a popular method of remote authorization thanks to its non-invasive and accessibility (e.g. telephone, personal computer). This method combines physical and behavioral elements. It uses a person’s acoustic features, which are shaped by biological features (e.g. the shape of the larynx) and by additional behavioral features such as stress, rhythm, intonation and vocabulary selection.
Pre-read (Required) #
- https://azure.microsoft.com/en-us/services/cognitive-services/speaker-recognition/#features
- https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speaker-recognition-overview#speaker-verification
- https://docs.scipy.org/doc/scipy/reference/generated/scipy.signal.spectrogram.html
Class content #
- Implementation of audio sampling and visualization (e.g. MFFC, spectrogram).
- Introduction to 1D convolution and Recurrent Neural Networks.
- Implementation of Microsoft Azure - Speaker Recognition.
- Solution security analysis, attack vector through voice imitation, voice generation (deepfake).