End to End speaker recognition system

My goal is to build a speaker recognition system(i.e., verifying if a person is who he claims he is, with his voice).

Where do I start?
Are there any resources out there I can use.
Please provide your insights on how to build such a system.

Have a look into the Fast AI Audio Thread

This notebook should help as well

1 Like

so you can start by creating a GUI like web app or a desktop app using anything you want but i recomment streamlit or qt for fast development and then use speechbrain’s ECAPA-TDNN trained with voxceleb pretrianed model which has both speaker identification and speaker verification so you would have an entire self hosted app which works wouth local apis
also you can finetune it to your local languages and add more accuracy to the model the model it self has 99+ accuracy