End-to-end speaker recognition system

My goal is to build a speaker recognition system (i.e., one that verifies from a person's voice whether they are who they claim to be).

Where do I start?
Are there any resources out there I can use?
Please provide your insights on how to build such a system.

Have a look at the Fast AI Audio Thread.

This notebook should help as well
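To make the verification step concrete: once you have a model that turns an utterance into a fixed-length embedding vector, checking an identity claim usually comes down to comparing the embedding of the new utterance against the enrolled speaker's profile. Below is a rough sketch of just that comparison, not a full system. It assumes the embeddings already come from some speaker encoder you have trained or downloaded; the function names (`enroll`, `verify`) and the threshold value of 0.7 are made up for illustration, and in practice the threshold would be tuned on held-out data.

```python
import numpy as np

def enroll(embeddings):
    """Average several utterance embeddings into one unit-length speaker profile.

    `embeddings` is assumed to be a list of equal-length vectors produced by
    a speaker encoder (not shown here).
    """
    profile = np.mean(np.asarray(embeddings, dtype=float), axis=0)
    return profile / np.linalg.norm(profile)

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors, in the range [-1, 1]."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def verify(profile, test_embedding, threshold=0.7):
    """Accept the identity claim if the similarity reaches the threshold.

    The default threshold is an arbitrary placeholder, not a recommended value.
    """
    return cosine_similarity(profile, test_embedding) >= threshold

# Made-up 3-dimensional vectors standing in for real encoder outputs.
alice_profile = enroll([[0.9, 0.1, 0.0], [1.0, 0.0, 0.1]])
claim_accepted = verify(alice_profile, [0.95, 0.05, 0.05])
claim_rejected = verify(alice_profile, [0.0, 1.0, 0.0])
```

Most of the actual work is in training an encoder whose embeddings sit close together for the same speaker and far apart for different speakers.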

