ResNet Equivalent for Speech Recognition


I have a project that is based around getting transcripts out of audio. I have no idea were to start. But I assumed since pre trained networks like ResNet-50 exist, there must be something equivalent for audio.

I appreciate any hint or help on where to start.


1 Like

I don’t think this is a Resnet equivalent for STT, but it is a great project.

Thanks a lot. Do you know of any other open source project?

More info here:

thanks a lot much appreciate it

HI sina hope your having a jolly day!

Here are a few forum threads that may have some ideas that could help you.

Cheers mrfabulous1 :smiley::smiley:

This isn’t quite at a usable state, but should give you a good step in the right direction!

1 Like

Hi am facing this error and trying for 20 days but failed to resolve it.

My data set is contained on audio folder. in audio folder there is a subfolder train and valid folder.

I mean in audio folder there are two subfolders, train folder and valid folder. Both the train and valid folder contains on wav files.