ResNet Equivalent for Speech Recognition

sina · December 12, 2019, 2:12pm

Hi,

I have a project that is based around getting transcripts out of audio. I have no idea were to start. But I assumed since pre trained networks like ResNet-50 exist, there must be something equivalent for audio.

I appreciate any hint or help on where to start.

Thanks,
Sina

Johnpal · December 13, 2019, 2:34am

I don’t think this is a Resnet equivalent for STT, but it is a great project.

sina · December 13, 2019, 7:51pm

Thanks a lot. Do you know of any other open source project?

Johnpal · December 13, 2019, 9:16pm

http://opennmt.net/features/

More info here:

https://modelzoo.co/category/audio-speech

sina · December 13, 2019, 9:43pm

thanks a lot much appreciate it

mrfabulous1 · December 14, 2019, 3:17pm

HI sina hope your having a jolly day!

Here are a few forum threads that may have some ideas that could help you.

Cheers mrfabulous1

KevinB · December 17, 2019, 5:59pm

This isn’t quite at a usable state, but should give you a good step in the right direction!

Ubaid · May 27, 2021, 9:56pm

Hi am facing this error and trying for 20 days but failed to resolve it.

My data set is contained on audio folder. in audio folder there is a subfolder train and valid folder.

Ubaid · May 27, 2021, 9:57pm

I mean in audio folder there are two subfolders, train folder and valid folder. Both the train and valid folder contains on wav files.