Deep learning-based audio processing with PyTorch

Hello all,
This is Sai Krishna. I'm currently doing research on deep learning-based audio processing, and I want to start with a basic neural network that I can train on audio data. How do I load a dataset that contains audio files and TextGrid files? If anyone has worked on audio processing with deep learning, please let me know. Any help would be appreciated.
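
To make the question concrete, here is a rough sketch of the file handling I have in mind, not a finished loader. It rests on several assumptions: each audio file is a .wav with a matching .TextGrid of the same name in the same folder, the TextGrid files are UTF-8 and in Praat's long text format, and only interval tiers matter. The names pair_files and read_intervals are hypothetical helpers of my own, not part of PyTorch or any library.

```python
import re
from pathlib import Path

# Matches one interval (xmin, xmax, text) in Praat's long TextGrid format.
# Assumption: short-format files and point tiers are not handled.
INTERVAL_RE = re.compile(
    r'xmin\s*=\s*([\d.eE+-]+)\s*\n\s*xmax\s*=\s*([\d.eE+-]+)\s*\n\s*text\s*=\s*"(.*)"'
)


def pair_files(root):
    """Return sorted (wav_path, textgrid_path) pairs that share a file stem."""
    root = Path(root)
    pairs = []
    for wav in sorted(root.glob("*.wav")):
        grid = wav.with_suffix(".TextGrid")
        if grid.exists():  # skip audio files that have no annotation
            pairs.append((wav, grid))
    return pairs


def read_intervals(textgrid_path, encoding="utf-8"):
    """Return a list of (start_sec, end_sec, label) from all interval tiers."""
    text = Path(textgrid_path).read_text(encoding=encoding)
    return [(float(a), float(b), label) for a, b, label in INTERVAL_RE.findall(text)]
```

My guess is that these pairs could then back a torch.utils.data.Dataset, with __len__ returning the number of pairs and __getitem__ loading one waveform (for example with torchaudio.load, which returns the waveform tensor and the sample rate) together with its intervals. I have not tried that part yet, so corrections are welcome.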

