Any lessons talk about how to remove vocal from a song by deep learning?

pietz · July 11, 2017, 3:36pm

Pre-releases are already online for Part 2: Pre-release part 2 videos

My comment was a little confusion. It’s 76 3D samples, so in reality i have 24 times that in 2D images yes, segmentations are what you refer to as “less data hungry”. the reason is that you have much more information to back propagate through the network. a segmentation can be seen as a classification for every pixel and as such you get more info from a single sample.