Any lessons talk about how to remove vocal from a song by deep learning?

Pre-releases are already online for Part 2: Pre-release part 2 videos

My comment was a little confusion. It’s 76 3D samples, so in reality i have 24 times that in 2D images :slight_smile: yes, segmentations are what you refer to as “less data hungry”. the reason is that you have much more information to back propagate through the network. a segmentation can be seen as a classification for every pixel and as such you get more info from a single sample.

1 Like