Hey everyone! First post here.
In the video for lesson 7, about 18 minutes in @jeremy makes a point about imbalanced datasets. He mentioned a recent paper that looked at some approaches to deal with this and concluded that oversampling from the smaller category wins out consistently.
Has anyone tracked down this paper? I’d love to have a look at it.