[Kaggle] TalkingData AdTracking Fraud Detection Challenge


Now as the competition ended. Just wondered if anyone used DNN for the structured data problem, as taught in Part 1, with Rossman example?
Did DNN score good for you?
Would be great if you could share notebook / kernel. Thanks!
I couldn’t make the proc_df to work, also struggled to load the whole 200 million rows of data, kernel was restarting due to memory i believe.

(Quan Tran) #2

I tried fastai DNN with a custom loss function to deal with this imbalanced dataset but due to time constraint I didn’t have much time to experiment with it more. My validation roc auc score is .973 (trained on 3 mil records of day 8, validated on 2 mil records day 9 with a bunch of new features). You can take a look at my repo here:


Thanks for sharing!