@mlnoob: I think this is correctly stated in the wiki of the lesson:
“If you use smaller samples, say with set_rf_samples(), you’ll overfit less […] Therefore, although you have less accuracy per tree/estimator, the correlation between the trees will also be lower, and your RF model can generalize (make predictions on new data) better.”
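
If you want to see the trade-off for yourself, here is a rough sketch. It is not the lesson's code: I'm assuming scikit-learn 0.22 or later and using its `max_samples` parameter as a stand-in for the course's `set_rf_samples()`, on synthetic data from `make_regression`. The helper `mean_tree_correlation` is just a name I made up for this example.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split


def mean_tree_correlation(forest, X):
    """Average pairwise Pearson correlation between the trees' predictions."""
    # One row of predictions per tree.
    preds = np.stack([tree.predict(X) for tree in forest.estimators_])
    corr = np.corrcoef(preds)
    # Keep only the entries above the diagonal (each pair of trees once).
    upper = np.triu_indices_from(corr, k=1)
    return corr[upper].mean()


# Synthetic data, purely for illustration.
X, y = make_regression(n_samples=2000, n_features=20, n_informative=10,
                       noise=20.0, random_state=0)
X_train, X_valid, y_train, y_valid = train_test_split(
    X, y, test_size=0.25, random_state=0)

results = {}
# max_samples=None: each tree draws a bootstrap sample as large as the training set.
# max_samples=0.1: each tree sees only 10% of the training rows
# (assumed here to play the role of set_rf_samples).
for label, max_samples in [("full bootstrap", None), ("10% per tree", 0.1)]:
    forest = RandomForestRegressor(n_estimators=40, max_samples=max_samples,
                                   bootstrap=True, random_state=0)
    forest.fit(X_train, y_train)
    tree_scores = [r2_score(y_valid, tree.predict(X_valid))
                   for tree in forest.estimators_]
    results[label] = {
        "tree_r2": float(np.mean(tree_scores)),          # accuracy per tree
        "tree_corr": float(mean_tree_correlation(forest, X_valid)),
        "forest_r2": forest.score(X_valid, y_valid),     # accuracy of the ensemble
    }

for label, stats in results.items():
    print(label, stats)
```

With the smaller sample you should see both the per-tree R² and the correlation between trees go down. Whether the forest as a whole ends up better depends on the data and on how far you shrink the sample, so treat this as a way to inspect the trade-off rather than as proof that smaller samples always win.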