How does decreasing Random Forest samples size increase correlation?

javiergaitan · April 26, 2020, 7:40pm

@mlnoob: I think this is correctly stated in the wiki of the lesson:

“If you use a smaller samples, say set_rf_samples(), you’ll overfit less […] Therefore, although you actually have less accuracy per tree/estimator, but the correlation between the trees will be also be less, and your RF model can generalize (make a prediction on new data) better”