I am using Random Forest for one of my datasets (Classification problem), I see that
Training Accuracy is 97.9
Validation Accuracy is 95.5
Test Accuracy is 80.4
I have couple of question in random forest interpretation.
I would like to check the confidence of my trees, so I checked
%time preds = np.stack([t.predict(X_valid) for t in m.estimators_])
total number of estimators is 250 here. I see preds.shape as (250, 18681)
what is 18681 here?
To check my confidence I calculated mean and std of the preds,
I see for mean and std - (0.0, 0.0)
What it does mean? Should I take any other approach rather than this for classification problem?