Fully connected layers

I have read somewhere that the rule of thumb in a classification model is to use maximum 3 fully connected layers in total. Can someone tell me if this is correct and why?