Deep learning on structured data: requirement on training data quantity

In lesson 4, Jeremy demonstrates the embedding matrix technique on a large dataset.

I am wondering whether there is a lower bound on dataset size below which DL would not be effective (compared to traditional ML techniques).

If so, what do empirical experiments suggest as a rule-of-thumb threshold?