The lesson4, Jeremy demonstrates the embedding matrix technique on a large dataset.
I am wondering if there exists lower bound for the data size below which DL would not be effective(compared to traditional ML technique)?
If yes, what do empirical experiments tell us about a rule of thumb threshold?