Using wide and deep networks for text generation

I was wondering what the implications would be of using wide and deep networks for text generation.