Question about LanguageModel Y label

Hi, I noticed that the underlying data that we got after using the LanguageModelData results in a Y label that is flatten to a 1-dimension tensor. Why is that so since the X is of shape numOfWordsxbatch_size?

Should the Y be of the same size to match up?

Thanks!

From here: