Training a language model to predict characters

For future readers, I had the same question and answered it in this thread: