I think the loss function is cross_entropy. @nikhil_no_1
1 Like
I just want to experiment Jeremy’s method, so I didn’t obey the rules. I trained the language model with pretrained on and got pretty good result.
Total time: 39:17
epoch train_loss valid_loss accuracy
1 3.580200 3.459874 0.397804
2 3.384912 3.311979 0.412261
learn.predict('How to learn Chinese', n_words=30, temperature=0.75)
output: ‘How to learn Chinese from China ? xxbos How do you feel about your child having a friend who talks to you ? What can you tell him about him ?’
1 Like