Hello, I am using BERT on the Sentiment Analysis on Movie Reviews dataset from a Kaggle competition held about four years ago (https://www.kaggle.com/c/sentiment-analysis-on-movie-reviews). The dataset has around 150K training examples and a public test set of 67K examples, which is used for the public leaderboard.
After training for 2 epochs (which took about 3 hours), I got a score of 0.688 on the public leaderboard, which would place in the top 5 there (the private leaderboard is no longer available). It seems to work, but I will try tuning the learning rate to see if I can get a better result.
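For anyone who wants to repeat the learning-rate tuning, here is a minimal sketch of how such a sweep could be organized. It is not the script I ran. The names `sweep_learning_rates`, `best_learning_rate`, and `train_and_eval` are hypothetical, and the candidate values (2e-5, 3e-5, 5e-5) are just the rates commonly tried for BERT fine-tuning. It assumes you provide a function that fine-tunes with a given learning rate and returns accuracy on a held-out slice of the training data.

```python
from typing import Callable, Dict, Sequence

# Assumed candidates: learning rates commonly tried when fine-tuning BERT.
CANDIDATE_LRS: Sequence[float] = (2e-5, 3e-5, 5e-5)


def sweep_learning_rates(
    train_and_eval: Callable[[float], float],
    learning_rates: Sequence[float] = CANDIDATE_LRS,
) -> Dict[float, float]:
    """Run one fine-tuning job per learning rate and collect its score.

    `train_and_eval` is a hypothetical, user-supplied function that
    fine-tunes the model with the given learning rate and returns
    accuracy on a held-out validation split.
    """
    return {lr: train_and_eval(lr) for lr in learning_rates}


def best_learning_rate(scores: Dict[float, float]) -> float:
    """Return the learning rate with the highest validation score."""
    return max(scores, key=scores.get)
```

To use it, wrap your own fine-tuning run in a function and pass it in, for example `scores = sweep_learning_rates(my_train_and_eval)` followed by `best_learning_rate(scores)`. Selecting on a held-out split keeps the Kaggle public leaderboard as the final check only.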
I would like to share this in case someone wants to run the same experiment, so that we can compare results.