We have an entry for Germeval (binary task only), but I’m fairly confident that it is not that great. Unfortunately I saw the competition late and had a very heavy workload towards the end, which left little time to do more. On top of that there were some technical difficulties in the final days (heatwave in Germany + computers that crunch for 3-4 days = bad combination). We deliberately kept it very vanilla ULMFiT: a German Wikipedia LM with a 50k-token vocabulary, about 300k self-collected unlabeled tweets for fine-tuning, and only the provided training data for the classifier. No ensembling. The LM and the Twitter model are pretty decent, I think (perplexity below 28 and below 18 respectively). The classifier eventually converged (I underestimated this step) and we got an F1 of about 0.8 on the validation set, which I’d have been very happy with, but a rather disappointing score on the test set. I’ll discuss the final results after the event (it’s this weekend). If anyone else from these forums attends, shoot me a PM and let’s meet/talk.
Even with the very hectic finish I’d do it again. Lots of lessons learned. I’m confident that the results can be improved a good bit, and I have some ideas but little time.
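
For anyone less familiar with the two metrics quoted above, here is a minimal sketch of how they are usually computed. It is not our competition code. It assumes the LM loss is a mean cross-entropy in nats (natural log), and that F1 is taken for a single positive class; the shared task's official scoring may average differently. The function names `perplexity` and `binary_f1` are made up for illustration.

```python
import math


def perplexity(mean_cross_entropy_nats: float) -> float:
    """Perplexity is exp(loss) when the loss is mean cross-entropy in nats (assumption)."""
    return math.exp(mean_cross_entropy_nats)


def binary_f1(y_true, y_pred, positive=1) -> float:
    """F1 for one positive class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are 0 (or undefined), report 0.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # A validation loss of about 3.33 nats corresponds to a perplexity of about 28.
    print(round(perplexity(math.log(28)), 2))
    # Toy labels, purely illustrative.
    print(binary_f1([1, 1, 0, 0], [1, 0, 0, 1]))
```

So a perplexity below 28 just means a mean validation loss below roughly 3.33 nats per token.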