I would like to implement a multi-classification-model, that classifies reviews (e.g. from trustpilot.com) into different categories (e.g. customer experience, product quality, online show, delivery, on-site store experience).
Is there any model on hugging face you suggest for German texts? I think it won’t be a good idea to use a model that was pretrained on English text and finetune this model on German texts?!
Otherwise, I would say that fine-tuning an English model on German texts could also be a good idea. Drawing a parallel between NLP and CV (computer vision), one can start with Image Net pretrained weights and fine-tune to get good results in quite different domains, like medical images classification. Because learning basic things like shapes, detecting edges, etc. stays relevant even for a new domain. So maybe it would work for languages with similar grammar as well.
Also, one can try to use a seq2seq model to translate German into English, and run a classifier on a translated sentence.
Here are German (and multi-lingual) models that have already been trained/fine-tuned on some text classification tasks, but I am not sure if they will be more helpful than the general LMs for your use case: Models - Hugging Face
If you want to consider models (regardless of language) that were already fine-tuned on a dataset more similar to yours, you could check models trained on Amazon reviews: Models - Hugging Face
I think it could be tricky though to fine-tune an English model on German texts (called cross-lingual transfer learning) without adjusting the vocabulary of the model.
This works quite well in my experience. You could use e.g. with this translation model from hugging face
zero shot classification means you want the model to classify into categories on which it wasn’t trained.
With “default” classifiers you have to have labeled training data for each class. E.g. trainingdata with texts which you classify as A, texts which you classify as B. Then you model is able to classify into the categories A and B but cannot classify C.
With zero shot classifying you can grab a model and use it to classify into a bunch of given labels.
There are plenty tutorials on zero shot classification. You could start here: NLP Town