The Kaggle’s Quora Pairs competition’s objective is to figure out if 2 questions have the same meaning. This should help users find similar questions and reduce duplicate content on Quora.
One solution could to be create a language model using the dataset. And then forming a Siamese Network (a twin network as shown in image below, taken from this Medium article) that takes in 2 questions and compares the output activations using cross entropy (or Manhattan distance).
How can this Siamese network architecture be implemented using the fast.ai library.
I have trained the language model on the Quora questions dataset.
How can I use fast.ai library modules to create this architecture.