I am a novice in NLP. Recently I need to get feature from a sentence. I notice there are different ways to get these features from LSTM or RNN.
- Get the last output from the network.
- Get the last hidden from the network.
- Get the output that before padding input.
Since the sentence is variable length, these features are different from each other. Method 1 and Method may contain some padding input. I wonder which one is the best to extract sentence’s features.