First of all, thanks for your great PyTorch tutorial! It's a great resource for beginners.
I have a question about the way you use the output of a bidirectional model.
From this code snippet (pytorch-tutorial/tutorials/02-intermediate/bidirectional_recurrent_neural_network/main.py, lines 54 to 58 at 4896cef), you take the LAST hidden state of both the forward and backward LSTM.
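If I read the snippet correctly, the pattern in question looks roughly like the sketch below (variable names and sizes are mine, not the tutorial's): the forward pass slices `out[:, -1, :]`, i.e. the last time step of BOTH directions, before the linear layer.

```python
import torch
import torch.nn as nn

# Illustrative sizes, not the tutorial's actual hyperparameters
batch, seq_len, input_size, hidden_size, num_classes = 4, 10, 8, 16, 3

lstm = nn.LSTM(input_size, hidden_size, batch_first=True, bidirectional=True)
fc = nn.Linear(2 * hidden_size, num_classes)

x = torch.randn(batch, seq_len, input_size)
out, _ = lstm(x)            # out: (batch, seq_len, 2 * hidden_size)
logits = fc(out[:, -1, :])  # slices BOTH directions at the last time step
```

At the last time step, the backward direction's state has only processed the final input, which is what the concern below is about.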
I think the image below illustrates what the code does; please check whether your code really corresponds to it. As the linked source puts it: "if we pick the output at the last time step, the reverse RNN will have only seen the last input (x_3 in the picture). It'll hardly provide any predictive power." (source)
Is this the way you intended?
I think a more information-rich way of using the output of a bidirectional LSTM is to concatenate the last hidden state of the forward LSTM with the first hidden state of the reverse LSTM, so that both hidden states have seen the entire input.
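For concreteness, here is a minimal sketch of the suggested slicing (names and sizes are illustrative, not from the tutorial). With `batch_first=True` and `bidirectional=True`, `nn.LSTM` returns `out` of shape `(batch, seq_len, 2 * hidden_size)`, with the forward direction in the first `hidden_size` channels and the backward direction in the last `hidden_size`:

```python
import torch
import torch.nn as nn

# Illustrative sizes only
batch, seq_len, input_size, hidden_size = 4, 10, 8, 16

lstm = nn.LSTM(input_size, hidden_size, batch_first=True, bidirectional=True)
x = torch.randn(batch, seq_len, input_size)
out, (h_n, c_n) = lstm(x)  # out: (batch, seq_len, 2 * hidden_size)

# Last time step of the FORWARD direction: has seen the whole sequence
fwd_last = out[:, -1, :hidden_size]
# First time step of the BACKWARD direction: has also seen the whole sequence
bwd_first = out[:, 0, hidden_size:]

features = torch.cat([fwd_last, bwd_first], dim=1)  # (batch, 2 * hidden_size)
```

Note that for a single-layer bidirectional LSTM these two slices coincide with `h_n[0]` and `h_n[1]`, so using `h_n` directly achieves the same thing.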
Thanks in advance!
You are right. The output should indeed be the concatenation of the last hidden state of the forward LSTM and the first hidden state of the reverse LSTM; otherwise backpropagation will be wrong.
JiahaoYao added a commit to JiahaoYao/pytorch-tutorial that referenced this issue on May 12, 2019.