ConvLSTM does not concat the hidden state from the last round
tomfox1900 opened this issue · 7 comments
In the structure presented in the paper, the hidden state from the last round is concatenated with the input before the remaining operations. But it seems your LSTM does not use the hidden information from the previous round.
Can you perhaps show me where you think this is the case?
The last hidden state of the generator LSTM is concatenated with the input and fed into the inference LSTM here:
https://github.com/wohlert/generative-query-network-pytorch/blob/master/gqn/generator.py#L146
So the input fed into the inference LSTM should be the concatenation of h_(i-1), h_g, x, v, and r. What I'm saying is that you seem to have missed the h_(i-1) part, which is the hidden state from the last round of the inference LSTM. The same applies to the generator LSTM.
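For reference, this is a minimal sketch of the recurrence described in the paper, where the previous hidden state is concatenated with the input along the channel dimension before the gate convolution. The class and argument names are illustrative, not the repository's actual code:

```python
import torch
import torch.nn as nn

class ConvLSTMCell(nn.Module):
    """Minimal ConvLSTM cell: the previous hidden state is concatenated
    with the input along the channel dimension before the gate convolutions."""
    def __init__(self, in_channels, hidden_channels, kernel_size=5):
        super().__init__()
        padding = kernel_size // 2
        # A single convolution produces all four gates at once.
        self.gates = nn.Conv2d(in_channels + hidden_channels,
                               4 * hidden_channels, kernel_size, padding=padding)

    def forward(self, x, state):
        hidden, cell = state
        # Concatenate the input with the hidden state from the previous round.
        stacked = torch.cat([x, hidden], dim=1)
        forget_gate, input_gate, output_gate, candidate = torch.chunk(
            self.gates(stacked), 4, dim=1)
        cell = (torch.sigmoid(forget_gate) * cell
                + torch.sigmoid(input_gate) * torch.tanh(candidate))
        hidden = torch.sigmoid(output_gate) * torch.tanh(cell)
        return hidden, cell
```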
Another question I have is about the sigma scheme. I noticed you updated shepardmetzler.py so that the data range is now [0, 1], but the sigma scheme is still set to [0.7, 2]; I am not sure whether this affects the log-likelihood estimation.
The h_(i-1) part is fed into the LSTM through the state variable, so the information is available to the network. You can see this at the end of the line that I linked.
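Roughly, the claim is the following pattern (illustrative shapes, reusing the ConvLSTMCell sketch from above): the previous hidden state enters each step through the state tuple, so the network still sees h_(i-1) even without an explicit concat in the caller:

```python
import torch

# Illustrative shapes only.
batch, in_channels, hidden_channels, size = 1, 3, 8, 16
cell = ConvLSTMCell(in_channels, hidden_channels)  # cell from the sketch above
h = torch.zeros(batch, hidden_channels, size, size)
c = torch.zeros_like(h)
for step in range(4):
    x = torch.randn(batch, in_channels, size, size)
    # The previous h and c are passed through the state tuple, so the
    # network carries information between rounds.
    h, c = cell(x, (h, c))
```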
The sigma scheme is the same as in the paper. The authors never specify the data range, but it is customary to use floating-point values in [0, 1].
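For concreteness, a sketch of how the annealed sigma would enter the likelihood, assuming a linear schedule between the paper's published endpoints; the step count and tensor shapes here are illustrative:

```python
import torch
from torch.distributions import Normal

def pixel_sigma(step, sigma_i=2.0, sigma_f=0.7, anneal_steps=200_000):
    # Linear anneal from sigma_i down to sigma_f; the endpoints are the
    # published GQN values, the number of steps is illustrative.
    fraction = min(step / anneal_steps, 1.0)
    return sigma_i + fraction * (sigma_f - sigma_i)

# With images in [0, 1], the reconstruction term is evaluated as a
# Gaussian log-likelihood under the annealed sigma.
x = torch.rand(1, 3, 64, 64)     # target image, range [0, 1]
x_mu = torch.rand(1, 3, 64, 64)  # decoder mean
log_likelihood = Normal(x_mu, pixel_sigma(step=0)).log_prob(x).sum()
```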
Check the following two lines:
https://github.com/wohlert/generative-query-network-pytorch/blob/master/gqn/generator.py#L53
https://github.com/wohlert/generative-query-network-pytorch/blob/master/gqn/generator.py#L62
The hidden state is fed in but never used; it is directly overwritten by the newly computed hidden state.
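Schematically, the claim is the following (illustrative code, not the repository's exact lines; `self.gates` and `self.update` are hypothetical helpers):

```python
import torch

# Buggy pattern: `hidden` arrives through `state` but never reaches the
# gate computation, so the returned hidden state depends on the input alone.
def forward_buggy(self, x, state):
    hidden, cell = state
    gates = self.gates(x)  # previous hidden state ignored here
    return self.update(gates, cell)

# Fixed pattern: concatenate the previous hidden state with the input
# before the gate convolution, matching the paper's recurrence.
def forward_fixed(self, x, state):
    hidden, cell = state
    gates = self.gates(torch.cat([x, hidden], dim=1))
    return self.update(gates, cell)
```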
Yes, I see that is a mistake.
It has been fixed in the recent commit. Thank you for posting this issue.