The original video features also need feed into eLSTM and dLSTM
moushuai opened this issue · 0 comments
moushuai commented
Base on the paper, the original video features also need feed into eLSTM and dLSTM and then feed it to Discriminator(cLSTM). But this implementation seems feed the original features directly into Discriminator after a linear_compress layer. Is this a Bug here ?