IndicoDataSolutions/Passage

Generic input layer

kjancsi opened this issue · 8 comments

While I understand that the library's main purpose is text processing, it would be great to have a generic input layer as well for sequences of real-valued input vectors.

Newmu commented

Yeah, that makes a lot of sense. I already have this locally as a dummy input layer that exposes the input directly to the model.

An update this weekend will include this and a linear iterator to go through datasets like that!
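
For anyone wondering what such a pass-through layer amounts to, here is a minimal sketch of the idea: a layer with no parameters that just declares a symbolic 3-tensor of real-valued sequences and hands it to the next layer unchanged. The class name and `output` method below are illustrative assumptions, not Passage's actual API.

```python
import theano.tensor as T

class GenericInput(object):
    """Illustrative pass-through input layer for real-valued sequences.

    It declares a 3-tensor (n_timesteps, n_examples, size) and exposes it
    directly to the next layer. Names here are hypothetical, not Passage's API.
    """

    def __init__(self, size):
        self.size = size          # dimensionality of each input vector
        self.input = T.tensor3()  # (n_timesteps, n_examples, size)

    def output(self, dropout_active=False):
        # No parameters, no transformation: just hand the symbolic
        # input straight to whatever layer comes next.
        return self.input
```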

Thanks, sounds good. Any estimate when this would be available?

Hello, just checking on progress: any idea when this feature will land in the code? Thanks.

Newmu commented

Sorry for the delay, I was on other projects. Feature added with #22.
Example usage applying an RNN to MNIST (reading left to right) is here; it achieves accuracy comparable to or slightly better than a fully connected network.
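
As a rough illustration of the data preparation behind that kind of example, each 28x28 digit can be treated as a sequence of 28 real-valued row vectors. The NumPy reshape below is library-agnostic; the commented-out model construction uses placeholder layer names and is only a guess at how a generic input layer might be wired in, not the code from the linked example.

```python
import numpy as np

def to_sequences(images):
    """Turn flat MNIST digits into sequences of real-valued row vectors.

    `images` is assumed to be an (n_examples, 784) float array in [0, 1].
    Returns (n_examples, 28, 28): 28 timesteps, each a 28-dim input vector.
    """
    return images.reshape(-1, 28, 28)

# Hypothetical usage with a generic input layer and an RNN classifier;
# layer and argument names are placeholders, not Passage's documented API.
# model = RNN(layers=[GenericInput(size=28),
#                     GatedRecurrent(size=128),
#                     Dense(size=10, activation='softmax')],
#             cost='cce')
# model.fit(to_sequences(train_images), train_labels)
```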

No problem at all. Thanks a lot for adding this feature and including the MNIST example. Passage is shaping up to be a really nifty tool for RNNs. One question regarding the implementation: does this work for variable-length sequences as well?

Newmu commented

Thanks!

Passage currently needs the iteration dimension to be the same length for all sequences being trained on in a minibatch. For text we handle that by padding sequences with a "PAD" token so the model can just learn to deal with it. For real-valued data, the simplest option is to zero-pad the beginning of all sequences out to the same length; would this work for you?
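
A small, library-agnostic NumPy sketch of that zero-padding scheme; the helper name `pad_front` is chosen here for illustration:

```python
import numpy as np

def pad_front(seqs, n_features):
    """Left-pad a list of (length_i, n_features) float arrays with zeros
    so every sequence in the minibatch has the same number of timesteps."""
    max_len = max(len(s) for s in seqs)
    out = np.zeros((len(seqs), max_len, n_features), dtype='float32')
    for i, s in enumerate(seqs):
        # data goes at the end of the time axis, zeros at the front
        out[i, max_len - len(s):, :] = s
    return out

# Example: three sequences of lengths 3, 5, and 2 with 4 features each
batch = pad_front([np.random.randn(n, 4) for n in (3, 5, 2)], n_features=4)
print(batch.shape)  # (3, 5, 4); transpose if the model expects time-major input
```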

Thanks, I'll give it a go. For text, does the padding contribute to the loss function, or do you use a mask in the loss calculation?

Newmu commented

When doing non-sequence prediction, padding the input won't affect the loss calculation. For sequence prediction, we currently pad, which is definitely not optimal. Mask support is a significant refactoring that's in progress, and it's one of the main reasons why sequence prediction support is on its own branch and not yet pushed to master.
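
For reference, here is a library-agnostic NumPy sketch of what masking the loss means in practice: padded timesteps are multiplied by zero so they contribute nothing, and the average is taken over real timesteps only. This shows the general idea, not Passage's in-progress implementation.

```python
import numpy as np

def masked_cross_entropy(probs, targets, mask):
    """Per-timestep categorical cross-entropy with padded steps masked out.

    probs:   (n_steps, n_classes) predicted distributions
    targets: (n_steps,) integer class labels (arbitrary at padded steps)
    mask:    (n_steps,) 1.0 for real timesteps, 0.0 for padding
    """
    step_losses = -np.log(probs[np.arange(len(targets)), targets] + 1e-8)
    # Padded positions contribute nothing, and the mean is taken over
    # real timesteps only rather than over the padded length.
    return (step_losses * mask).sum() / mask.sum()
```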