Mnist classification problem
lonelygoatherd opened this issue · 4 comments
Hi, I want to ask a simple problem about the mnist classification example. Images in mnist are treated as sequences by expanding them to 1D, then each image gets a probability distribution. But images have no relations, so what is the meaning of tcn here when you process each sequence separately. It looks like a fully connected layer. Did I misunderstand the process procedure?
Oh, we flatten each image to 1D. For example, a 28x28 image is converted to a 784x1 sequence (i.e., length 784). So each "time step" to the TCN is essentially a single pixel, not an image.
-
"[...] the hidden layer's input is the previous layer's output and has no direct relation with the origin input X" --> Correct. That is how any convolutional networks operate.
-
"[...] permitting the input tensor" ---> Do you mean shuffling the data? That is just to ensure that the training data are sufficiently mixed and prevent the model from overfitting too early. You can also turn it off.
-
What do you mean by partition strategies? Do you mean dilations?