How to infer the shape of the output when connecting convolution layer with dense layers?

Question

I am trying to construct a Convolutional Neural Network using pytorch and can not understand how to interpret the input neurons for the first densely connected layer. Say, for example, I have the following architecture: Here X would be the number of neurons in the first linear layer. So, do I need to keep track of the shape of the

Accepted Answer

An easy solution would be to use LazyLinear layer: https://pytorch.org/docs/stable/generated/torch.nn.LazyLinear.html.According to the documentation:A torch.nn.Linear module where in_features is inferred &#8230; They will be initialized after the first call to forward is done and the module will become a regular torch.nn.Linear module. The in_features argument of the Linear is inferred from the input.shape[-1].

Advertisement

Answer