mat1 and mat2 shapes cannot be multiplied (128×4 and 128×64)

Question

I could not work out why mat1 from the convolutional network is 128×4 and not 4×128. The following is the convolutional network used: The model training code is as follows: The error log shown is: mat1 should be the output of the convolutional network after it is flattened, and mat2 is the linear …

Accepted Answer

Here are the output shapes after each layer (these shapes imply a single unbatched input of 2x12x12):

    Conv2d(2,32,kernel_size=3,padding=1)   # 32x12x12
    MaxPool2d(2,2)                         # 32x6x6
    Conv2d(32,64,kernel_size=3,padding=1)  # 64x6x6
    MaxPool2d(2,2)                         # 64x3x3
    Conv2d(64,128,kernel_size=3,padding=1) # 128x3x3
    MaxPool2d(2,2,padding=1)               # 128x2x2
    Flatten()                              # 128x4

nn.Flatten() keeps the first dimension and flattens the rest (start_dim=1 by default), so the 128x2x2 tensor becomes 128x4 rather than 4x128.

You'll need to change the kernel parameters and padding sizes if you wish to obtain an output of a given shape. This link might help in calculating the output shapes after each layer.

Another approach is to transpose the flattened tensor and pass it into the Linear layers. You'll need to add the transpose line to your forward function, as below:

    import torch
    import torch.nn as nn

    class NN(nn.Module):
        def __init__(self):
            super(NN, self).__init__()

            self.layer1 = nn.Sequential(
                torch.nn.Conv2d(2, 32, kernel_size=3, padding=1),
                torch.nn.ReLU(),
                torch.nn.MaxPool2d(2, 2))

            self.layer2 = nn.Sequential(
                torch.nn.Conv2d(32, 64, kernel_size=3, padding=1),
                torch.nn.ReLU(),
                torch.nn.MaxPool2d(2, 2))

            self.layer3 = nn.Sequential(
                torch.nn.Conv2d(64, 128, kernel_size=3, padding=1),
                torch.nn.ReLU(),
                torch.nn.MaxPool2d(2, 2, padding=1))

            self.flattened_tensor = nn.Flatten()

            self.linear_layer = nn.Sequential(
                torch.nn.Linear(128, 64),
                torch.nn.ReLU(),
                torch.nn.Linear(64, 4)
            )

        def forward(self, inp):
            conv_output = self.layer3(self.layer2(self.layer1(inp)))
            flattened_output = self.flattened_tensor(conv_output)   # 128x4

            # Swap the two dimensions so the 128 features come last: 128x4 -> 4x128
            transposed_matrix = torch.transpose(flattened_output, 0, 1)

            linear_output = self.linear_layer(transposed_matrix)
            return linear_output

    model = NN()
    # arr is the input tensor from the question's training code (not shown here)
    output = model(arr)
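The per-layer shapes listed above can be checked by pushing a dummy tensor through the same layers. The sketch below assumes an unbatched 2x12x12 input (inferred from the shapes in the answer, not taken from the asker's data) and a PyTorch version that accepts unbatched input for Conv2d and MaxPool2d. The names `conv_stack`, `trace_shapes`, and `sample` are illustrative only. As one possible alternative that is not part of the accepted answer, it also shows what Flatten produces once a batch dimension is added.

```python
import torch
import torch.nn as nn

# Same convolutional layers as in the answer, collected in one Sequential
# so each intermediate shape can be inspected.
conv_stack = nn.Sequential(
    nn.Conv2d(2, 32, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(2, 2),
    nn.Conv2d(32, 64, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(2, 2),
    nn.Conv2d(64, 128, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(2, 2, padding=1),
)
flatten = nn.Flatten()  # start_dim=1 by default: dimension 0 is left untouched


def trace_shapes(x):
    """Run x through conv_stack and record the output shape of every layer."""
    shapes = []
    with torch.no_grad():
        for layer in conv_stack:
            x = layer(x)
            shapes.append((type(layer).__name__, tuple(x.shape)))
    return x, shapes


# Assumption: one unbatched sample with 2 channels and 12x12 pixels.
sample = torch.randn(2, 12, 12)

unbatched_out, unbatched_shapes = trace_shapes(sample)
for name, shape in unbatched_shapes:
    print(f"{name:<10} {shape}")
unbatched_flat = flatten(unbatched_out)
print("Flatten   ", tuple(unbatched_flat.shape))  # (128, 4)

# With a leading batch dimension, Flatten keeps the batch axis instead.
batched_out, _ = trace_shapes(sample.unsqueeze(0))
batched_flat = flatten(batched_out)
print("Flatten   ", tuple(batched_flat.shape))  # (1, 512)
```

With the batched input, the flattened tensor has 512 features per sample, so the first Linear layer would need in_features=512 instead of 128. Whether that or the transpose fits better depends on what the asker's data actually represents.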

Answer