The function torch.nn.functional.softmax takes two parameters: input and dim. According to its documentation, the softmax operation is applied to all slices of input along the specified dim, and rescales them so that the elements lie in the range (0, 1) and sum to 1.
Let input be:
input = torch.randn((3, 4, 5, 6))
Suppose I want the following sum to be a tensor in which every entry is 1:
sum = torch.sum(input, dim = 3) # sum's size is (3, 4, 5)
How should I apply softmax?
softmax(input, dim = 0) # Way Number 0
softmax(input, dim = 1) # Way Number 1
softmax(input, dim = 2) # Way Number 2
softmax(input, dim = 3) # Way Number 3
My intuition tells me it is the last one, but I am not sure. English is not my first language, and the use of the word "along" seemed confusing to me.
Since I am not very clear on what "along" means, to restate the example: given a tensor of size (s1, s2, s3, s4), I want the sum along the last dimension to consist entirely of ones, just as in the code above.
Answer
The easiest way I can think of to explain it is this: say you are given a tensor of shape (s1, s2, s3, s4) and, as you mentioned, you want the sum of all the entries along the last axis to be 1:
sum = torch.sum(input, dim = 3) # input is of shape (s1, s2, s3, s4)
Then you should call the softmax as:
softmax(input, dim = 3)
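As a quick sanity check (a minimal sketch with random values; the variable names are only illustrative), you can confirm that after applying softmax over dim 3, the entries along that dim sum to 1:
import torch
import torch.nn.functional as F

input = torch.randn((3, 4, 5, 6))
output = F.softmax(input, dim=3)                     # rescale along the last axis
sums = torch.sum(output, dim=3)                      # shape (3, 4, 5)
print(torch.allclose(sums, torch.ones_like(sums)))   # prints True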
To make this easier to see, you can think of a 4d tensor of shape (s1, s2, s3, s4) as a 2d tensor or matrix of shape (s1*s2*s3, s4). Now, if you want each column of that matrix to sum to 1, normalize along axis 0; if you want each row to sum to 1, normalize along axis 1. You can simply call the softmax function on the 2d tensor as follows:
softmax(input, dim = 0) # normalizes values along axis 0, so each column sums to 1
softmax(input, dim = 1) # normalizes values along axis 1, so each row sums to 1
You can see the example that Steven mentioned in his answer.
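To make the row/column behaviour concrete, here is a small sketch (the (3, 4) shape is only an assumed example, not taken from the question):
import torch
import torch.nn.functional as F

m = torch.randn(3, 4)
print(torch.sum(F.softmax(m, dim=0), dim=0))  # ones of shape (4,): each column sums to 1
print(torch.sum(F.softmax(m, dim=1), dim=1))  # ones of shape (3,): each row sums to 1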