How to understand creating leaf tensors in PyTorch?

Question

From PyTorch documentation: But why are e and f leaf Tensors, when they both were also cast from a CPU Tensor, into a Cuda Tensor (an operation)? Is it because Tensor e was cast into Cuda before the in-place operation requires_grad_()? And because f was cast by assignment device="cuda" rather than by method .cuda()? Answer When a tensor is first

Accepted Answer

When a tensor is first created, it becomes a leaf node.Basically, all inputs and weights of a neural network are leaf nodes of the computational graph.When any operation is performed on a tensor, it is not a leaf node anymore.b = torch.rand(10, requires_grad=True) # create a leaf nodeb.is_leaf # Trueb = b.cuda() # perform a casting operationb.is_leaf # Falserequires_grad_() is not an operation in the same way as cuda() or others are.It creates a new tensor, because tensor which requires gradient (trainable weight) cannot depend on anything else.e = torch.rand(10) # create a leaf nodee.is_leaf # Truee = e.cuda() # perform a casting operatione.is_leaf # Falsee = e.requires_grad_() # this creates a NEW tensore.is_leaf # TrueAlso, detach() operation creates a new tensor which does not require gradient:b = torch.rand(10, requires_grad=True)b.is_leaf # Trueb = b.detach()b.is_leaf # TrueIn the last example we create a new tensor which is already on a cuda device.We do not need any operation to cast it.f = torch.rand(10, requires_grad=True, device="cuda") # create a leaf node on cuda

Advertisement

Answer