What is the relation between a learning rate scheduler and an optimizer?

If I have a model:

import torch
import torch.nn as nn
import torch.optim as optim

class net_x(nn.Module):
    def __init__(self):
        super(net_x, self).__init__()
        self.fc1 = nn.Linear(2, 20)
        self.fc2 = nn.Linear(20, 20)
        self.out = nn.Linear(20, 4)

    def forward(self, x):
        x = self.fc1(x)
        x = self.fc2(x)
        x = self.out(x)
        return x

nx = net_x()

And then I’m defining my inputs, optimizer (with lr=0.1), scheduler (with base_lr=1e-3), and training:

r = torch.tensor([1.0, 2.0])
optimizer = optim.Adam(nx.parameters(), lr=0.1)
scheduler = torch.optim.lr_scheduler.CyclicLR(optimizer, base_lr=1e-3, max_lr=0.1,
                                              step_size_up=1, mode="triangular2",
                                              cycle_momentum=False)

path = 'opt.pt'
for epoch in range(10):
    optimizer.zero_grad()
    net_predictions = nx(r)
    loss = torch.sum(torch.randint(0,10,(4,)) - net_predictions)
    loss.backward()
    optimizer.step()
    scheduler.step()
    print('loss:', loss)

    # save the model, optimizer, and scheduler state dicts every epoch
    torch.save({'epoch': epoch,
                'net_x_state_dict': nx.state_dict(),
                'optimizer_state_dict': optimizer.state_dict(),
                'scheduler': scheduler.state_dict(),
                }, path)
# load the state dicts back
checkpoint = torch.load(path)
nx.load_state_dict(checkpoint['net_x_state_dict'])
optimizer.load_state_dict(checkpoint['optimizer_state_dict'])
scheduler.load_state_dict(checkpoint['scheduler'])

The optimizer seems to take its learning rate from the scheduler:

for g in optimizer.param_groups:
    print(g)
>>>
{'lr': 0.001, 'betas': (0.9, 0.999), 'eps': 1e-08, 'weight_decay': 0, 'amsgrad': False, 'initial_lr': 0.001, 'params': [Parameter containing:

Does the learning rate scheduler overwrite the optimizer's learning rate? How is it connected to the optimizer? I'm trying to understand the relation between them (i.e., how they interact).


Answer

TL;DR: The LR scheduler holds the optimizer as a member and explicitly alters the learning rates of its parameter groups.


As mentioned in the official PyTorch documentation, the learning rate scheduler receives the optimizer as an argument in its constructor, and thus has access to its parameter groups.
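
For example (a minimal sketch with a hypothetical stand-in model, using the same CyclicLR settings as in the question), you can verify that the scheduler keeps a reference to the very same optimizer object and rewrites its param_groups in place:

import torch.nn as nn
import torch.optim as optim

model = nn.Linear(2, 4)                  # hypothetical stand-in model
optimizer = optim.Adam(model.parameters(), lr=0.1)
scheduler = optim.lr_scheduler.CyclicLR(optimizer, base_lr=1e-3, max_lr=0.1,
                                        step_size_up=1, mode="triangular2",
                                        cycle_momentum=False)

print(scheduler.optimizer is optimizer)  # True: the scheduler stores a reference, not a copy
print(optimizer.param_groups[0]['lr'])   # 0.001: already overwritten to base_lr at construction
scheduler.step()                         # (in real training, call optimizer.step() first to avoid a warning)
print(optimizer.param_groups[0]['lr'])   # 0.1: changed in place by scheduler.step()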

The common use is to update the LR after every epoch:

scheduler = ... # initialize some LR scheduler
for epoch in range(100):
    train(...) # here optimizer.step() is called numerous times.
    validate(...)
    scheduler.step()
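
Where exactly you call scheduler.step() depends on the scheduler: per the PyTorch docs, CyclicLR (used in the question) is meant to be stepped after every batch rather than after every epoch. Below is a hedged sketch (hypothetical stand-in model, illustrative step sizes) that prints the LR the scheduler writes into the optimizer at each step:

import torch.nn as nn
import torch.optim as optim

model = nn.Linear(2, 4)                  # hypothetical stand-in model
optimizer = optim.Adam(model.parameters(), lr=0.1)
scheduler = optim.lr_scheduler.CyclicLR(optimizer, base_lr=1e-3, max_lr=0.1,
                                        step_size_up=4, mode="triangular2",
                                        cycle_momentum=False)

for batch in range(8):
    # ... forward pass, loss.backward(), and optimizer.step() would go here ...
    scheduler.step()
    # get_last_lr() reports the value the scheduler just wrote into param_groups
    print(batch, scheduler.get_last_lr(), optimizer.param_groups[0]['lr'])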

All optimizers inherit from a common parent class torch.optim.Optimizer and are updated using the step method implemented for each of them.
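
In fact, nothing stops you from changing the learning rate yourself by writing to optimizer.param_groups; a scheduler simply automates exactly this kind of assignment. A minimal sketch (hypothetical stand-in model):

import torch.nn as nn
import torch.optim as optim

model = nn.Linear(2, 4)                  # hypothetical stand-in model
optimizer = optim.Adam(model.parameters(), lr=0.1)

# Manually lowering the LR: this is precisely the kind of update a scheduler performs in step()
for param_group in optimizer.param_groups:
    param_group['lr'] = 0.01

print(optimizer.param_groups[0]['lr'])   # 0.01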

Similarly, all LR schedulers (besides ReduceLROnPlateau) inherit from a common parent class named _LRScheduler. Looking at its source code reveals that in its step method the class indeed changes the LR of the optimizer's parameter groups in place:

...
for i, data in enumerate(zip(self.optimizer.param_groups, values)):
    param_group, lr = data
    param_group['lr'] = lr
...
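
This also explains the checkpointing code in the question: optimizer.state_dict() already contains the (scheduler-modified) lr of each param group, and scheduler.state_dict() contains the scheduler's own counters, so restoring both keeps them consistent. A hedged sketch of such a round-trip (hypothetical stand-in model, same CyclicLR settings as in the question):

import torch.nn as nn
import torch.optim as optim

model = nn.Linear(2, 4)                  # hypothetical stand-in model
optimizer = optim.Adam(model.parameters(), lr=0.1)
scheduler = optim.lr_scheduler.CyclicLR(optimizer, base_lr=1e-3, max_lr=0.1,
                                        step_size_up=1, mode="triangular2",
                                        cycle_momentum=False)
scheduler.step()                         # LR is now at max_lr = 0.1

checkpoint = {'optimizer_state_dict': optimizer.state_dict(),
              'scheduler_state_dict': scheduler.state_dict()}

# Fresh optimizer/scheduler, then restore: the loaded optimizer carries the LR the
# scheduler had set, and the loaded scheduler resumes from the same cycle position.
optimizer2 = optim.Adam(model.parameters(), lr=0.1)
scheduler2 = optim.lr_scheduler.CyclicLR(optimizer2, base_lr=1e-3, max_lr=0.1,
                                         step_size_up=1, mode="triangular2",
                                         cycle_momentum=False)
optimizer2.load_state_dict(checkpoint['optimizer_state_dict'])
scheduler2.load_state_dict(checkpoint['scheduler_state_dict'])
print(optimizer2.param_groups[0]['lr'], scheduler2.get_last_lr())   # 0.1 [0.1]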