
Keras LSTM – why different results with “same” model & same weights?

(NOTE: Properly fixing the RNG state before each model creation, as described in a comment below, practically fixed my problem: results are now consistent to within 3 decimal places, but they aren’t exactly identical, so there’s still some hidden source of randomness that isn’t fixed by seeding the RNG… probably some lib uses the time in milliseconds or something. If anyone has an idea on that, it would be cool to know, so I will wait and not close the question yet :) )

I create a Keras LSTM model (used to predict some time series data, not important what), and every time I try to re-create an identical model (same model config loaded from JSON, same weights loaded from file, same args to the compile function), I get wildly different results on the same train and test data. WHY?

Code is roughly like this:

# Keras imports (Sequential API)
from keras.models import Sequential, model_from_json
from keras.layers import LSTM, Dense, Activation

# fix random
import random
random.seed(42)

# make model & compile
model = Sequential([
    LSTM(50, input_shape=(None, 1), return_sequences=True),
    LSTM(100, return_sequences=False),
    Dense(1),
    Activation("linear")
])
model.compile(loss="mse", optimizer="rmsprop")

# save it and its initial random weights
model_json = model.to_json()
model.save_weights("model.h5")

# fit and predict
model.fit(x_train, y_train, epochs=3)
r = model.predict(x_test)

# create new "identical" model
model2 = model_from_json(model_json)
model2.load_weights("model.h5")
model2.compile(loss="mse", optimizer="rmsprop")

# fit and predict "identical" model
model2.fit(x_train, y_train, epochs=3)
r2 = model2.predict(x_test)

# ...different results :(

I know that the model starts from random initial weights, so I’m saving them and reloading them. I’m also paranoid enough to assume there are some “hidden” params that I may not know of, so I serialize the model to JSON and reload it instead of recreating an identical one by hand (tried that, same thing btw). And I also fixed the random number generator.

It’s my first time with Keras, and I’m also a beginner to neural networks in general. But this drives me crazy… wtf can vary?!


On fixing random number generators: I run Keras with the TensorFlow backend, and I have these lines of code at the start to try and fix the RNGs for experimental purposes:

import random
random.seed(42)
import numpy
numpy.random.seed(42)
# TF 1.x API; in TF 2 the equivalent is tf.random.set_seed()
from tensorflow import set_random_seed
set_random_seed(42)

…but they still don’t fix the randomness.

And I understand that NNs are inherently stochastic and aren’t really meant to behave non-randomly. But I need to temporarily fix the randomness for experimental purposes (I’m even OK with it being reproducible on one machine only!).
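(For reference, the “fix the RNG state before each model creation” approach from the note at the top looks roughly like this. It is only a sketch: the helper name reseed and the seed value are my own, and it assumes the TF 1.x-style backend used above.)

import random
import numpy as np
from tensorflow import set_random_seed  # TF 1.x; use tf.random.set_seed() in TF 2

def reseed(seed=42):
    # reset Python's, NumPy's and TensorFlow's RNGs so weight init starts from the same state
    random.seed(seed)
    np.random.seed(seed)
    set_random_seed(seed)

# call reseed() immediately before building *each* model
reseed()
model = Sequential([
    LSTM(50, input_shape=(None, 1), return_sequences=True),
    LSTM(100, return_sequences=False),
    Dense(1),
    Activation("linear")
])

reseed()
model2 = model_from_json(model.to_json())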


Answer

Machine learning algorithms are, in general, non-deterministic: every time you run them the outcome can vary. This is largely due to the random initialization of the weights. If you want the results to be reproducible, you have to take the randomness out of the equation. A simple way to do this is to set a random seed.

import numpy as np
import tensorflow as tf

np.random.seed(1234)
tf.random.set_seed(1234)

# rest of your code

If you want to keep the randomness but not such high variance in your output, I would suggest either lowering your learning rate or changing your optimizer (an SGD optimizer with a relatively low learning rate, for example). A cool overview of gradient descent optimization is available here!
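With the tf.keras API that could look like this (a sketch; the learning rate value is just an illustration):

import tensorflow as tf

# SGD with a relatively low learning rate tends to give less run-to-run variance
model.compile(loss="mse", optimizer=tf.keras.optimizers.SGD(learning_rate=0.001))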


A note on TensorFlow’s random generators: besides the global seed (i.e. tf.random.set_seed()), they also keep an internal counter, so if you run

tf.random.set_seed(1234)
print(tf.random.uniform([1]).numpy())
print(tf.random.uniform([1]).numpy())

You’ll get 0.5380393 and 0.3253647, respectively. However, if you re-run that same snippet, you’ll get the same two numbers again.
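To see the counter in action, re-seeding in the middle of a program resets the sequence (a small sketch):

import tensorflow as tf

tf.random.set_seed(1234)
a = tf.random.uniform([1]).numpy()  # first draw after seeding

tf.random.set_seed(1234)            # re-seeding resets the internal counter
b = tf.random.uniform([1]).numpy()  # identical to the first draw

print(a == b)  # [ True]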

A detailed explanation of how random seeds work in TensorFlow can be found here.
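Note also that individual random ops accept their own seed argument (an operation-level seed), which is combined with the global seed to determine the sequence that op produces; a quick sketch:

import tensorflow as tf

tf.random.set_seed(1234)
# the op-level seed and the global seed together determine this op's output
print(tf.random.uniform([1], seed=1).numpy())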


For newer TF versions, take care of this too: TensorFlow 2.2 ships with an OS environment variable, TF_DETERMINISTIC_OPS, which, if set to '1', ensures that only deterministic GPU ops are used.
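Setting it before TensorFlow executes any ops could look like this (a sketch):

import os
os.environ["TF_DETERMINISTIC_OPS"] = "1"  # request deterministic GPU ops (TF 2.2+)

import tensorflow as tf
# ... build, train and evaluate the model as usual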
