
tf.keras.BatchNormalization giving unexpected output

import tensorflow as tf
tf.enable_eager_execution()

print(tf.keras.layers.BatchNormalization()(tf.convert_to_tensor([[5.0, 70.0], [5.0, 60.0]])))
print(tf.contrib.layers.batch_norm(tf.convert_to_tensor([[5.0, 70.0], [5.0, 60.0]])))

The output of the above code (run with TensorFlow 1.15) is:

tf.Tensor([[ 4.99 69.96] [ 4.99 59.97]], shape=(2, 2), dtype=float32)
tf.Tensor([[ 0. 0.99998] [ 0. -0.99998]], shape=(2, 2), dtype=float32)

My problem is: why does the same operation give completely different outputs? I also played with some of the parameters of the two functions, but the result was the same. For me, the second output is what I want. PyTorch's batch norm also gives the same output as the second one, so I'm thinking it's an issue with Keras.

Does anyone know how to fix batch norm in Keras?


Answer

The BatchNormalization layer behaves differently in training vs. inference:

  1. During training (i.e. when using fit() or when calling the layer/model with the argument training=True), the layer normalizes its output using the mean and standard deviation of the current batch of inputs.

  2. During inference (i.e. when using evaluate() or predict(), or when calling the layer/model with the argument training=False, which is the default), the layer normalizes its output using a moving average of the mean and standard deviation of the batches it has seen during training.

So the first result comes from tf.keras.layers.BatchNormalization's default training=False, while the second comes from tf.contrib.layers.batch_norm's default is_training=True.
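
To see where the training-mode numbers come from, you can reproduce the computation by hand: each feature column is normalized with its own batch mean and variance, plus a small epsilon (0.001 is the default in both layers). A minimal NumPy sketch:

import numpy as np

x = np.array([[5.0, 70.0], [5.0, 60.0]], dtype=np.float32)
eps = 1e-3  # default epsilon in both layers

# Training mode: normalize each column with the statistics of the current batch
mean = x.mean(axis=0)  # [ 5. 65.]
var = x.var(axis=0)    # [ 0. 25.]
print((x - mean) / np.sqrt(var + eps))
# [[ 0.       0.99998]
#  [ 0.      -0.99998]]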

If you want the same result from both, you may try:

x = tf.convert_to_tensor([[5.0, 70.0], [5.0, 60.0]])
# Force the Keras layer into training mode so both normalize with batch statistics
print(tf.keras.layers.BatchNormalization()(x, training=True).numpy().tolist())
print(tf.contrib.layers.batch_norm(x).numpy().tolist())  # is_training=True by default
#output
#[[0.0, 0.9999799728393555], [0.0, -0.9999799728393555]]
#[[0.0, 0.9999799728393555], [0.0, -0.9999799728393555]]

or

x = tf.convert_to_tensor([[5.0, 70.0], [5.0, 60.0]])
print(tf.keras.layers.BatchNormalization()(x).numpy().tolist())  # training=False by default
# Force the contrib layer into inference mode so both use moving statistics
print(tf.contrib.layers.batch_norm(x, is_training=False).numpy().tolist())
#output
#[[4.997501850128174, 69.96502685546875], [4.997501850128174, 59.97002410888672]]
#[[4.997501850128174, 69.96502685546875], [4.997501850128174, 59.97002410888672]]
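
The inference-mode numbers follow from the same formula: a freshly constructed layer initializes its moving mean to 0 and its moving variance to 1, so an untrained layer in inference mode simply scales the input by 1/sqrt(1 + eps). A minimal NumPy sketch:

import numpy as np

x = np.array([[5.0, 70.0], [5.0, 60.0]], dtype=np.float32)
eps = 1e-3

# Inference mode on an untrained layer: moving_mean = 0, moving_variance = 1
print((x - 0.0) / np.sqrt(1.0 + eps))
# [[ 4.9975014 69.96503  ]
#  [ 4.9975014 59.970024 ]]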