
Compile model which has different dimensions of output and labels (in Tensorflow)

Simplest example that replicates the error:

import tensorflow as tf

def loss(y, logits):
    loss = tf.reduce_mean(tf.nn.sparse_softmax_cross_entropy_with_logits(labels=y, logits=logits))
    return loss

Input = tf.keras.layers.Input(dtype=tf.float32, shape=(20,), name="X")
hidden = tf.keras.layers.Dense(40, activation=tf.keras.activations.relu, name="hidden1")(Input)
logits = tf.keras.layers.Dense(10, name="outputs")(hidden)
optimizer = tf.keras.optimizers.Adam()

model = tf.keras.Model(inputs=Input, outputs=logits)
model.summary()
model.compile(optimizer=optimizer, loss=loss)

I understand that in this case the output of the model has shape (batch_size, 10) while my labels have shape (batch_size,). This is why I use tf.nn.sparse_softmax_cross_entropy_with_logits.
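For reference, the rank rule that the error message below enforces can be checked in isolation. A minimal sketch, independent of the model above: the sparse variant expects labels with one dimension fewer than the logits.

```python
import tensorflow as tf

# logits are (batch_size, num_classes); labels are (batch_size,) integer class indices,
# i.e. rank(labels) == rank(logits) - 1.
logits = tf.random.normal((4, 10))   # rank 2
labels = tf.constant([1, 3, 5, 7])   # rank 1
per_example = tf.nn.sparse_softmax_cross_entropy_with_logits(
    labels=labels, logits=logits)
print(per_example.shape)  # (4,) -- one loss value per example
```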

Before I can provide any labels to this model, compilation fails with the following error:

C:\Stas\Development\Anaconda3\lib\site-packages\tensorflow_core\python\ops\nn_ops.py in sparse_softmax_cross_entropy_with_logits(_sentinel, labels, logits, name)
   3445       raise ValueError("Rank mismatch: Rank of labels (received %s) should "
   3446                        "equal rank of logits minus 1 (received %s)." %
-> 3447                        (labels_static_shape.ndims, logits.get_shape().ndims))
   3448     if (static_shapes_fully_defined and
   3449         labels_static_shape != logits.get_shape()[:-1]):

ValueError: Rank mismatch: Rank of labels (received 2) should equal rank of logits minus 1 (received 2).

After some investigation, I see that compilation fails because TensorFlow somehow thinks that my target output has shape (None, None), while my model output has shape (None, 10); since the two ranks are equal, sparse cross entropy cannot be applied.

I learned that in TF 2.1 it was possible to pass the target output directly as a parameter to compile, which is no longer possible.

What would be the correct way for me to proceed with this?


Answer

According to the docs, you just have to make sure your labels have the shape [batch_size], i.e. rank 1. Here is a working example using tf.squeeze:

import tensorflow as tf

def loss(y, logits):
    # Keras passes targets with shape (batch_size, 1); squeeze to (batch_size,)
    y = tf.squeeze(y, axis=-1)
    loss = tf.reduce_mean(tf.nn.sparse_softmax_cross_entropy_with_logits(labels=y, logits=logits))
    return loss

Input = tf.keras.layers.Input(dtype=tf.float32, shape=(20,), name="X")
hidden = tf.keras.layers.Dense(40, activation=tf.keras.activations.relu, name="hidden1")(Input)
logits = tf.keras.layers.Dense(10, name="outputs")(hidden)
optimizer = tf.keras.optimizers.Adam()

model = tf.keras.Model(inputs=Input, outputs=logits)
model.summary()
model.compile(optimizer=optimizer, loss=loss)

x = tf.random.normal((50, 20))
y = tf.random.uniform((50, 1), maxval=10, dtype=tf.int32)
model.fit(x, y, epochs=2)