Skip to content
Advertisement

Incompatibility between input and final Dense Layer (Value Error)

I’m following this tutorial from Nabeel Ahmed to create your own emotion detector using Keras (I’m a noob) and I’ve found a strange behaviour that I’d like to understand. The input data is a bunch of 48×48 images, each one with an integer value between 0 and 6 (each number stands for an emotion label), which represents the emotion present in the image.

JavaScript

In order to feed the data into the model, train_X and val_X are reshaped (as the tutorial explains)

JavaScript

The model, as it is in the tutorial, is this one:

JavaScript

However, when I try to use it, using the tutorial snippet, I get an error in the line of the validation_data like this

JavaScript

After reviewing the code and the documentation about the fit method, my only idea was to change the 7 in the last Dense layer of the model to 1, which mysteriously works. I’d like to know what is happening here if anyone could give me a hint.

Advertisement

Answer

You seem to be working with sparse integer labels, where each sample belongs to one of seven classes {0, 1, 2, 3, 4, 5, 6}, so I would recommend using SparseCategoricalCrossentropy instead of CategoricalCrossentropy as your loss function. Just change this parameter and your model should work fine. If you want to use CategoricalCrossentropy, you will have to one-hot encode your labels, for example with:

JavaScript
User contributions licensed under: CC BY-SA
1 People found this is helpful
Advertisement