How to fine-tune a functional model in Keras?

Question

Taking a pre-trained model in Keras and replacing the top classification layer to retrain the network to a new task has several examples using a Sequential model in Keras. A sequential model has methods model.pop() and model.add() which make this fairly easy. However, how is this achieved when using a functio…

Accepted Answer

You can use a pretrained functional model with the last layer removed as a layer. You may think of a model as a &#8220;bigger layer&#8221;. Then redefine a new model that wraps &#8220;bigger layer&#8221; and a new layer.An example:import tensorflow as tffrom keras.layers import Dense,Inputfrom keras.models import Sequential,Modelinput_tensor = Input(shape=(64,))x = Dense(32, activation='relu')(input_tensor)x = Dense(32, activation='relu')(x)output_tensor = Dense(10, activation=tf.nn.softmax)(x)model = Model(input_tensor, output_tensor)model.compile(loss='sparse_categorical_crossentropy', optimizer='sgd',              metrics=['accuracy'])print(model.summary())model.save_weights('my_model_weights.h5')# model.load_weights('my_model_weights.h5')def pop_layer(model):    if not model.outputs:        raise Exception('Sequential model cannot be popped: model is empty.')    model.layers.pop()    if not model.layers:        model.outputs = []        model.inbound_nodes = []        model.outbound_nodes = []    else:        model.layers[-1].outbound_nodes = []        model.outputs = [model.layers[-1].output]    return model# Remove last layer with custom function (from another post)model_old = pop_layer(model)# Now add a new layer to the modelmodel_new = Sequential()model_new.add(model_old)model_new.add(Dense(2, activation=tf.nn.softmax, name='fc2'))model_new.compile(loss='sparse_categorical_crossentropy', optimizer='sgd',              metrics=['accuracy'])print(model_new.summary())As a result, you can see that the parameters of the last layer of pretrained functional model are missing._________________________________________________________________Layer (type)                 Output Shape              Param #   =================================================================input_1 (InputLayer)         (None, 64)                0         _________________________________________________________________dense_1 (Dense)              (None, 32)                2080      _________________________________________________________________dense_2 (Dense)              (None, 32)                1056      _________________________________________________________________dense_3 (Dense)              (None, 10)                330       =================================================================Total params: 3,466Trainable params: 3,466Non-trainable params: 0_________________________________________________________________None_________________________________________________________________Layer (type)                 Output Shape              Param #   =================================================================model_1 (Model)              multiple                  3136      _________________________________________________________________fc2 (Dense)                  (None, 2)                 66        =================================================================Total params: 3,202Trainable params: 3,202Non-trainable params: 0_________________________________________________________________None

Advertisement

Answer