Incompatible weight shape between Caffe and TensorFlow / Keras

I am trying to convert a Caffe model to Keras, and I have successfully been able to use both MMdnn and caffe-tensorflow. The output I have is .npy files and .pb files. I have not had much luck with the .pb files, so I stuck with the .npy files, which contain the weights and biases. I have reconstructed the mAlexNet network as follows:

import tensorflow as tf
from tensorflow import keras
from tensorflow.keras.layers import Conv2D, MaxPool2D, Dropout, Dense, Flatten

def define_malexnet():
    input = keras.Input(shape=(224, 224, 3), name='data')
    x = Conv2D(16, kernel_size=(11,11), strides=(4,4), activation='relu', name='conv1')(input)
    x = MaxPool2D(pool_size=(3,3), strides=(2,2), padding='same', name='pool1')(x)
    x = Conv2D(20, kernel_size=(5,5), strides=(1,1), activation='relu', name='conv2')(x)
    x = MaxPool2D(pool_size=(3,3), strides=(2,2), name='pool2')(x)
    x = Conv2D(30, kernel_size=(3,3), strides=(1,1), activation='relu', name='conv3')(x)
    x = MaxPool2D(pool_size=(3,3), strides=(2,2), name='pool3')(x)
    x = Flatten()(x)
    x = Dense(48, activation='relu', name='fc4')(x)
    output = Dense(2, activation='softmax', name='fc5')(x)
    
    occupancy_model = keras.Model(input, output, name='occupancy_malexnet')
    occupancy_model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
    return occupancy_model
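
For reference, printing the model summary is a quick way to confirm that the layer names above (conv1, conv2, conv3, fc4, fc5) line up with the keys in the converted weight file, since the loading loop below matches weights by layer name:

model = define_malexnet()
model.summary()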

Then I try to load the weights using this code snippet:

import numpy as np

weights_data = np.load('weights.npy', allow_pickle=True).item()
model = define_malexnet()
    
for layer in model.layers:
  if layer.name in weights_data.keys():
    layer_weights = weights_data[layer.name]
    layer.set_weights((layer_weights['weights'], layer_weights['bias']))

During this process I get an error:

ValueError: Layer conv1 weight shape (16,) is not compatible with provided weight shape (1, 1, 1, 16).

Now, as I understand it, this is because of how the two frameworks lay out their weights, but I have not found a way to solve this problem. My question is: how do I tweak the weights loaded from the file to fit my Keras model? Link to the weights.npy file: https://drive.google.com/file/d/1QKzY-WxiUnf9VnlhWQS38DE3uF5I_qTl/view?usp=sharing.
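
To make the mismatch visible, the shapes Keras expects can be printed next to the shapes stored in the converted file (a sketch reusing define_malexnet and weights.npy from above):

import numpy as np

weights_data = np.load('weights.npy', allow_pickle=True).item()
model = define_malexnet()

# compare the shapes Keras expects with the shapes stored in the converted file
for layer in model.layers:
    if layer.name in weights_data:
        expected = [w.shape for w in layer.get_weights()]
        provided = [weights_data[layer.name]['weights'].shape,
                    weights_data[layer.name]['bias'].shape]
        print(layer.name, 'expected:', expected, 'provided:', provided)

For conv1 this shows an expected bias shape of (16,) against a stored shape of (1, 1, 1, 16), which is the mismatch reported in the error.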


Answer

The problem is the bias vector. In the converted Caffe weights it is stored as a 4D tensor of shape (1, 1, 1, 16), but Keras expects a 1D tensor of shape (16,). Just flatten the bias vector:

import numpy as np

weights_data = np.load('weights.npy', allow_pickle=True).item()
model = define_malexnet()
    
for layer in model.layers:
  if layer.name in weights_data.keys():
    layer_weights = weights_data[layer.name]
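    # flatten the (1, 1, 1, N) bias from the Caffe conversion to the (N,) vector Keras expects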
    layer.set_weights((layer_weights['weights'], layer_weights['bias'].flatten()))

As a sanity check, once I create your model I access the conv1 kernel from the model, pull the corresponding array from your cached weights, and compare the two:

In [22]: weights1 = model.layers[1].weights[0].numpy()

In [23]: weights2 = weights_data['conv1']['weights']

In [24]: np.allclose(weights1, weights2)
Out[24]: True

The same for the biases:

In [25]: bias1 = model.layers[1].weights[1].numpy()

In [26]: bias2 = weights_data['conv1']['bias']

In [27]: np.allclose(bias1, bias2)
Out[27]: True

Notice that I didn’t have to flatten the cached bias for this check: np.allclose broadcasts the (16,) and (1, 1, 1, 16) arrays against each other before comparing them, so the extra singleton dimensions don’t affect the result.
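
For completeness, a minimal illustration of that broadcasting behaviour (shapes chosen to match the conv1 bias):

import numpy as np

b = np.arange(16, dtype=np.float32)
# (16,) and (1, 1, 1, 16) broadcast to the same shape, so the comparison succeeds
print(np.allclose(b, b.reshape(1, 1, 1, 16)))  # True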
