Output tensors of a Functional model must be the output of a TensorFlow `Layer`

Question

So I&#8217;m trying to expand the Roberta Pretrained Model and I was doing a basic model for testing but I&#8217;m getting this error from TensorFlow: ValueError: Output tensors of a Functional model must be the output of a TensorFlow Layer. which is from the Model api of Keras but I don&#8217;t exactly know …

Accepted Answer

You need to pass a list of [input_ids , input_mask] to base_model.# !pip install transformers from transformers import TFAutoModelimport tensorflow as tfLEN_SEQ = 64# Define inputsinput_ids = tf.keras.layers.Input(shape=(LEN_SEQ,), name='input_ids', dtype='int32')input_mask = tf.keras.layers.Input(shape=(LEN_SEQ,), name='input_mask', dtype='int32')base_model = TFAutoModel.from_pretrained('roberta-base')for layer in base_model.layers:    layer.trainable = False# Check summary of tf_roberta_modelbase_model.summary()embedding = base_model([input_ids, input_mask])[1]# Or# embedding = base_model([input_ids, input_mask]).pooler_output# Define hidden layerslayer = tf.keras.layers.Dense(LEN_SEQ * 2, activation='relu')(embedding)layer = tf.keras.layers.Dense(LEN_SEQ, activation='relu')(layer)# Define outputoutput = tf.keras.layers.Dense(2, activation='softmax', name='output')(layer)model = tf.keras.models.Model(inputs=[input_ids, input_mask], outputs=[output])model.summary()Output:Model: "tf_roberta_model_2"_________________________________________________________________ Layer (type)                Output Shape              Param #   ================================================================= roberta (TFRobertaMainLayer  multiple                 124645632  )                                                                                                                                =================================================================Total params: 124,645,632Trainable params: 0Non-trainable params: 124,645,632_________________________________________________________________Model: "model"__________________________________________________________________________________________________ Layer (type)                   Output Shape         Param #     Connected to                     ================================================================================================== input_ids (InputLayer)         [(None, 64)]         0           []                                                                                                                                  input_mask (InputLayer)        [(None, 64)]         0           []                                                                                                                                  tf_roberta_model_2 (TFRobertaM  TFBaseModelOutputWi  124645632  ['input_ids[0][0]',               odel)                          thPoolingAndCrossAt               'input_mask[0][0]']                                             tentions(last_hidde                                                                               n_state=(None, 64,                                                                                768),                                                                                              pooler_output=(Non                                                                               e, 768),                                                                                           past_key_values=No                                                                               ne, hidden_states=N                                                                               one, attentions=Non                                                                               e, cross_attentions                                                                               =None)                                                                                                                                                               dense (Dense)                  (None, 128)          98432       ['tf_roberta_model_2[0][1]']                                                                                                        dense_1 (Dense)                (None, 64)           8256        ['dense[0][0]']                                                                                                                     output (Dense)                 (None, 2)            130         ['dense_1[0][0]']                                                                                                                  ==================================================================================================Total params: 124,752,450Trainable params: 106,818Non-trainable params: 124,645,632__________________________________________________________________________________________________

Advertisement

Answer