How to input embeddings directly to a huggingface model instead of tokens?

Question

I'm going over the huggingface tutorial where they showed how tokens can be fed into a model to generate hidden representations: But how can I input word embeddings directly instead of tokens? That is, I have another model that generates word embeddings and I need to feed those into the model Answer Most (every?) huggingface encoder model supports that with

Accepted Answer

Most (every?) huggingface encoder model supports that with the parameter inputs_embeds:import torchfrom transformers import RobertaModelm = RobertaModel.from_pretrained("roberta-base")my_input = torch.rand(2,5,768)outputs = m(inputs_embeds=my_input)P.S.: Don&#8217;t forget the attention mask in case this is required.

Advertisement

Answer