Gensim LDA Coherence Score Nan

Question

I created a Gensim LDA Model as shown in this tutorial: https://www.machinelearningplus.com/nlp/topic-modeling-gensim-python/ And it generates 10 topics with a log_perplexity of: lda_model.log_perplexity(data_df[&#8216;bow_corpus&#8217;]) = -5.325966117835991 But when I run the coherence model on it to calcul…

Accepted Answer

Solved!Coherence Model requires the original text, instead of the training corpus fed to LDA_Model &#8211; so when i ran this:coherence_model_lda = CoherenceModel(model=lda_model, texts=data_df['corpus'].tolist(), dictionary=dictionary, coherence='c_v')with np.errstate(invalid='ignore'):    lda_score = coherence_model_lda.get_coherence()  I got a coherence score of: 0.462Hope this helps someone else making the same mistake. Thanks!

Advertisement

Answer