Tag: stemming

Neither stemmer nor lemmatizer seem to work very well, what should I do?

countvectorizer lemmatization python stemming wordnet

I am new to text analysis and am trying to create a bag of words model(using sklearn’s CountVectorizer method). I have a data frame with a column of text with words like ‘acid’, ‘acidic’, ‘acidity’, ‘wood’, ‘woodsy’, ‘woody’. I thin…