Create table by grouping mean values by column and list of one-hot encoded columns (Python, pandas)

Question

I am working with tweets and I would like to report the mean sentiment score by topic and by community. This is what my dataframe looks like where each row is a document (tweet): I want to create a dataframe that contains a mean sentiment value in each cell like this: Any thoughts on how to go about this plea…

Accepted Answer

First you want to propagate the sentiment through the topic, then average out by community_id:(df.filter(like='topic')   .mul(df.sentiment, axis=0)   .groupby(df.community_id)   .mean())

Advertisement

Answer