How can I add to a dataframe count values of another?

Question

I have a problem that I would like to solve with a dataframe. The index of this table represents a cluster. I have a dataframe called "representative points" that has this structure: On the other hand I have a dataset containing a point with the cluster it belongs to. In this case the index does not mean anything important. the

Accepted Answer

I think that you may need something likeimport pandas as pda = pd.DataFrame({"lat": [76, 45, 32], "lon": [12, 34, 56]})b = pd.DataFrame({"lat": [32, 45, 13], "lon": [13, 13, 13], "cluster": [1, 2, 3]})a["cluster"] = a.indexgrouped = b.groupby("cluster").size().reset_index(name='counts')res = a.merge(grouped, on="cluster", how="outer")how=&#8217;outer&#8217; means that you keep both indices from a with no counts as well as clusters from b with no corresponding index in a. If you need something else, you may need &#8220;left&#8221;, &#8220;right&#8221; or &#8220;inner&#8221;.

Advertisement

Answer