Unique keywords on the column

Question

I&#8217;m new to pandas and I have a question. I have a dataframe like and I should remove duplicates on &#8220;Keywords&#8221; column, no matter if the duplicates are on the same row or on 3 different rows. No matter if it is written &#8220;warehouse&#8221; or &#8220;Warehouse&#8221; Everything value duplica…

Accepted Answer

One way using pandas.Series.str.split with explode:m = df["Keywords"].str.split("s*,s*").explode()m = m[~m.str.lower().duplicated(False)]df["Keywords"] = m.groupby(m.index).apply(", ".join)df = df.fillna("")Output:  Code                                        Keywords0    A                                      loan, land1    B  rental, Tenant, broker advisor, Lease and rent2    C                           Transport Air freight3    D

Advertisement

Answer