Count frequencies (unique rows) from a pandas list type column

Question

I have a dataframe (df) like this: And I have a list (l) like this: For each element in l, I want to count the unique rows of df in which it appears. But I don't understand how to check whether a value exists in the list column of the dataframe. Is there any way I can fix this? Or is there a cleaner, more efficient way?

Accepted Answer

Here we use the apply method, which applies a given function to each element of the column (in our case, a function that tests whether a value is in that row's list). We then sum the True values, i.e. the rows in which the requested value was found, and save the count in a dictionary. We do this for every requested letter. I have not tested the performance of this solution.

    import pandas as pd

    df = pd.DataFrame([
        {'id': 1, 'col': ['A', 'B', 'C', 'C']},
        {'id': 2, 'col': ['B', 'C', 'D']},
        {'id': 3, 'col': ['C', 'D', 'E']}])

    letters = ["A", "C", "D", "F"]

    # For each letter, build a boolean Series (True where the row's list
    # contains the letter) and sum it to get the number of matching rows.
    # int() keeps the counts as plain Python ints rather than NumPy integers.
    res = {v: int(df['col'].apply(lambda x: v in x).sum())
           for v in letters}

    # output
    # {'A': 1, 'C': 3, 'D': 2, 'F': 0}
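Since the performance of the apply-based approach is untested, here is a sketch of an alternative that avoids calling apply once per letter. It flattens the list column with explode, drops repeated values within the same row, and counts the remaining values with value_counts. The helper name count_rows_containing is hypothetical, not something from the question, and the sketch assumes the dataframe has a unique index, since the index is what identifies a row after exploding. Whether it is actually faster on your data would need to be measured.

```python
import pandas as pd

df = pd.DataFrame([
    {'id': 1, 'col': ['A', 'B', 'C', 'C']},
    {'id': 2, 'col': ['B', 'C', 'D']},
    {'id': 3, 'col': ['C', 'D', 'E']}])

letters = ["A", "C", "D", "F"]


def count_rows_containing(frame, column, values):
    """Hypothetical helper: count the rows whose list in `column` contains each value.

    Assumes `frame` has a unique index, because the index label is used
    to tell rows apart after exploding.
    """
    # One row per list element; the original index label is repeated.
    exploded = frame[column].explode()
    # Turn the index into a column and drop (row, value) duplicates so a
    # value repeated inside one list is counted once for that row.
    unique_pairs = exploded.reset_index().drop_duplicates()
    # Number of distinct rows in which each value occurs.
    counts = unique_pairs[column].value_counts()
    # Values that never occur are reported as 0.
    return {v: int(counts.get(v, 0)) for v in values}


res = count_rows_containing(df, 'col', letters)
print(res)
```

The whole column is scanned once regardless of how many letters are requested, so this may scale better when the list of letters is long, at the cost of building an intermediate exploded frame in memory.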

Answer