How to reset the incrementing values when assigning values to groups in a pandas dataframe?

Question

I have a pandas dataframe which looks like this after the following code: For clarity, row_l0 relates to Category, row_l1 relates to Process and row_l2 to Parent. The row_l0 is correct, but I can't seem to be able to reset the count/grouping for the subsequent groups (row_l1 and row_l2) when I get to category B (and beyond). E.g. at index

Accepted Answer

With the following dataframe:import pandas as pddf = pd.DataFrame(    {        "Category": ["A", "A", "A", "A", "A", "A", "B", "B", "B"],        "Process": ["a.5", "a.6", "a.6", "a.6", "a.6", "a.6", "b.1", "b.2", "b.2"],        "Parent": [            "a.5.4",            "a.6.1",            "a.6.2",            "a.6.3",            "a.6.4",            "a.6.5",            "b.1.1",            "b.2.1",            "b.2.2",        ],    },)Here is one way to do it:df["row_l0"] = df["Category"].apply(    lambda x: {col: i + 1 for i, col in enumerate(df["Category"].unique())}[x])df["row_l1"] = df["Process"].apply(lambda x: x[-1])df["row_l2"] = [    j + 1    for count in df["Parent"].str[0].value_counts().to_dict().values()    for j in range(count)]print(df)# Output  Category Process Parent  row_l0 row_l1  row_l20        A     a.5  a.5.4       1      5       11        A     a.6  a.6.1       1      6       22        A     a.6  a.6.2       1      6       33        A     a.6  a.6.3       1      6       44        A     a.6  a.6.4       1      6       55        A     a.6  a.6.5       1      6       66        B     b.1  b.1.1       2      1       17        B     b.2  b.2.1       2      2       28        B     b.2  b.2.2       2      2       3

Advertisement

Answer