How to fill rows of a dataframe by following the order of elements in a list from a dictionary python

Question

I would like to fill the empty rows of the dataframe: based on the following dictionary: dict1 shows how many kids each id in df has. For instance, A is parent of K and J. J has no kids. G has A and H. The empty rows in df are belongs to id J,Y,Z, and G.The list gives us these

Accepted Answer

The code below works, but let me explain what I did to make it work.First of all, the structure is similar to yours: I use a for-loop to loop over the kids in new_list. Then I check if rule 1 needs to be applied, else rule 2 or 3 needs to be applied. For rule 2, I check if there is only one kid and if the kid is not in new_list. For rule 3, more than one kids need to be there and ALL of them need to be NOT in new_list.Then the trick for rule 4 is to use a while loop: do rule 123 and then compute again the new_list. If the length of this list is larger than zero. We do rule 123 again, updating our dataframe again.In the first loop kid J and G get their value.Second loop, kid Z gets his value.Finally, kid Y gets the value.new_list = list(df.id[df.test1.isnull()])while len(new_list) > 0:    for i in new_list:        if str(dic1[i][0]) == 'nan': # rule 1            df.loc[df.id == i, 'test1'] = 0        else:            if dic1[i][1] == 1:                kid = dic1[i][0][0]                if kid not in new_list: # rule 2                    df.loc[df.id == i, 'test1'] = df.test1[df.id == kid].values[0]            else:                kids = dic1[i][0]                if all(kid not in new_list for kid in kids): # rule 3                    max_value = df.test1[df.id.isin(kids)].max()                    df.loc[df.id == i, 'test1'] = max_value    new_list = list(df.id[df.test1.isnull()])

Advertisement

Answer