Iterating through multiple rows using multiple values from a nested dictionary to update a data frame in Python

Question

I created a nested dictionary to keep multiple values for each combination. Example rows in the dictionary are as follows: dict = {'A': {B: array([1,2,3,4,5,6,7,8,9,10]), C: array([array([1,2,3,4,5,6,7,8,9,10],…}} There are multiple As, and within each A there are multiple arrays. Now I want to …

Accepted Answer

EDIT Ver 2: Reference Dict and pick dict index val

The dictionary you created is a bit confusing. I assume you wanted to reference it the way I have shown here (not as an array of arrays, as shown in C). I also assume B and C are literal values and not variables named B and C.

I created the dictionary dct (dict is the name of a Python built-in type, so it is best not to reuse it as a variable name), with different values to show that the code picks the value and not the index.

    import pandas as pd
    import numpy as np

    dct = {'A': {'B': np.array([.2,.4,.6,.8,1.0,1.2,1.4,1.6,1.8,2.0]),
                 'C': np.array([.3,.6,.9,1.2,1.5,1.8,2.1,2.4,2.7,3.0])
                }
          }

    c = ['Col 1','Col 2','Col 3','Col 4']
    d = [['A','B',2,10], ['A','C',3,10]]
    df = pd.DataFrame(d,columns=c)

    # Repeat each row as many times as the value in Col 3.
    # This creates duplicate rows for each Col 1 / Col 2 pair.
    df = df.loc[df.index.repeat(df['Col 3'])]

    # Group by Col 1 and Col 2 and number the rows within each group (0, 1, 2, ...).
    # This gives the index used to reference the dictionary.
    df['Col 5'] = (df.groupby(['Col 1','Col 2'])['Col 3'].transform('cumcount'))

    # Using the cumcount as the index, pick the value from dct
    # using keys Col 1, Col 2 and index Col 5.
    df['Col 5'] = df.apply(lambda x: dct[x['Col 1']][x['Col 2']][x['Col 5']],axis=1)
    print (df)

The output of this will be:

      Col 1 Col 2  Col 3  Col 4  Col 5
    0     A     B      2     10    0.2
    0     A     B      2     10    0.4
    1     A     C      3     10    0.3
    1     A     C      3     10    0.6
    1     A     C      3     10    0.9

If you want to multiply Col 5 by the Col 4 value, it's very simple.
Change the last assignment so that it multiplies Col 4 by the value picked from the dictionary:

    df['Col 5'] = df.apply(lambda x: x['Col 4'] * dct[x['Col 1']][x['Col 2']][x['Col 5']],axis=1)

The result of this will be:

      Col 1 Col 2  Col 3  Col 4  Col 5
    0     A     B      2     10    2.0
    0     A     B      2     10    4.0
    1     A     C      3     10    3.0
    1     A     C      3     10    6.0
    1     A     C      3     10    9.0

EDIT Ver 1: Not referencing dictionary

If you are just looking to have increments of 10 in Col 5 for each group of Col 1 and Col 2, then you can do this.

    import pandas as pd

    c = ['Col 1','Col 2','Col 3','Col 4']
    d = [['A','B',2,10],['A','C',3,10]]
    df = pd.DataFrame(d,columns=c)

    df = df.loc[df.index.repeat(df['Col 3'])]
    df['Col 5'] = (df.groupby(['Col 1','Col 2'])['Col 3'].transform('cumcount')+1)*10
    print (df)

The output of this will be:

      Col 1 Col 2  Col 3  Col 4  Col 5
    0     A     B      2     10     10
    0     A     B      2     10     20
    1     A     C      3     10     10
    1     A     C      3     10     20
    1     A     C      3     10     30

If you want Col 3 to have a value of 1, then:

    df['Col 3'] = 1

This will then result in:

      Col 1 Col 2  Col 3  Col 4  Col 5
    0     A     B      1     10     10
    0     A     B      1     10     20
    1     A     C      1     10     10
    1     A     C      1     10     20
    1     A     C      1     10     30

If you need it to reference the dictionary, then I need to change the code.
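As a possible alternative to the row-wise apply used in Ver 2, here is a minimal sketch that slices each array once per original row and concatenates the pieces. This is not part of the original answer. It assumes every (Col 1, Col 2) pair exists in dct and that Col 3 never exceeds the length of the matching array. The names picked and out are only illustrative.

```python
import numpy as np
import pandas as pd

dct = {'A': {'B': np.array([.2, .4, .6, .8, 1.0, 1.2, 1.4, 1.6, 1.8, 2.0]),
             'C': np.array([.3, .6, .9, 1.2, 1.5, 1.8, 2.1, 2.4, 2.7, 3.0])}}

c = ['Col 1', 'Col 2', 'Col 3', 'Col 4']
d = [['A', 'B', 2, 10], ['A', 'C', 3, 10]]
df = pd.DataFrame(d, columns=c)

# For each original row, take the first `Col 3` entries of the matching array.
# Assumption: Col 3 is never larger than the array length; otherwise the slice
# would be shorter than expected and the assignment below would fail.
picked = [dct[k1][k2][:n]
          for k1, k2, n in zip(df['Col 1'], df['Col 2'], df['Col 3'])]

# Repeat the rows exactly as in the answer, then attach the picked values
# in the same order, multiplied by Col 4.
out = df.loc[df.index.repeat(df['Col 3'])].copy()
out['Col 5'] = np.concatenate(picked) * out['Col 4'].to_numpy()

print(out)
```

With the sample data this should give the same Col 5 values as the multiplied result in Ver 2. The trade-off is that the dictionary is looked up once per original row instead of once per repeated row, which may matter when Col 3 is large.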
