Adding a value to a column in a DataFrame depending on a value in another column

Question

I have a DataFrame with multiple columns. The column weighting_factor is empty. Now I want to fill that column row by row wherever the value of index_1 lies between specific integer bounds. In my attempt, wf_tooold is a float and oldest_max is an int, but I get an error. What would be a good way to fill the column?

Accepted Answer

You basically want to update a filtered subset of rows with a value, which you can do with:

    df.loc[df['index_1'] <= oldest_max, 'weighting_factor'] = wf_toold

For example, with oldest_max = 4 and wf_toold = 14.25, we get:

    >>> df
        index_1 weighting_factor
    0         1            14.25
    1         2            14.25
    2         3            14.25
    3         4            14.25
    4         5
    5         6
    6         7
    7         8
    8         9
    9        10
    10       11
    11       12

It might, however, be better to give weighting_factor NaN as its starting value; otherwise pandas will treat weighting_factor as a Series of objects, not floats:

    from numpy import nan
    df['weighting_factor'] = nan

You can check between a lower bound and an upper bound with:

    df.loc[df['index_1'].between(old_min, oldest_max), 'weighting_factor'] = wf_toold
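Putting the pieces together, here is a minimal, self-contained sketch of the approach. The DataFrame is a hypothetical stand-in for the one in the question, and old_min = 2 is an assumed value chosen only for illustration; oldest_max and wf_toold reuse the example values above.

```python
import numpy as np
import pandas as pd

# Hypothetical data mirroring the example: index_1 runs from 1 to 12.
df = pd.DataFrame({"index_1": range(1, 13)})

# Start from NaN so the column gets a float dtype rather than object.
df["weighting_factor"] = np.nan

# old_min is an assumed value for illustration; the others match the example.
old_min, oldest_max, wf_toold = 2, 4, 14.25

# between() includes both bounds by default.
mask = df["index_1"].between(old_min, oldest_max)

# Assign only to the rows selected by the boolean mask.
df.loc[mask, "weighting_factor"] = wf_toold

print(df)
```

Rows outside the bounds keep NaN, so the column stays numeric and later arithmetic on it should behave as expected.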

Answer