Can the apply function change the original input pandas DataFrame?

Question

I always assumed that the apply function won't change the original pandas DataFrame and that an assignment is needed to keep the changes. However, could anyone help explain why this happens? returns So, the apply function changed the original pd.DataFrame without a return, but if there's a non-basic type col…

Accepted Answer

Series are mutable objects. If you modify them during an operation, the changes will be reflected if no copy is made.

This is what happens in the first case. My guess: no copy is made because your DataFrame has a homogeneous dtype (integer), so the whole DataFrame is stored internally as a single array.

In the second case, at least one item is a list. This makes the dtype object, so the DataFrame no longer has a single dtype, and apply must generate a new Series before running because of the mixed types in the row.

You can actually reproduce this just by changing a single element to another type:

    import pandas as pd

    def f(row):
        # mutate the row passed in by apply; nothing is returned
        row['a'] = 10
        row['b'] = 20

    df_x = pd.DataFrame({'a': [10, 11, 12],
                         'b': [3, 4, 5],
                         'c': [1, 1., 1]})  # float
    df_x.apply(f, axis=1)
    df_x
    # different types
    # no mutation

Output:

        a  b    c
    0  10  3  1.0
    1  11  4  1.0
    2  12  5  1.0

Take-home message: never modify a mutable input in a function (unless you want it and know what you're doing).
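The take-home message can be sketched as follows: copy the row inside the function, modify the copy, return it, and assign the result of apply. This is only a minimal illustration. The function name `set_values` and the variables `original` and `result` are made up for this example, and the explicit `row.copy()` is a defensive choice that makes the outcome independent of whether pandas passes a view or a copy.

```python
import pandas as pd

def set_values(row):
    # Work on a copy so the caller's data is never touched,
    # whether or not pandas hands us a view of the original.
    row = row.copy()
    row['a'] = 10
    row['b'] = 20
    return row

df_x = pd.DataFrame({'a': [10, 11, 12],
                     'b': [3, 4, 5]})  # homogeneous integer dtype

original = df_x.copy()                   # snapshot for comparison
result = df_x.apply(set_values, axis=1)  # changes live in the returned frame

print(df_x.equals(original))  # the input DataFrame is unchanged
print(result)
```

With this pattern the changes exist only in `result`, so the caller decides explicitly whether to overwrite `df_x` with it.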

Answer