Pandas .transform() results in NaN values after update to newer version

Question

I have some code that used to function ~3-4 years ago. I've upgraded to newer versions of pandas, numpy, python since then and it has broken. I've isolated what I believe is the issue, but don't quite understand why it occurs. Problem: the last line "dc" is a pandas.Series with only NaN values. It should have no NaN values. Relevant

Accepted Answer

The cause of the NaNs is that your function outputs a DataFrame/Series with different indices, thus causing reindexing to NaNs.You can return a numpy array in your function:def function_name(S):    lambdas = df2.reindex(S.index.droplevel(['mouse']))*len(S)    return (-lambdas/np.expm1(-lambdas) - 1).to_numpy()  # convert to array heregb = df1.groupby(level=['mouse','target'])d_collisions = gb.transform(function_name)output:mouse  target  barcodeCAT    A       AAAT        6.338965               AAAG        2.815679               AAAC        0.547306               AAAD        1.811785       B       AAAZ        1.881744               AAAX       10.986611               AAAW        5.124226               AAAM        0.250513dtype: float64

Advertisement

Answer