Pandas get difference from first row at a set dtime with groupby

Question

I have a dataframe with [Group], [DTime] and [Value] columns. For each [Group], I'm trying to find the difference between the first [Value] and every subsequent value from a set [DTime]; for this example, say it's the start of the df at 2015-01-01. Ultimately I would like to plot a timeseries of […

Accepted Answer

Try this:

    df['Difference'] = (df['Value'] -
                        df.sort_values('Dtime').groupby('Group')['Value']
                          .transform('first'))

Output:

      Group       Dtime        Value  Difference
    0  Grp1  2015-01-01  1261.406773    0.000000
    1  Grp1  2015-01-02  1252.660231   -8.746542
    2  Grp1  2015-01-03  1223.076426  -38.330347
    3  Grp2  2015-01-01  1214.402352    0.000000
    4  Grp2  2015-01-02  1422.532532  208.130180
    5  Grp2  2015-01-03  1262.990213   48.587861
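The answer above takes each group's earliest row as the baseline. If the baseline should instead be a chosen [DTime] that is not the start of the frame, one possible variation is to filter to rows on or after that date before taking the first value per group. This is only a sketch: the `start` date of 2015-01-02 and the names `window` and `baseline` are assumptions made for illustration, and the sample frame just reuses the values from the output above.

```python
import pandas as pd

# Sample data copied from the answer's output
df = pd.DataFrame({
    "Group": ["Grp1"] * 3 + ["Grp2"] * 3,
    "Dtime": pd.to_datetime(["2015-01-01", "2015-01-02", "2015-01-03"] * 2),
    "Value": [1261.406773, 1252.660231, 1223.076426,
              1214.402352, 1422.532532, 1262.990213],
})

# Hypothetical baseline date (an assumption for this sketch)
start = pd.Timestamp("2015-01-02")

# Keep only rows on or after the baseline date, in time order
window = df[df["Dtime"] >= start].sort_values("Dtime")

# First Value per group within that window, aligned to the window's index
baseline = window.groupby("Group")["Value"].transform("first")

# Subtraction aligns on the index, so rows before `start` become NaN
df["Difference"] = df["Value"] - baseline
print(df)
```

Rows earlier than `start` have no baseline and are left as NaN, which keeps them out of a later plot without dropping them from the frame.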


Answer