pandas groupby dataframes, calculate diffs between consecutive rows

Question

Using pandas, I open some CSV files in a loop and set the index to the cycleID column, although the values in cycleID are not unique. See below: This prints the two columns (cycleID and mean) of the dataframe that I am interested in for further computations: The objective is to use the rows corresponding to the same cycle…

Accepted Answer

You can use groupby:

    s2 = df.groupby(['cycleID'])['mean'].diff()
    s2.dropna(inplace=True)

Output:

    1   -8.453876e-12
    3   -1.486037e-11
    5    2.482933e-12
    7   -3.388330e-12
    8    3.000000e-12

UPDATE

    import pandas as pd

    d = [[1, 1.5020712104685252e-11], [1, 6.56683605063102e-12],
         [2, 1.3993315187144084e-11], [2, -8.670502467042485e-13],
         [3, 7.0270625256163566e-12], [3, 9.509995221868016e-12],
         [4, 1.2901435995915644e-11], [4, 9.513106448422182e-12]]
    df = pd.DataFrame(d, columns=['cycleID', 'mean'])
    # diff() within each cycleID; the first row of each group is NaN and gets dropped
    df2 = df.groupby(['cycleID']).diff().dropna().rename(columns={'mean': 'difference'})
    # add back the 'mean' value of the row each difference belongs to
    df2['mean'] = df['mean'].iloc[df2.index]

Output:

         difference          mean
    1 -8.453876e-12  6.566836e-12
    3 -1.486037e-11 -8.670502e-13
    5  2.482933e-12  9.509995e-12
    7 -3.388330e-12  9.513106e-12
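The question mentions that cycleID is set as a (non-unique) index, whereas the answer's sample keeps it as an ordinary column. A minimal sketch of the same per-group diff for the indexed layout follows. It reuses the sample data from the answer; the `set_index` call only mimics the question's setup, and the names `indexed`, `diffs`, and `result` are illustrative, not from the original post.

```python
import pandas as pd

# Sample data taken from the answer above.
d = [[1, 1.5020712104685252e-11], [1, 6.56683605063102e-12],
     [2, 1.3993315187144084e-11], [2, -8.670502467042485e-13],
     [3, 7.0270625256163566e-12], [3, 9.509995221868016e-12],
     [4, 1.2901435995915644e-11], [4, 9.513106448422182e-12]]

# Assumption: mimic the question's layout, where cycleID is a non-unique index.
indexed = pd.DataFrame(d, columns=["cycleID", "mean"]).set_index("cycleID")

# Group on the index level instead of a column, then diff within each group.
diffs = indexed.groupby(level="cycleID")["mean"].diff()

# diff() keeps the original row order, so the values can be attached
# positionally, which sidesteps alignment on the duplicated index labels.
result = indexed.assign(difference=diffs.to_numpy()).dropna(subset=["difference"])

print(result)
```

Each remaining row is the second row of its cycle, with `difference` holding that row's `mean` minus the previous row's `mean` in the same cycle.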
