replicating data in same dataFrame

Question

I want to replicate the data from the same dataframe when a certain condition is fulfilled. Dataframe: I want to replicate the dataframe when going through a loop and there is a difference greater than 4 in row.hour. Expected Output: i want to replicate the rows when the iterating through all the row and ther…

Accepted Answer

My understanding is: you want to compare &#8216;Hour&#8217; values for two successive rows.If the difference is > 4 you want to add the previous row to the DF.If that is what you want try this:Create a DF:       j = pd.DataFrame({'Hour':[1, 2, 4,10,15,16,17,19],              'Wage':[15,17,20,25,26,30,40,15]})Define a function:   def f1(d):     dn = d.copy()     for x in range(len(d)-2):         if (abs(d.iloc[x+1].Hour - d.iloc[x+2].Hour) > 4):            idx = x + 0.5            dn.loc[idx] = d.iloc[x]['Hour'], d.iloc[x]['Wage']     dn = dn.sort_index().reset_index(drop=True)     return dnCall the function passing your DF:   nd = f1(j)     Hour   Wage    0   1   15   1    2   17   2    2   17   3    4   20   4    4   20   5    10  25   6    15  26   7    16  30   8    17  40   9    19  15

Advertisement

Answer