How to resample a dataframe an include start and end times?

Question

So I am working with tick data and I am attempting to resample the dataframe to minute bars, but when resample is called the time series begins and ends the first instance that a tick exists. How would I resample this data such that the first and last times can be specified to a certain start and end time? Edit

Accepted Answer

You can add start and end datetimes manually:#removed minutes and secondsdf1 = df.rename(lambda x: x.floor('H'))#removed duplicated DatetimeIndex - output empty dfdf1 = df1.loc[~df1.index.duplicated(), []]#join togetherdf1 = pd.concat([df, df1, df1.rename(lambda x: x + pd.Timedelta('00:59:00'))])print (df1)                              Code  ValTimestamp                              1970-01-19 14:50:27.600073933    A  5.01970-01-19 14:55:29.600124359    A  6.01970-01-19 14:00:00.000000000  NaN  NaN1970-01-19 14:59:00.000000000  NaN  NaNdf2 = df1.resample('1T').agg('sum')print (df2)For add values per days:df1 = df.rename(lambda x: x.floor('D'))df1 = df1.loc[~df1.index.duplicated(), []]df1 = pd.concat([df, df1, df1.rename(lambda x: x + pd.Timedelta('23:59:00'))])print (df1)                              Code  ValTimestamp                              1970-01-19 14:50:27.600073933    A  5.01970-01-19 14:55:29.600124359    A  6.01970-01-19 00:00:00.000000000  NaN  NaN1970-01-19 23:59:00.000000000  NaN  NaNdf2 = df1.resample('1T').agg('sum')print (df2)

Advertisement

Answer