Merging segments from the same trips into a single trip for analysis

Question

In the MWE below, I show my attempt to line-plot trips (from my df aggregated per month): I realised in my df, some trips contains jump (maybe due to data log), so they should be merged into single trip before aggregation. In the given df example above (before grouping). User 154 does undertake 2-trips, not 3. First started at 10:10:00

Accepted Answer

You can use a custom function:offset = pd.DateOffset(minutes=30)merge_trip = lambda x: x['start'].ge(x['end'].shift() + offset).cumsum().add(1)df['Trips'] = df.groupby('user').apply(merge_trip).droplevel('user')out = df.groupby('Date', as_index=False)['Trips'].max()Output:>>> out      Date  Trips0  2007-05      21  2008-05      42  2008-06      1>>> df   user               start                 end  mode     Date  Trips0   154 2007-05-01 10:10:00 2007-05-01 10:36:00   bus  2007-05      11   154 2007-05-01 10:36:00 2007-05-01 11:00:00  walk  2007-05      12   154 2007-05-01 15:30:00 2007-05-01 15:55:00  taxi  2007-05      23    73 2008-05-19 16:25:54 2008-05-19 16:29:22  walk  2008-05      14    73 2008-05-21 02:21:37 2008-05-21 02:25:04  walk  2008-05      25    73 2008-05-22 01:30:09 2008-05-22 01:33:51  walk  2008-05      36    73 2008-05-29 01:55:59 2008-05-29 01:59:25  walk  2008-05      47    62 2008-06-20 04:21:40 2008-06-20 05:33:46   bus  2008-06      18    62 2008-06-20 05:40:31 2008-06-20 05:53:11  walk  2008-06      1

Advertisement

Answer