Skip to content
Advertisement

How to add a row based on last user event in pandas?

Imagine I have a dataframe with user events

JavaScript

For each user after his last (by timestamp) event I want to add new row with ‘End’ event with the same timestamp as in previous event:

JavaScript

I have no idea how to do that. In SQL I would do that with LAG() or LEAD(). But what about pandas?

Advertisement

Answer

Use DataFrame.drop_duplicates for last row of User_id, change event_name to End and add to original by concat with sorting index (added safest sort mergesort):

JavaScript
User contributions licensed under: CC BY-SA
5 People found this is helpful
Advertisement