pandas group by and fill in the missing time interval sequence

Question

I have a data frame like as shown below What I would like to do is a) FIll in the missing time by generating a sequence number (ex:1,2,3,4) and copy the value (for all other columns) from the previous row I was trying something like below But this doesn't help me get the expected output I expect my output to

Accepted Answer

Let&#8217;s set the time column as the index of dataframe then groupby the dataframe on person_id then for each group classified by person_id reindex the group to conform its index with the range of values specified in time column, finally concat all the groups to get the desired dataframe:grp = df.set_index('time').groupby('person_id')groups = [g.reindex(range(g.index.min(), g.index.max() + 1)).ffill().reset_index() for _, g in grp]out = pd.concat(groups, ignore_index=True).reindex(df.columns, axis=1)Alternatively you can first create tuple pairs for each person_id and corresponding range of values specified in time column  then  reindex the dataframe:grp = df.groupby('person_id')['time']idx = [(k, n) for k, t in grp  for n in range(t.min(), t.max() + 1)]out = df.set_index(['person_id', 'time']).reindex(idx).ffill().reset_index()Result (for person_id 11):    person_id  time  value0        11.0    -1  101.01        11.0     0  101.02        11.0     1  101.03        11.0     2  101.04        11.0     3  101.05        11.0     4  101.06        11.0     5  102.07        11.0     6  102.08        11.0     7  102.09        11.0     8  102.010       11.0     9  102.011       11.0    10  102.012       11.0    11  102.013       11.0    12  102.014       11.0    13  102.015       11.0    14  102.016       11.0    15  102.017       11.0    16  102.018       11.0    17  121.0

Advertisement

Answer