Add missing timestamps for each different ID in dataframe

Question

I have two dataframes (simple examples shown below): df1 shows every timestamp I am interested in. df2 shows data sorted by timestamp and ID. What I need to do is add every single timestamp from df1 that is not in df2 for each unique ID and add zero to the value column. This is the outcome I&#8217;m intereste…

Accepted Answer

Try:x = (    df2.groupby("ID column")    .apply(lambda x: x.merge(df1, how="outer").fillna(0))    .drop(columns="ID column")    .droplevel(1)    .reset_index()    .sort_values(by=["ID column", "time column"]))print(x)Prints:    ID column         time column  Value0           1 2022-01-01 00:00:00   10.04           1 2022-01-01 00:15:00    0.01           1 2022-01-01 00:30:00    9.05           1 2022-01-01 00:45:00    0.06           1 2022-01-02 00:00:00    0.07           1 2022-01-02 00:15:00    0.02           1 2022-01-02 00:30:00    5.03           1 2022-01-02 00:45:00   15.08           2 2022-01-01 00:00:00    6.09           2 2022-01-01 00:15:00    2.011          2 2022-01-01 00:30:00    0.012          2 2022-01-01 00:45:00    0.013          2 2022-01-02 00:00:00    0.014          2 2022-01-02 00:15:00    0.015          2 2022-01-02 00:30:00    0.010          2 2022-01-02 00:45:00    7.0

Advertisement

Answer