Add missing timestamps for each different ID in dataframe

Question

I have two dataframes (simple examples shown below): df1 shows every timestamp I am interested in. df2 shows data sorted by timestamp and ID. What I need to do is add every single timestamp from df1 that is not in df2 for each unique ID and add zero to the value column. This is the outcome I'm interested in My

Accepted Answer

Try:x = (    df2.groupby("ID column")    .apply(lambda x: x.merge(df1, how="outer").fillna(0))    .drop(columns="ID column")    .droplevel(1)    .reset_index()    .sort_values(by=["ID column", "time column"]))print(x)Prints:    ID column         time column  Value0           1 2022-01-01 00:00:00   10.04           1 2022-01-01 00:15:00    0.01           1 2022-01-01 00:30:00    9.05           1 2022-01-01 00:45:00    0.06           1 2022-01-02 00:00:00    0.07           1 2022-01-02 00:15:00    0.02           1 2022-01-02 00:30:00    5.03           1 2022-01-02 00:45:00   15.08           2 2022-01-01 00:00:00    6.09           2 2022-01-01 00:15:00    2.011          2 2022-01-01 00:30:00    0.012          2 2022-01-01 00:45:00    0.013          2 2022-01-02 00:00:00    0.014          2 2022-01-02 00:15:00    0.015          2 2022-01-02 00:30:00    0.010          2 2022-01-02 00:45:00    7.0

Advertisement

Answer