
Pandas combine rows in groups to get rid of Nans

I want to do something similar to what pd.combine_first() does, but as a row-wise operation performed on a shared index, collapsing each group into a single row. I also want to add a new column in place of the old one, while keeping the original values of the shared column under new names.

In this case the ‘ts’ column is one that I want to replace with time_now.

time_now = "2022-08-05"

row1 = {'unique_id':5,'ts': '2022-08-02','id2':2,'val':300, 'ffo1':55, 'debt':200}
row2 = {'unique_id':5,'ts': '2022-08-03' ,'id2':2, 'esg':True,'gov_rt':90}
row3 = {'unique_id':5,'ts': '2022-08-04','id2':2, 'rank':5,'buy_or_sell':'sell'}
df = pd.DataFrame([row1,row2,row3])

   unique_id          ts  id2    val  ffo1   debt   esg  gov_rt  rank  
0          5  2022-08-02    2  300.0  55.0  200.0   NaN     NaN   NaN   
1          5  2022-08-03    2    NaN   NaN    NaN  True    90.0   NaN   
2          5  2022-08-04    2    NaN   NaN    NaN   NaN     NaN   5.0   

  buy_or_sell  
0         NaN  
1         NaN  
2        sell  

My desired output is below, using the new timestamp, but keeping the old ones based on their group index.

rows = [{'unique_id':5, 'ts':time_now ,'id2':2,'val':300, 'ffo1':55, 'debt':200,'esg':True,'gov_rt':90,'rank':5,'buy_or_sell':'sell', 'ts_1':'2022-08-02','ts_2':'2022-08-03', 'ts_3':'2022-08-04'}]
output = pd.DataFrame(rows)

   unique_id          ts  id2  val  ffo1  debt   esg  gov_rt  rank  
0          5  2022-08-05    2  300    55   200  True      90     5   

  buy_or_sell        ts_1        ts_2        ts_3  
0        sell  2022-08-02  2022-08-03  2022-08-04  

The part below seems to work when run by itself, but I cannot get it to work inside of a function because of mismatched index lengths.


df2 = df.set_index('ts').stack().reset_index()
rows = dict(zip(df2['level_1'], df2[0]))
ts = df2['ts'].unique().tolist()
for cnt, value in enumerate(ts):
    rows[f'ts_{cnt}'] = value
# collapse everything into a single row
df2 = pd.DataFrame([rows])
df2['time'] = time_now
df2


Answer

The problem was that I forgot to wrap the dictionary in a list to create a records-oriented dataframe. Additionally, when using a similar function, the index may need to be dropped when resetting, since duplicated columns can otherwise be created.
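To see why the list matters, here is a minimal standalone sketch: a bare dict of scalars has no row index, so pandas rejects it, while wrapping the same dict in a list builds a one-row, records-oriented dataframe.

```python
import pandas as pd

row = {'a': 1, 'b': 2}

# A bare dict of scalar values has no row index, so pandas raises ValueError
try:
    pd.DataFrame(row)
except ValueError as err:
    print(err)  # "If using all scalar values, you must pass an index"

# Wrapping the dict in a list treats it as one record, giving a single row
good = pd.DataFrame([row])
print(good.shape)  # (1, 2)
```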

I still wonder if there’s a better way to do what I want, since it’s kind of slow.

def func(df):
    # long format: one row per (ts, original column) pair
    df2 = df.set_index('ts').stack().reset_index()
    # the last non-null value wins for each column name
    rows = dict(zip(df2['level_1'], df2[0]))
    # keep the original timestamps as numbered columns
    ts = df2['ts'].unique().tolist()
    for cnt, value in enumerate(ts):
        rows[f'ts_{cnt}'] = value
    # wrap the dict in a list to build a single records-oriented row
    df2 = pd.DataFrame([rows])
    df2['time'] = time_now
    return df2

# run this
df.groupby('unique_id').apply(func)
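One possible faster alternative is a sketch along these lines: GroupBy.first() already takes the first non-null value per column, which collapses each group without building a Python dict per group; the original timestamps can then be spread into numbered columns separately. This is an assumption-laden rewrite of the approach above, not the author's method.

```python
import pandas as pd

time_now = "2022-08-05"
df = pd.DataFrame([
    {'unique_id': 5, 'ts': '2022-08-02', 'id2': 2, 'val': 300, 'ffo1': 55, 'debt': 200},
    {'unique_id': 5, 'ts': '2022-08-03', 'id2': 2, 'esg': True, 'gov_rt': 90},
    {'unique_id': 5, 'ts': '2022-08-04', 'id2': 2, 'rank': 5, 'buy_or_sell': 'sell'},
])

# first() keeps the first non-null value per column, collapsing
# each unique_id group into a single row without a per-group dict
collapsed = df.groupby('unique_id', as_index=False).first()

# spread each group's timestamps into ts_1, ts_2, ... columns
ts_wide = (
    df.groupby('unique_id')['ts']
      .apply(lambda s: pd.Series(s.values,
                                 index=[f'ts_{i + 1}' for i in range(len(s))]))
      .unstack()
)

# replace the collapsed ts with the new timestamp, keeping the old ones alongside
out = collapsed.drop(columns='ts').join(ts_wide, on='unique_id')
out.insert(1, 'ts', time_now)
print(out)
```

This trades the per-group stack/dict work for two vectorized group operations, which should scale better when there are many `unique_id` groups.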
