Pandas subtract each column in dataframe_a from all columns of dataframe_b and write result to third dataframe

Question

I have dataframe_a and dataframe_b filled with an variable number of columns but the same number of rows. I need to subtract each column of dfb from all dfa columns and create a new dataframe containing the subtracted values. Right now I'm doing this manually: then I'm using the concat function to concatenate all the columns: This all seems horribly

Accepted Answer

Setupimport pandas as pdimport numpy as npfrom itertools import productdfa = pd.DataFrame([[8, 7, 6]], range(5), [*'ABC'])dfb = pd.DataFrame([[1, 2, 3, 4]], range(5), [*'DEFG'])Pandas&#8217; concatI use the operator method rsub with the axis=0 argument.  See this Q&A for more informationpd.concat({c: dfb.rsub(s, axis=0) for c, s in dfa.items()}, axis=1)   A           B           C            D  E  F  G  D  E  F  G  D  E  F  G0  7  6  5  4  6  5  4  3  5  4  3  21  7  6  5  4  6  5  4  3  5  4  3  22  7  6  5  4  6  5  4  3  5  4  3  23  7  6  5  4  6  5  4  3  5  4  3  24  7  6  5  4  6  5  4  3  5  4  3  2Numpy&#8217;s broadcastingYou can play around with it and learn how it worksa = dfa.to_numpy()b = dfb.to_numpy()c = a[..., None] - b[:, None]df = pd.DataFrame(dict(zip(    product(dfa, dfb),    c.reshape(5, -1).transpose())))df   A           B           C            D  E  F  G  D  E  F  G  D  E  F  G0  7  6  5  4  6  5  4  3  5  4  3  21  7  6  5  4  6  5  4  3  5  4  3  22  7  6  5  4  6  5  4  3  5  4  3  23  7  6  5  4  6  5  4  3  5  4  3  24  7  6  5  4  6  5  4  3  5  4  3  2

Pandas subtract each column in dataframe_a from all columns of dataframe_b and write result to third dataframe

Advertisement

Answer

Setup

Pandas’ `concat`

Numpy’s broadcasting