How to identify the minimum sum of squared values across the columns of a pandas dataframe?

Question

I have a pandas dataframe. How can I calculate the sum of the squared values for each entire column, and afterwards identify both the column that has the smallest of those sums and the smallest sum itself? I am trying something like deviation = df[columnName].pow(2).sum() in a loop, but ideas are very welcome!

Accepted Answer

You can calculate the sum of squares on the entire data frame at once, which returns a Series object with the column names as its index. You can then find the minimum value and the label of the minimum using min and idxmin:

    col_squares = df.pow(2).sum()
    col_squares
    # column1     26.00
    # column2     74.00
    # column3    242.26
    # dtype: float64

    col_squares.min(), col_squares.idxmin()
    # (26.0, 'column1')
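For anyone who wants to try this end to end, here is a minimal self-contained sketch. The dataframe from the question is not available, so the data and the column names a, b and c below are made up for illustration. It also includes the loop-style version from the question to show that both give the same per-column sums.

```python
import pandas as pd

# Hypothetical data: the question's actual dataframe is not shown,
# so these columns and values are assumptions for illustration only.
df = pd.DataFrame({
    "a": [1, -2, 3],
    "b": [0.5, 0.5, -1.0],
    "c": [4, 0, -1],
})

# Vectorized: square every element, then sum down each column.
# The result is a Series indexed by column name.
col_squares = df.pow(2).sum()

# Equivalent explicit loop, in the spirit of the attempt in the question.
loop_squares = pd.Series({name: df[name].pow(2).sum() for name in df.columns})

# Smallest sum, and the label of the column that produced it.
smallest_sum = col_squares.min()
smallest_column = col_squares.idxmin()

print(col_squares)
print(smallest_column, smallest_sum)  # b 1.5
```

Note that pow(2) will likely fail if the dataframe contains non-numeric columns; in that case, restricting it first with something like df.select_dtypes(include="number") should work.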
