Calculate RMS, count, and sum of arrays in all columns of a pandas DataFrame

Question

I would like to calculate the RMS, count, and sum of the arrays stored in every column of a pandas DataFrame, and then put the results into three new DataFrames, as shown below.

P.S. The solution should handle any number of columns; in my case I have around 300 columns (x, y, z, a, b, c, ..., N).

ID        x                        y  z  ...
EF407412  [471, 1084, 1360, 2284]

Accepted Answer

Updated per the OP's comment to work for any number of columns. Check the code below:

    import pandas as pd
    from ast import literal_eval
    import numpy as np

    df = pd.DataFrame({'ID': ['EF407412', 'KM043272'],
                       'x': ['[471, 1084, 1360, 2284]', '[2173]'],
                       'y': ['[1408, 1572, 2277]', '[1293, 2354,]'],
                       'z': ['[977, 1003, 1493, 1519, 1650, 1676, 2804]', '[1200]']})

    # Number of columns in the original frame (ID plus the data columns).
    # It is fixed here so the slices below keep pointing at x, y, z
    # even after new columns are appended.
    col_num = df.shape[1]

COUNT

    # Per row: format the list lengths as one comma-separated string,
    # then split it into one "<col>_count" column per data column.
    # Note that the resulting columns hold strings.
    df[[i + "_count" for i in df.columns[1:col_num]]] = (
        df.apply(lambda x: ("{}," * (col_num - 1))[:-1].format(
                     *(tuple([len(literal_eval(x[col])) for col in df.columns[1:col_num]]))),
                 axis=1)
          .astype('str').str.split(',', expand=True).values
    )
    # Show the ID together with the count columns.
    df[['ID'] + [col for col in df.columns if col.endswith('count')]]

SUM

    # Same pattern, using sum() on each parsed list.
    df[[i + "_sum" for i in df.columns[1:col_num]]] = (
        df.apply(lambda x: ("{}," * (col_num - 1))[:-1].format(
                     *(tuple([sum(literal_eval(x[col])) for col in df.columns[1:col_num]]))),
                 axis=1)
          .astype('str').str.split(',', expand=True).values
    )
    # Show the ID together with the sum columns.
    df[['ID'] + [col for col in df.columns if col.endswith('sum')]]

RMS

    # Same pattern, using sqrt(mean(square(values))) on each parsed list.
    df[[i + "_rms" for i in df.columns[1:col_num]]] = (
        df.apply(lambda x: ("{}," * (col_num - 1))[:-1].format(
                     *(tuple([np.sqrt(np.mean(np.square(literal_eval(x[col])))) for col in df.columns[1:col_num]]))),
                 axis=1)
          .astype('str').str.split(',', expand=True).values
    )
    # Show the ID together with the rms columns.
    df[['ID'] + [col for col in df.columns if col.endswith('rms')]]
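The question asks for three separate DataFrames, so one possible variation is to parse every cell once and then map a function over the parsed lists. The following is only a minimal sketch, not the answer's own method. It assumes each cell is a string that `ast.literal_eval` can turn into a non-empty list of numbers, and the names `parsed`, `rms`, `per_cell`, `count_df`, `sum_df`, and `rms_df` are hypothetical, chosen for illustration.

```python
from ast import literal_eval

import numpy as np
import pandas as pd

# Same sample data as in the answer; each cell is a list stored as a string.
df = pd.DataFrame({
    "ID": ["EF407412", "KM043272"],
    "x": ["[471, 1084, 1360, 2284]", "[2173]"],
    "y": ["[1408, 1572, 2277]", "[1293, 2354,]"],
    "z": ["[977, 1003, 1493, 1519, 1650, 1676, 2804]", "[1200]"],
})

# Parse every non-ID column once, so each cell becomes a real Python list.
# Assumption: every cell is a valid list literal.
parsed = df.set_index("ID").apply(lambda col: col.map(literal_eval))


def rms(values):
    """Root mean square of a non-empty sequence of numbers."""
    return float(np.sqrt(np.mean(np.square(values))))


def per_cell(func):
    """Hypothetical helper: apply func to every parsed cell, column by column,
    and return a new DataFrame with ID restored as a regular column."""
    return parsed.apply(lambda col: col.map(func)).reset_index()


count_df = per_cell(len)  # number of elements in each list
sum_df = per_cell(sum)    # sum of each list
rms_df = per_cell(rms)    # root mean square of each list
```

Because nothing here depends on the column names, the same code should apply to a frame with roughly 300 data columns. Unlike the string-formatting approach, the resulting frames hold numeric values rather than strings.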
