Optimal way to acquire percentiles of DataFrame rows

Question

Problem I have a pandas DataFrame df: My desired output, i.e. new_df, contains the 9 different percentiles including the median, and should have the following format: Attempt The following was my initial attempt: However, instead of returning the percentiles of all columns, it calculated these percentiles for each val column and therefore returned 1000 columns. As it calculated the percentiles

Accepted Answer

You can get use .describe() function like this:# Create Dataramedf = pd.DataFrame(np.random.randn(5,3))# .apply() the .describe() function with "axis = 1" rowsdf.apply(pd.DataFrame.describe, axis=1)output:   count      mean       std       min       25%       50%       75%       max0    3.0  0.422915  1.440097 -0.940519 -0.330152  0.280215  1.104632  1.9290491    3.0  1.615037  0.766079  0.799817  1.262538  1.725259  2.022647  2.3200362    3.0  0.221560  0.700770 -0.585020 -0.008149  0.568721  0.624849  0.6809783    3.0 -0.119638  0.182402 -0.274168 -0.220240 -0.166312 -0.042373  0.0815654    3.0 -0.569942  0.807865 -1.085838 -1.035455 -0.985072 -0.311994  0.361084if you want other percentiles than the default 0.25, .05, .075 you can create your own function where you change the values of .describe(percentiles = [0.1, 0.2...., 0.9])

Problem

Attempt

Advertisement

Answer