How to set groups by the percentiles of the whole sample?

Question

I am new to pandas, and I want to figure out how to group values based on sample quantiles. For example, I have a dataframe with a column a:

    df = pd.DataFrame(np.random.randint(0,100,size=(100, 1)), columns=list('a'))

Then what I want to do is divide the values in a into 10 different groups by their deciles, and name the label of their

Accepted Answer

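For the first part, subtract 1, use integer division by 10, and add 1 so that the group numbers start from 1:

    import pandas as pd

    df = pd.DataFrame({'a': range(1, 101)})
    # (a - 1) // 10 maps 1..10 -> 0, 11..20 -> 1, ...; adding 1 starts the groups at 1
    df['b'] = 'group ' + (df.a.sub(1) // 10 + 1).astype(str)
    print(df)

          a         b
    0     1   group 1
    1     2   group 1
    2     3   group 1
    3     4   group 1
    4     5   group 1
    ..  ...       ...
    95   96  group 10
    96   97  group 10
    97   98  group 10
    98   99  group 10
    99  100  group 10

EDIT: For deciles, use qcut:

    # labels=False returns integer bin codes (0 to 9) instead of interval labels
    df['b'] = pd.qcut(df.a, 10, labels=False)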

	a	b
60	2	group 1
30	3	group 1
94	3	group 1
92	3	group 1
63	3	group 1
…	…	…
47	92	group 10
58	98	group 10
66	99	group 10
73	99	group 10
24	100	group 10
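
To get string labels such as group 1 ... group 10 directly from qcut instead of integer codes, one option is to pass a list of labels. The following is a minimal sketch, not part of the original answer: the seeded random data and the ranking step are assumptions added for illustration. Ranking with method="first" breaks ties so the ten bin edges are always unique, at the cost that equal values of a can end up in neighbouring groups.

```python
import numpy as np
import pandas as pd

# Illustrative data (assumption): 100 random integers in [0, 100), seeded for repeatability
rng = np.random.default_rng(0)
df = pd.DataFrame({"a": rng.integers(0, 100, size=100)})

# One label per decile, numbered from 1
labels = [f"group {i}" for i in range(1, 11)]

# rank(method="first") gives every row a distinct rank, so qcut never
# hits duplicate bin edges and each decile receives exactly 10 rows
df["b"] = pd.qcut(df["a"].rank(method="first"), 10, labels=labels)

print(df.sort_values("a"))
```

If tied values must stay in the same group, pd.qcut(df["a"], 10, labels=False, duplicates="drop") is an alternative, though it may return fewer than 10 groups when bin edges coincide.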


Answer