Skip to content
Advertisement

Filter out dataframe based on values being within the 90th percentile

Suppose I have this dataframe

JavaScript

Now I want to go through each column and filter out the low percentiles keeping only values that are contained in the 90th percentile.

Thus since apple and bob are each within their associated 90th percentiles I would have this dataframe

JavaScript

How do I achieve this?

Advertisement

Answer

Hope this helps:

  1. Calculate 90th percentile and keep them in a dictionary:
JavaScript

2.Calculate value counts of each value:

JavaScript
  1. Compare value counts and percentile:
JavaScript

Note: There are some nan values in value_dic but it won’t hurt the performance.

User contributions licensed under: CC BY-SA
1 People found this is helpful
Advertisement