I am trying to calculate the % of men and women in a dataframe column named “gender”.
“gender” is defined as an object taking 3 values : “Man” “Woman” “nan” (NA).
I did this :
JavaScript
x
11
11
1
total = len(df['gender'])
2
men = len(df[df['gender']=="Man"])
3
women = len(df[df['gender']=="Woman"])
4
5
pct_men = round(men/total*100,1)
6
pct_women = round(women/total*100,1)
7
8
print(f'{pct_men}%')
9
print(f'{pct_women}%')
10
11
But it returns 0.0% for both.
When i check ‘total’ value it returns : 10123033 but zero for both ‘men’ and ‘women’ Thanks.
Advertisement
Answer
pay attention maybe in your dataframe “man” or “men” or “Men” exist instead of “Man” and for woman …
try this:
JavaScript
1
14
14
1
import pandas as pd
2
df = pd.DataFrame({'gender': {0: 'Man', 1:'Woman', 2:'Man'}})
3
4
total = len(df['gender'])
5
men = len(df[df['gender']=="Man"])
6
women = len(df[df['gender']=="Woman"])
7
8
9
pct_men = round(men/total*100,1)
10
pct_women = round(women/total*100,1)
11
12
print(f'{pct_men}%')
13
print(f'{pct_women}%')
14
output:
JavaScript
1
3
1
66.7%
2
33.3%
3