I am trying to add a ‘sex’ column to an existing ‘tips’ dataframe. There are 244 rows that need to be filled randomly with either ‘Male’ or ‘Female’. I have tried using a for loop to iterate through each row and assign either list option, but I can’t quite get it right.
sex = ['Male', 'Female']
def sex():
    for row in tips['sex']:
        sex[random.randint(0,1)]
tips['sex'] = sex()
Advertisement
Answer
You can use np.random.choice for this:
import numpy as np
import pandas as pd
df = pd.DataFrame({'x': [1, 3, 4, 5, 7]})
df['sex'] = np.random.choice(['Male', 'Female'], size=len(df))
df
   x     sex
0  1    Male
1  3    Male
2  4    Male
3  5  Female
4  7    Male