I am trying to add a ‘sex’ column to an existing ‘tips’ dataframe. There are 244 rows that need to be filled randomly with either ‘Male’ or ‘Female’. I have tried using a for loop to iterate through each row and assign either list option, but I can’t quite get it right.
sex = ['Male', 'Female'] def sex(): for row in tips['sex']: sex[random.randint(0,1)] tips['sex'] = sex()
Advertisement
Answer
You can use np.random.choice
for this:
import numpy as np import pandas as pd df = pd.DataFrame({'x': [1, 3, 4, 5, 7]}) df['sex'] = np.random.choice(['Male', 'Female'], size=len(df)) df x sex 0 1 Male 1 3 Male 2 4 Male 3 5 Female 4 7 Male