pd.get_dummies() not converting categorical data to one hot encoded vectors when multiple features are used

Question

I just got started on Kaggle, and for my first project I was working on the Titanic dataset. I ran the following code block. In the output I got, the Pclass, SibSp and Parch variables were not converted to one-hot encoded vectors, though the Sex attribute was. I didn't understand why, because when I try to run the pd.get_dummies() function on

Accepted Answer

Use OneHotEncoder from sklearn:

    import pandas as pd
    from sklearn.preprocessing import OneHotEncoder

    df = pd.DataFrame({'Pclass': [0, 1, 2], 'SibSp': [3, 1, 0],
                       'Parch': [0, 2, 2], 'Sex': [0, 1, 1]})

    # Treat every listed column as categorical, whatever its dtype
    ohe = OneHotEncoder()
    data = ohe.fit_transform(df[['Pclass', 'SibSp', 'Parch', 'Sex']])

    # fit_transform returns a sparse matrix, so densify it before building the frame
    df1 = pd.DataFrame(data.toarray(), columns=ohe.get_feature_names_out(), dtype=int)

Output:

    >>> df
       Pclass  SibSp  Parch  Sex
    0       0      3      0    0
    1       1      1      2    1
    2       2      0      2    1

    >>> df1
       Pclass_0  Pclass_1  Pclass_2  SibSp_0  SibSp_1  SibSp_3  Parch_0  Parch_2  Sex_0  Sex_1
    0         1         0         0        0        0        1        1        0      1      0
    1         0         1         0        0        1        0        0        1      0      1
    2         0         0         1        1        0        0        0        1      0      1
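As background on why this happens: when no `columns` argument is given, `pd.get_dummies()` only encodes columns with object, string, or category dtype and passes numeric columns through unchanged. That would explain why Sex was encoded while the integer-valued Pclass, SibSp and Parch were not. If you prefer to stay within pandas, listing the columns explicitly should have the same effect as the encoder above. The sketch below uses a small made-up frame (not the data from the question) in which Sex is assumed to hold strings:

```python
import pandas as pd

# Toy data for illustration only; Sex is assumed to be a string column
df = pd.DataFrame({
    "Pclass": [3, 1, 2],
    "SibSp": [1, 0, 0],
    "Parch": [0, 2, 0],
    "Sex": ["male", "female", "female"],
})

# Default behaviour: only the string column is encoded, integers pass through
default = pd.get_dummies(df)

# Naming the columns forces the integer columns to be encoded as well
explicit = pd.get_dummies(df, columns=["Pclass", "SibSp", "Parch", "Sex"])

print(list(default.columns))
print(list(explicit.columns))
```

Converting the numeric columns with `astype("category")` before calling `pd.get_dummies()` is another way to get the same result.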

Answer