Consolidating categories in columns

Question

I have a df with a race column, which has 4 categories. However, I would like to only have three categories by combining the last two categories. This is what my current df looks like: I want to consolidate the race==3 and race ==4 into one value (which would be race ==3). So my new df output would look something

Accepted Answer

Replace Race 4 by 3 and group data by Race + Sexdf.loc[df['Race']==4, 'Race']=3df = df.groupby(['Race','Sex'],as_index=False)['population'].sum()You getYear State Race Sex  population    2006  CA   1    1    5048932006  CA   1    2    7837602006  CA   2    1    8000622006  CA   2    2    7683002006  CA   3    1    9131712006  CA   3    2    701451

Advertisement

Answer