How to turn a Pandas DataFrame into a table of vectors

Question

I have a two-column Pandas DataFrame containing user_ids and the URLs they have visited. I want to create a vector representation of it, with one row per user and one column per URL. I've tried different things, but keep hitting a wall. Any ideas?

Accepted Answer

What you're describing is a pivot of the urls column:

    import pandas as pd

    # Make data
    df = pd.DataFrame([
        ['user1', 'url1'],
        ['user1', 'url3'],
        ['user1', 'url5'],
        ['user2', 'url2'],
        ['user2', 'url4'],
        ['user2', 'url5'],
        ['user3', 'url1'],
        ['user3', 'url4'],
        ['user3', 'url5'],
    ], columns=['users', 'urls'])

    # Add a column of ones to fill the pivoted values
    df['count'] = 1

    # One row per user, one column per URL; missing pairs become 0
    new_df = df.pivot(index='users', columns='urls', values='count').fillna(0)
    new_df
    # urls   url1  url2  url3  url4  url5
    # users
    # user1   1.0   0.0   1.0   0.0   1.0
    # user2   0.0   1.0   0.0   1.0   1.0
    # user3   1.0   0.0   0.0   1.0   1.0

This puts the users column in the index, but you can use reset_index to make it a regular column again.
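As a possible variation on the answer above, here is a minimal sketch that uses pd.crosstab instead of pivot. This is a swapped-in technique, not the answerer's method: crosstab counts each (user, url) pair itself, so it needs no helper column, returns integers, and does not raise an error if a user visited the same URL twice. The names counts, flat and user_vectors are hypothetical, chosen only for illustration. The sketch also shows the reset_index step the answer mentions.

```python
import pandas as pd

# Same sample data as in the answer
df = pd.DataFrame(
    [
        ["user1", "url1"], ["user1", "url3"], ["user1", "url5"],
        ["user2", "url2"], ["user2", "url4"], ["user2", "url5"],
        ["user3", "url1"], ["user3", "url4"], ["user3", "url5"],
    ],
    columns=["users", "urls"],
)

# Count how often each (user, url) pair occurs; absent pairs become 0
counts = pd.crosstab(df["users"], df["urls"])

# Move "users" from the index back into a regular column
flat = counts.reset_index()
flat.columns.name = None  # drop the leftover "urls" label on the column axis

# One plain Python list per user, keyed by user id (hypothetical helper dict)
user_vectors = {user: row.tolist() for user, row in counts.iterrows()}

print(flat)
print(user_vectors["user1"])
```

If the values should be strictly 0 or 1 even when a pair repeats, one option is to clip the counts with counts.clip(upper=1).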

