Loop through multiple small Pandas dataframes and create summary dataframes based on a single column

Question

I have a bunch of small dataframes each representing a single match in a game. I would like to take these dataframes and consolidate them into a single dataframe for each player without knowing the player's names ahead of time. The starting dataframes look like this: And I would like to get to a series of frames looking like this

Accepted Answer

Let&#8217;s try concat + groupby then build out a dict:dfs = {group_name: df_       for group_name, df_ in pd.concat([df1, df2]).groupby('NAME')}dfs:{'player1':       NAME  VAL1  VAL2  VAL30  player1     3     5     7, 'player2':       NAME  VAL1  VAL2  VAL31  player2     2     6     80  player2     5     7     7, 'player3':       NAME  VAL1  VAL2  VAL32  player3     3     6     71  player3     2     6     8, 'player5':       NAME  VAL1  VAL2  VAL32  player5     3     6     7}Each player&#8217;s DataFrame can then be accessed like:dfs['player1']:      NAME  VAL1  VAL2  VAL30  player1     3     5     7Or as a list:dfs = [df_ for _, df_ in pd.concat([df1, df2]).groupby('NAME')]dfs:[      NAME  VAL1  VAL2  VAL30  player1     3     5     7,       NAME  VAL1  VAL2  VAL31  player2     2     6     80  player2     5     7     7,       NAME  VAL1  VAL2  VAL32  player3     3     6     71  player3     2     6     8,       NAME  VAL1  VAL2  VAL32  player5     3     6     7]Each player&#8217;s DataFrame can then be accessed like:dfs[1]:      NAME  VAL1  VAL2  VAL31  player2     2     6     80  player2     5     7     7

Advertisement

Answer