Create a new list of dictionary from the index in dataframe Python with the fastest way

Question

I have a ~200mil data in dictionary index_data: Key is a value in CustId and Value is an index of CustID in df_data: I have a DataFrame df_data: NOTE: If CustID is duplicate, only column Score have different data in each row I want to create a new list of dict(Total_Score is an avg Score of each CustID, Numbe…

Accepted Answer

In [353]: dfOut[353]:              CustID  Score  Number1  Number2  Phone0  3396623046050748      2        2        3   00001  3396623046050748      6        2        3   00002  3749192045350356      1       56       23   22223  4605074846433127     67      532      321   33334   112884719857303      3       11       66   44445   507466746864539      7       22       96   5555    In [351]: d = df.groupby(['CustID', 'Phone', round(df.Number2.div(df.Number1), 2)])['Score'].mean().reset_index(name='Total_Score').rename(columns={'level_2': 'Number'}).to_dict('records')In [352]: dOut[352]: [{'CustID': 112884719857303, 'Phone': 4444, 'Number': 6.0, 'Total_Score': 3}, {'CustID': 507466746864539, 'Phone': 5555, 'Number': 4.36, 'Total_Score': 7}, {'CustID': 3396623046050748, 'Phone': 0000, 'Number': 1.5, 'Total_Score': 4}, {'CustID': 3749192045350356, 'Phone': 2222, 'Number': 0.41, 'Total_Score': 1}, {'CustID': 4605074846433127, 'Phone': 3333, 'Number': 0.6, 'Total_Score': 67}]

Advertisement

Answer