I have a dictionary like this:
no_empty_keys = {
    '783': [
        ['4gsx', 'ADTQGS', 0.3333333333333333, {'A': ['A224', 'T226'], 'B': ['A224', 'T226']}, 504, 509],
        ['4gt0', 'ADTQGS', 0.3333333333333333, {'A': ['A224', 'T226'], 'B': ['A224', 'T226']}, 504, 509],
    ],
    '1062': [
        ['4gsx', 'AELTGY', 0.5, {'A': ['L175', 'T176', 'Y178'], 'B': ['L175', 'T176', 'Y178']}, 453, 458],
        ['4gt0', 'AELTGY', 0.5, {'A': ['L175', 'T176', 'Y178'], 'B': ['L175', 'T176', 'Y178']}, 453, 458],
    ],
}
My code to transform that into a CSV is this:

import pandas as pd

epitope_df = pd.DataFrame(columns=['Epitope ID', 'PDB', 'Percent Identity', 'Epitope Mapped', 'Epitope Sequence', 'Starting Position', 'Ending Position'])
for x in no_empty_keys:
    for y in no_empty_keys[x]:
        epitope_df = epitope_df.append({'Epitope ID': x, 'PDB': y[0], 'Percent Identity': y[2], 'Epitope Mapped': y[3], 'Epitope Sequence': y[1], 'Starting Position': y[4], 'Ending Position': y[5]}, ignore_index=True)
epitope_df.to_csv('test.csv', index=False)
My output is a CSV file like this:

Epitope ID,PDB,Percent Identity,Epitope Mapped,Epitope Sequence,Starting Position,Ending Position
783,4gsx,0.3333333333333333,"{'A': ['A224', 'T226'], 'B': ['A224', 'T226']}",ADTQGS,504,509
783,4gt0,0.3333333333333333,"{'A': ['A224', 'T226'], 'B': ['A224', 'T226']}",ADTQGS,504,509
1062,4gsx,0.5,"{'A': ['L175', 'T176', 'Y178'], 'B': ['L175', 'T176', 'Y178']}",AELTGY,453,458
1062,4gt0,0.5,"{'A': ['L175', 'T176', 'Y178'], 'B': ['L175', 'T176', 'Y178']}",AELTGY,453,458
It works, but it isn't well optimized: the process becomes very slow once the dictionary has more than 10,000 entries. Any ideas on how to speed this up? Thank you for your time.
Answer
I’d start by getting rid of DataFrame.append. Appending rows one at a time is inefficient, because every call copies the entire DataFrame (the method was deprecated in pandas 1.4 and removed in 2.0). Instead, collect the rows in a plain Python list and create the DataFrame in one go:
result = []
for x, rows in no_empty_keys.items():
    for y in rows:
        result.append(
            {
                'Epitope ID': x,
                'PDB': y[0],
                'Percent Identity': y[2],
                'Epitope Mapped': y[3],
                'Epitope Sequence': y[1],
                'Starting Position': y[4],
                'Ending Position': y[5]
            }
        )

# Build the DataFrame from all records in a single call
epitope_df = pd.DataFrame.from_records(result)
epitope_df.to_csv('new.csv', index=False)
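
If you only ever need the CSV file and not the DataFrame itself, you could skip pandas entirely and stream the rows with the standard library's csv module. This is a minimal sketch assuming the same data layout as above, i.e. each inner list is [pdb, sequence, identity, mapped, start, end]:

import csv

columns = ['Epitope ID', 'PDB', 'Percent Identity', 'Epitope Mapped',
           'Epitope Sequence', 'Starting Position', 'Ending Position']

with open('new.csv', 'w', newline='') as f:
    writer = csv.writer(f)
    writer.writerow(columns)  # header row
    for epitope_id, entries in no_empty_keys.items():
        # unpack each entry in the order it appears in the source lists
        for pdb, seq, identity, mapped, start, end in entries:
            # reorder the fields to match the pandas column order above
            writer.writerow([epitope_id, pdb, identity, mapped, seq, start, end])

This writes each row as it is produced, so memory use stays flat even for dictionaries with far more than 10,000 entries.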