Convert nested dict to dataframe, syntax error?

Question

Problem: I am converting multiple nested dicts to dataframes. I have a slightly different dict that I haven't been able to convert to a dataframe using my attempted solution. I am providing a shortened copy of my dict with dummy values as the reprex. Reprex dict: My attempted solution Code is based on si…

Accepted Answer

As mentioned in the comments, the TypeError is due to result['result'] (which is a dictionary) not being usable as a key. If you used something like result['metric'] instead, the error would no longer be raised, but I think that the resulting structure is not the outcome you want.

For flattening nested data, I often take a recursive approach. Below is a simplified version of my flattenObj function:

    def flattenDict(orig:dict, kList=[], kSep='_', rename={}):
        # Base case: a non-dict value is returned with the key path that led to it
        if not isinstance(orig, dict): return [(kList, orig)]
        # dCt counts the dict-valued entries; if exactly one value is a dict,
        # its key is left out of the flattened name
        tList, dCt = [], len([v for v in orig.values() if isinstance(v,dict)])
        for k, v in orig.items():
            kli = kList + ([] if isinstance(v,dict) and dCt==1 else [str(k)])
            # kSep=None makes the recursive calls return (key-list, value) pairs
            tList += flattenDict(v, kli, None)
        if not isinstance(kSep, str): return tList
        # Top-level call: join each key path, applying any renames
        return {kSep.join([rename.get(k,k) for k in kl]):v for kl,v in tList}

    import pandas as pd

    nrMap = {'current':'cur','reference':'ref'}
    rows = [flattenDict(result, rename=nrMap) for result in reprex_dict['metrics']]
    rowsDf = pd.DataFrame(rows)

The value of rows:

    [{'metric': 'DatasetCorrelationsMetric',
      'cur_pearson_target_prediction_correlation': None,
      'cur_pearson_abs_max_features_correlation': 0.1,
      'cur_cramer_v_target_prediction_correlation': None,
      'cur_cramer_v_abs_max_features_correlation': None,
      'ref_pearson_target_prediction_correlation': None,
      'ref_pearson_abs_max_features_correlation': 0.7,
      'ref_cramer_v_target_prediction_correlation': None,
      'ref_cramer_v_abs_max_features_correlation': None}]

Viewed as rowsDf.T (transposed to fit better), each flattened key becomes a row.

If you don't want the metric column, you can either drop it or omit it by defining rows as

    rows = [flattenDict(result['result'], rename=nrMap) for result in reprex_dict['metrics']]
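To make the two points in the answer concrete, here is a minimal, pandas-free sketch. The question's reprex dict is not reproduced here, so the reprex_dict below is a hypothetical stand-in whose shape is only inferred from the flattened keys in the answer's output. flatten_simple is also a hypothetical helper, simpler than flattenDict: it keeps every key level and does no renaming.

```python
# Hypothetical stand-in for the question's reprex dict (shape is an assumption).
reprex_dict = {
    "metrics": [
        {
            "metric": "DatasetCorrelationsMetric",
            "result": {
                "current": {"stats": {"pearson": {"abs_max_features_correlation": 0.1}}},
                "reference": {"stats": {"pearson": {"abs_max_features_correlation": 0.7}}},
            },
        }
    ]
}

result = reprex_dict["metrics"][0]

# 1) A dict cannot be used as a dictionary key, which raises TypeError.
error_text = None
try:
    bad = {result["result"]: 1}
except TypeError as err:
    error_text = str(err)  # the message mentions an unhashable 'dict'

# A string such as result["metric"] is hashable, so this works.
ok = {result["metric"]: 1}


def flatten_simple(obj, prefix=(), sep="_"):
    """Hypothetical helper: flatten nested dicts, keeping every key level."""
    if not isinstance(obj, dict):
        return {sep.join(prefix): obj}
    flat = {}
    for key, value in obj.items():
        flat.update(flatten_simple(value, prefix + (str(key),), sep))
    return flat


# 2) Flattening the whole entry keeps 'metric'; flattening only
#    result['result'] leaves it out, as suggested in the answer.
rows_with_metric = [flatten_simple(r) for r in reprex_dict["metrics"]]
rows_without_metric = [flatten_simple(r["result"]) for r in reprex_dict["metrics"]]
```

Either list of rows can then be passed to pd.DataFrame as in the answer. Leaving the metric out at flattening time avoids a separate drop step afterwards.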
