How do I fill a dictionary with indices in a for loop?

Question

I have a transposed Dataframe tr: 7128 8719 14051 14636 JDUTC_0 2451957.36 2452149.36 2457243.98 2452531.89 JDUTC_1 2451957.37 2452149.36 2457243.99 2452531.90 JDUTC_2 2451957.37 2452149.36 2457244.00 2452531.91 JDUTC_3 NaN 2452149.36 NaN NaN JDUTC_4 NaN 2452149.36 NaN NaN JDUTC_5 NaN 2452149.36 NaN NaN JDUTC_6 1.23 2452149.37 NaN NaN JDUTC_7 NaN NaN NaN NaN JDUTC_8 NaN NaN NaN NaN JDUTC_9 NaN NaN NaN NaN

Accepted Answer

With the correct name convention, I would change your codeafter:import numpy as npimport pandas as pdimport sysif sys.version_info[0] < 3:    from StringIO import StringIOelse:    from io import StringIOs = StringIO("""idx 7128    8719    14051   14636JDUTC_0 2451957.36  2452149.36  2457243.98  2452531.89JDUTC_1 2451957.37  2452149.36  2457243.99  2452531.90JDUTC_2 2451957.37  2452149.36  2457244.00  2452531.91JDUTC_3 NaN 2452149.36  NaN NaNJDUTC_4 NaN 2452149.36  NaN NaNJDUTC_5 NaN 2452149.36  NaN NaNJDUTC_6 1.23    2452149.37  NaN NaNJDUTC_7 NaN NaN NaN NaNJDUTC_8 NaN NaN NaN NaNJDUTC_9 NaN NaN NaN NaN""")tr = pd.read_csv(s, sep="t", index_col=0)(people should give minimal working code &#8211; but often forget to give e.g. the code to build the data frame etc. and the imports)to:a = {}b = []for name, values in tr.items():    b.clear() # this is problematic as you know    for ind, val in enumerate(values):        if np.isnan(val):            b.append(ind)            continue        else:            pass    a[name] = bcontinue and pass are not necessary &#8211; they just say &#8220;go on&#8221; with the loop.In Python, you are not forced to give the else branch:for name, values in tr.items():    b.clear() # This is still problematic at this state.    for ind, val in enumerate(values):        if np.isnan(val):            b.append(ind)    a[name] = bSuch collection of data using for-loops are better done with list-comprehensions:a = {}for name, values in tr.items():    b = [ind for ind, val in enumerate(values) if np.isnan(val)]    a[name] = b# now the result is already correct!And finally, you can even build list-comprehensions for dictionaries &#8211;making this entire code a one-liner &#8211; but a readable one &#8211; when one is familiar with list comprehensions:a = {name: [i for i, x in enumerate(vals) if np.isnan(x)] for name, vals in tr.items()}You can see the result:a# which returns:{'7128': [3, 4, 5, 7, 8, 9], '8719': [7, 8, 9], '14051': [3, 4, 5, 6, 7, 8, 9], '14636': [3, 4, 5, 6, 7, 8, 9]}List-comprehensions are going into the direction of Functional Programming (FP).Which exactly deals with the problem of not to apply mutation (like the b.append() or b.clear() methods &#8211; because &#8211; as you have seen: your case is a demonstration of how easily a bug is generated when using mutation. &#8211; and would contribute to the discussion &#8211; why FP &#8211; while it at the first sight looks brain-unfriendly &#8211; isactually the more brain-friendly way to program.List comprehensions are the Pythonic form of &#8220;map&#8221; &#8211; and if you use a &#8220;if&#8221; inside list comprehensions &#8211; this is the Pythonic equivalent to &#8220;filter&#8221; which FP people know like a second brain for breathing.

	7128	8719	14051	14636
JDUTC_0	2451957.36	2452149.36	2457243.98	2452531.89
JDUTC_1	2451957.37	2452149.36	2457243.99	2452531.90
JDUTC_2	2451957.37	2452149.36	2457244.00	2452531.91
JDUTC_3	NaN	2452149.36	NaN	NaN
JDUTC_4	NaN	2452149.36	NaN	NaN
JDUTC_5	NaN	2452149.36	NaN	NaN
JDUTC_6	1.23	2452149.37	NaN	NaN
JDUTC_7	NaN	NaN	NaN	NaN
JDUTC_8	NaN	NaN	NaN	NaN
JDUTC_9	NaN	NaN	NaN	NaN

Advertisement

Answer