Python Pandas Mixed Type Warning – “dtype” preserves data?

Question

I have this code that gives this warning: I have searched across both google and stackoverflow and people seem to give two kinds of solutions: low_memory = False converters Problem with #1 is it merely silences the warning but does not solve the underlying problem (correct me if I am wrong). Problem with #2 is converters might do things we

Accepted Answer

You have different choices to read your file>>> %cat data.csvCol_2112242.24-232e-3empty.90832Case 1: let Pandas determines datatype# df = pd.read_csv('data.csv')>>> df    Col_210       121   242.242  -232e-33    empty4   .90832>>> df.info()... 0   Col_21  5 non-null      object...Case 2: add strings to recognize NaN values and let Pandas determines datatype# df = pd.read_csv('data.csv', na_values='empty')>>> df      Col_210   12.000001  242.240002   -0.232003        NaN4    0.90832>>> df.info()... 0   Col_21  4 non-null      float64...Case 3: add strings to recognize NaN values but keep data as plain text# df = pd.read_csv('data.csv', na_values='empty', dtype={'Col_21': str})>>> df    Col_210       121   242.242  -232e-33      NaN4   .90832>>> df.info()... 0   Col_21  4 non-null      object...

Advertisement

Answer