I have a list of column names & want to drop the rows that have more than 1 NaN
values but this error occurs: dropna() got an unexpected keyword argument 'thresh'
.
My pandas is updated, the version is 1.1.5
Previously I’ve done a little data cleaning, think it caused my df rows to become str, but I converted them to np.nan
…
My Whole Code:
JavaScript
x
17
17
1
df = pd.read_csv('dataframe.csv', index=False)
2
col_list = ['col1', 'col2', 'col3']
3
4
""" # DataClean
5
for col in col_list:
6
df[col] = pd.to_numeric(df[col]
7
.str.replace('[^0-9- ]', '')
8
.str.split(' - ')
9
.explode()
10
).mean(level=0)
11
"""
12
13
df = df.replace('NaN', np.nan)
14
15
for col in col_list:
16
df[col] = df[col].dropna(thresh=1, axis=0)
17
Advertisement
Answer
When you use dropna(thresh=1, axis=0)
it will drop rows that have just nan
values, for your purpose, you can do the following:
JavaScript
1
2
1
df.dropna(subset=col_list, how='any', axis=0)
2