Python DataFrame String replace accidently Returing NaN

I encounter a weird problem in Python Pandas, while I read a excel and replace a character “k”, the result gives me NaN for the rows without “K”. see below image

It should return 173 on row #4，instead of NaN, but if I create a brand new excel, and type the same number. it will work.

or if i use this code,

df = pd.DataFrame({ 'sales':['75.8K','6.9K','7K','6.9K','173','148']})
df

JavaScript
​x
 
df = pd.DataFrame({ 'sales':['75.8K','6.9K','7K','6.9K','173','148']})
df
​

then it will works well. Why? please advise!

Answer

Try this:

df['nums'] = df['sales'].astype(str)
df['nums'] = pd.to_numeric(df['nums'].str.replace('K', ''))

JavaScript
 
df['nums'] = df['sales'].astype(str)
df['nums'] = pd.to_numeric(df['nums'].str.replace('K', ''))
​

Advertisement

Answer