How to remove quotes from Numeric data in Python

Question

I have a numeric feature in a data frame, but in the Excel file some of the values are wrapped in quotes that need to be removed. The table below shows how the data appears in Excel. I want to remove the quotes from the last 3 rows using Python.

    Col1   Col2
    123    A
    456    B
    789    C
    "123"  D
    "456"  E
    "789"

Accepted Answer

One way is to use a converter function while reading the Excel file. Something along these lines (assuming the data is in columns 'A' and 'B' of the Excel file):

    import pandas as pd

    def conversion(value):
        # Integer cells are already clean, so return them unchanged.
        if isinstance(value, int):
            return value
        # Otherwise the cell is text such as '"123"', so strip the quotes.
        return value.strip('"')

    df = pd.read_excel('remove_quotes_excel.xlsx', header=None,
                       converters={0: conversion})

    # df
    #      0  1
    # 0  123  A
    # 1  456  B
    # 2  789  C
    # 3  123  D
    # 4  456  E
    # 5  789  F

Both columns are object type, but now (if needed) it is straightforward to convert the first column to int:

    df[0] = df[0].astype(int)
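As a variation on the converter above, here is a minimal sketch of a slightly more defensive version that returns an int directly, so no separate astype step is needed. The helper name strip_quotes_to_int and the sample list are hypothetical, chosen only for illustration. The sketch runs on plain Python values so it can be tried without an Excel file.

```python
def strip_quotes_to_int(value):
    """Hypothetical helper: return an int whether or not the cell was quoted."""
    if isinstance(value, str):
        # Text cell such as '"123"': drop surrounding whitespace and double quotes.
        return int(value.strip().strip('"'))
    # Numeric cell (int, or a whole-number float such as 123.0).
    return int(value)


# Sample values mirroring Col1 from the question (assumed, not read from a file).
sample = [123, 456, 789, '"123"', '"456"', '"789"']
cleaned = [strip_quotes_to_int(v) for v in sample]
print(cleaned)  # [123, 456, 789, 123, 456, 789]
```

A function like this could presumably be passed to pd.read_excel in the same way, as converters={0: strip_quotes_to_int}. Note that it assumes every cell in the column holds a number or a quoted number; an empty or non-numeric cell would raise a ValueError.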

Answer