Python Error: ‘float’ object has no attribute ‘replace’

Question

I am an R User that is trying to learn more about Python. I found this Python library that I would like to use for address parsing: https://github.com/zehengl/ez-address-parser I was able to try an example over here: I have the following file that I imported: I then applied the above function and export the f…

Accepted Answer

Looking at the code from the library, we have this method for parse in the AddressParser class, and then this function for tokenize that is called by parse# method of AddressParserdef parse(self, address):        if not self.crf:            raise RuntimeError("Model is not loaded")        tokens = tokenize(address)        labels = self.crf.predict([transform(address)])[0]        return list(zip(tokens, labels))def tokenize(s):    s = s.replace("#", " # ")    return [token for token in split(fr"[{puncts}s]+", s) if token]We can see here that tokenize calls replace, and so that is likely where your error is coming from. tokenize is probably expecting a str here (not a float), and that s.replace() is almost certainly for a string replacement.So, your column likely has floats in it when it expects strings. The tokenize function should probably handle that better, but now it is up to you.You should be able to resolve this by forcing your Address column to be strings (pandas will call it &#8216;object&#8217;).df1['string_address'] = df1['ADDRESS'].astype(str)df1['Address_Parse'] = df1['string_address'].apply(ap.parse)

Advertisement

Answer