I’ve web scraped addresses and the address strings have unwanted elements like “n” and “<br/>”, how do I remove them?
JavaScript
x
2
1
Rosemount Viaduct,<br />rnAberdeen<br />rn
2
Advertisement
Answer
You can clean these html leftovers with a regular expression:
JavaScript
1
5
1
import re
2
3
value = "Rosemount Viaduct,<br />rnAberdeen<br />rn"
4
clean_value = re.sub(r'<brs/>rn', r'', value)
5