I have the following function to keep only ASCII characters:
JavaScript
x
6
1
import re
2
3
def remove_special_chars(txt):
4
txt = re.sub(r'[^x00-x7F]+', '', txt)
5
return txt
6
But now I also want to keep the copyright symbol (©). What should I add to the pattern?
Advertisement
Answer
Add the copyright symbol’s hex xA9
(source) to your match group:
JavaScript
1
2
1
txt = re.sub(r'[^x00-x7FxA9]+', '', txt)
2