How to remove white space in between ascii and nonascii chars?

Question

For example: I want to find at least 3 ascii chars, followed by a space, then followed by a nonascii char, and replace the white space with empty string. My code has two issues: How to write the replacement string for (s)? How to make it also work for the reverse order of s2?: [^a-zA-Z0-9] Answer Put the strings that

Accepted Answer

Put the strings that you want to keep in the result in capture groups, then reference them in the replacement.s = re.sub(r'([a-zA-Z0-9]{3})s([^a-zA-Z0-9])', r'12', s1)You don&#8217;t need to use {3,}, you can just use {3}. This will copy the last 3 characters to the result. All the preceding characters will be copied by default because they&#8217;re not being replaced.You can also do it with lookarounds, by matching a space that&#8217;s preceded by 3 ASCII characters and followed by a non-ASCII. Then you replace the space with an empty string.s = re.sub(r'(?<=[a-zA-Z0-9]{3})s(?=[^a-zA-Z0-9])', '', s1)You can use alternative in this method to match both orderss = re.sub(r'(?<=[a-zA-Z0-9]{3})s(?=[^a-zA-Z0-9])|(?<=[^a-zA-Z0-9])s(?=[a-zA-Z0-9]{3})', '', s1)

Advertisement

Answer