Regex Pattern in Python for special characters

Question

I asked a similar question on here a few days ago and it was a great help! As a new challenge, I wanted to develop the regex pattern further so that it looks for specific formats in this iteration. I thought I had solved it by using regex101 to build and test the pattern, but when I applied it in Python I received 'pattern contain

Accepted Answer

Your (?<={)[\d+. ]+(?=}) regex contains no capturing groups, while Series.str.extractall requires at least one capturing group to output a value.

You need to use

    (df.stack().str.extractall(r'\{\s*(\d+(?:\.\d+)?)\s*}')[0].astype(float)
       .groupby(level=[0,1]).sum().unstack())

Output:

       CF1  CF2
    0  3.0  2.0
    1  3.0  5.0

The \{\s*(\d+(?:\.\d+)?)\s*} regex matches

\{ - a { char
\s* - zero or more whitespaces
(\d+(?:\.\d+)?) - Group 1 (the value captured by this group is what extractall outputs, since the method requires at least one capturing group): one or more digits, then an optional occurrence of a . and one or more digits
\s* - zero or more whitespaces
} - a } char.
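The difference between the two patterns can be checked without pandas. This is a minimal sketch using only the standard re module. The sample string is a made-up assumption, not data from the question, and the variable names are illustrative. A compiled pattern's groups attribute reports how many capturing groups it defines, which is the property the answer says extractall depends on.

```python
import re

# Lookaround version from the question: it matches the text between braces,
# but defines no capturing group.
lookaround = re.compile(r"(?<={)[\d+. ]+(?=})")

# Version from the answer: Group 1 captures the number inside the braces.
capturing = re.compile(r"\{\s*(\d+(?:\.\d+)?)\s*}")

print(lookaround.groups)  # 0
print(capturing.groups)   # 1

# Hypothetical cell value; "{x}" is skipped because it holds no digits.
sample = "a { 1.5 } b {2} c {x}"

# With exactly one capturing group, findall returns the Group 1 strings.
values = [float(v) for v in capturing.findall(sample)]
total = sum(values)

print(values)  # [1.5, 2.0]
print(total)   # 3.5
```

Summing the captured values per string mirrors, on a single cell, what the groupby(...).sum() step in the answer does per row and column.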

Answer