How to have categorical regex groups with Python

Question

I have a text which corresponds to a pattern can must be split into categories. I thought of using groups to capture parts of the text that correspond to a particular category patern, and then map that part to my category. Unfortunately, as far as I know group names in Python regex cannot have the same name, …

Accepted Answer

You can install PyPi regex library, use your current pattern without any modifications and upon getting a match using regex.search, access the match.capturesdict:import regexpattern = r"(?P[A-Z]{4})(?Pdd.dd-)(?P[0-9]{4})(?P[A-Z]{3})(?P.*)"text = " someNooise AABT12.20-1215BTTFFFF SomemoreNoize"result = regex.search(pattern, text)print(result.capturesdict() )# => {'catA': ['AABT', '1215', 'BTT'], 'catB': ['12.20-', 'FFFF SomemoreNoize']}See the Python demo.The PyPi regex module supports patterns with identically named capturing groups.

Advertisement

Answer