Extracting Specific Text From column in dataframe

Question

I have the following dataframe and I&#8217;m trying to extract the string that has the ABC followed by it&#8217;s numbers. Description ABC12345679 132465 Test ABC12346548 Test ABC1231321 4645 I have tried: But its giving me what it comes after on instances that there&#8217;s more text after the ABC* like so: …

Accepted Answer

We can use regex to extract the necessary part of the string.Here we are checking for atleast one [A-C] and 0 or more[0-9]data['extract'] = data.Description.str.extract(r'([A-C]+[0-9]*)')or (based on need)data['extract'] = data.Description.str.extract(r'([A-C]+[0-9]+)')Output    Description             extract0   ABC12345679 132465      ABC123456791   Test ABC12346548        ABC123465482   Test ABC1231321 4645    ABC1231321

Advertisement

Answer