I have a dataset that looks like this
| ID | Details |
|---|---|
| 1 | he wants to invest, Project: Emaar, budget [] |
| 2 | she is interested in renting, Project: W Residence, bedrooms=2 |
| 3 | wants to sell, Project: Dubai View; callback |
I need to extract project name, which is located between a word ‘Project:’ and closet character (for e.x. , | ;)
So that in the result it looks like this:
| ID | Details |
|---|---|
| 1 | Emaar |
| 2 | W Residence |
| 3 | Dubai View |
Advertisement
Answer
If the comma & semi-colon are always at the end of the project name and your projects only have letters & spaces in their names, then you could use this regex:
Project: ([A-Za-z ]+)[;,]