Skip to content
Advertisement

Tag: information-extraction

pdfplumber | Extract text from dynamic column layouts

Attempted Solution at bottom of post. I have near-working code that extracts the sentence containing a phrase, across multiple lines. However, some pages have columns. So respective outputs are incorrect; where separate texts are wrongly merged together as a bad sentence. This problem has been addressed in the following posts: Solution 1 Solution 2 Question: How do I “if-condition” whether

Advertisement