How to extract a specific line in a text file

Question

I am text mining a large document and want to extract a specific line: the description immediately under ITEM DESCRIPTION. I have made many unsuccessful attempts. My latest attempt was: But it did not find the text. Is there a way to find ITEM DESCRIPTION and get the line after it or somethi…

Accepted Answer

The following function finds the description on the line below a given pattern, e.g. “ITEM DESCRIPTION”, and ignores any blank lines that may be present in between. Beware, however, that the function does not handle the special case where the pattern exists but the description does not.

txt = '''CONTINUED ON NEXT PAGE CONTINUATION SHEET REFERENCE NO. OF DOCUMENT BEING CONTINUED:    PAGE 4 OF 16 PAGES
SPE2DH-20-T-0133 SECTION B
PR: 0081939954 NSN/MATERIAL: 6530015627381
ITEM DESCRIPTION
BOTTLE, SAFETY CAP
BOTTLE, SAFETY CAP RPOO1: DLA PACKAGING REQUIREMENTS FOR PROCUREMENT
RAQO1: THIS DOCUMENT INCORPORATES TECHNICAL AND/OR QUALITY REQUIREMENTS (IDENTIFIED BY AN 'R' OR AN 'I' NUMBER) SET FORTH IN FULL TEXT IN THE DLA MASTER LIST OF TECHNICAL AND QUALITY REQUIREMENTS FOUND ON THE WEB AT:'''

I’ve assumed you have your text as a single string, so the function below splits it into a list of lines.

pattern = "ITEM DESCRIPTION"  # the line to search for

def find_pattern_in_txt(txt, pattern):
    # Split on newlines and drop empty lines
    lines = [line for line in txt.split("\n") if line]
    if pattern in lines:
        # Return the line that follows the first exact match
        return lines[lines.index(pattern) + 1]
    return None

print(find_pattern_in_txt(txt, pattern))  # prints: BOTTLE, SAFETY CAP
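If the special case mentioned above matters for your data, one possible variant is sketched below. It is not part of the original answer: the name find_line_after is made up for illustration, and it assumes you also want surrounding whitespace stripped from each line before matching. It returns None both when the pattern is missing and when nothing follows it.

```python
from typing import Optional


def find_line_after(txt: str, pattern: str) -> Optional[str]:
    """Return the first non-blank line after a line equal to `pattern`, else None.

    Hypothetical helper: assumes whitespace around each line is not significant.
    """
    # Strip every line and drop the ones that end up empty.
    lines = [line.strip() for line in txt.splitlines() if line.strip()]
    try:
        idx = lines.index(pattern)
    except ValueError:
        return None  # the pattern does not occur as a whole line
    if idx + 1 >= len(lines):
        return None  # the pattern is the last non-blank line, so no description
    return lines[idx + 1]


sample = (
    "PR: 0081939954 NSN/MATERIAL: 6530015627381\n"
    "ITEM DESCRIPTION\n"
    "\n"
    "  BOTTLE, SAFETY CAP\n"
    "BOTTLE, SAFETY CAP RPOO1: DLA PACKAGING REQUIREMENTS FOR PROCUREMENT\n"
)

print(find_line_after(sample, "ITEM DESCRIPTION"))              # BOTTLE, SAFETY CAP
print(find_line_after("ITEM DESCRIPTION\n", "ITEM DESCRIPTION"))  # None
print(find_line_after("no heading here", "ITEM DESCRIPTION"))     # None
```

Returning None in both failure cases keeps the caller's check simple. If you need to tell the two cases apart, raising distinct exceptions would be a reasonable alternative.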

Answer