Trying to get only the text between two strong tags

Question

I am currently trying to get only the HTML text (a list of names) that is between the first two occurrences of the strong tag. Here is a short example of the HTML I scraped Hers is some quick code that I wrote with the basic logic of counting the number of strong tags occurring. I know after the second

Accepted Answer

To answer the question as is, leaving the opportunity to scrape the “Title of Article” and “Footnotes”. You can use findChildren() then decompose() to remove unwanted elements. From the output of this code you can extract the data you need quite easily. It works even if the text “PRESENT” and “Section Header” are not present. It can easily be adapted to remove elements before the first “Strong” tag if needed.from bs4 import BeautifulSoup, elementhtml = """

blah blah

Title of Article

Section Header 1

A paragraph with some information and footnotes¹

PRESENT:

John Smith, Farmer
William Dud, Bum
Luke Brain, Terrible Singer
Charles Evans, Doctor
Stanley Fish, Fisher

George Jungle, Savage

William, Baller

Roy Williams, Coach

Section Header 2
A second paragraph with lots of text and footnotes