How to extract deeply nested tags using Beautiful Soup

Question

I have the content below and I am trying to understand how to extract the <p> tag copy using Beautiful Soup (I am open to other methods). As you can see, the <p> tags are not both nested inside the same <div>. I gave it a shot with the following method, but that only seems to work when both &l…

Accepted Answer

Since the p tags are inside div class="inside-panel-1", we can easily grab them by calling the select method as follows:

    from bs4 import BeautifulSoup

    html = """
    Some Title
    I want to extract this copy
    """

    soup = BeautifulSoup(html, 'html.parser')
    # print(soup.prettify())

    # Match every inside-panel-1 div that sits anywhere below div.top-panel;
    # get_text() then returns the text of the p tag it contains.
    p_tags = soup.select('div.top-panel div[class="inside-panel-1"]')
    for p_tag in p_tags:
        print(p_tag.get_text(strip=True))

Output:

    I want to extract this copy
    I want to extract this copy

Answer