Can’t get tags when scraping data

Question

I am trying to scrape all tr tags using BeautifulSoup, but the search returns nothing. Even though there are tr tags on the page at this URL, the result is empty, and indexing into it throws an IndexError. Why is this happening?

Accepted Answer

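In the page source, the table is located inside an HTML comment, so BeautifulSoup does not see its tr tags as part of the document tree. You need to extract the comment's content and then parse it as HTML:

    from urllib.request import urlopen

    from bs4 import BeautifulSoup, Comment

    url = 'https://www.pro-football-reference.com/years/2020/defense_advanced.htm'
    html = urlopen(url)
    soup = BeautifulSoup(html, "lxml")

    # Find the comment node that contains the table markup
    comment = soup.find(string=lambda text: isinstance(text, Comment) and 'class="table_outer_container"' in text)

    # Parse the comment's text as a separate HTML document
    stats_page = BeautifulSoup(comment, "lxml")

    # The first row holds the column headers
    column_headers = stats_page.find_all('tr')[0]
    column_headers = [i.get_text() for i in column_headers.find_all('th')]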
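To see why the direct search comes back empty, here is a minimal offline sketch of the same technique. The SAMPLE_HTML string and the helper name headers_from_commented_table are made up for illustration and are not taken from the real page; the sketch also uses the built-in "html.parser" instead of "lxml" so that it needs no extra parser installed.

```python
from bs4 import BeautifulSoup, Comment

# Hypothetical stand-in for the real page: the table sits inside an HTML comment.
SAMPLE_HTML = """
<html><body>
<div id="wrapper">
<!--
<div class="table_outer_container">
<table>
<tr><th>Col A</th><th>Col B</th><th>Col C</th></tr>
<tr><td>1</td><td>2</td><td>3</td></tr>
</table>
</div>
-->
</div>
</body></html>
"""


def headers_from_commented_table(html, marker='class="table_outer_container"'):
    """Return (rows found by a direct search, header texts from the commented table)."""
    soup = BeautifulSoup(html, "html.parser")

    # A direct search only sees real tags, not markup hidden inside a comment.
    direct_rows = soup.find_all("tr")

    # Locate the comment node whose text contains the marker.
    comment = soup.find(string=lambda s: isinstance(s, Comment) and marker in s)
    if comment is None:
        return direct_rows, []

    # Parse the comment's text as its own HTML document.
    inner = BeautifulSoup(comment, "html.parser")
    rows = inner.find_all("tr")
    if not rows:
        return direct_rows, []

    # Read the header cells from the first row.
    headers = [th.get_text(strip=True) for th in rows[0].find_all("th")]
    return direct_rows, headers


direct_rows, headers = headers_from_commented_table(SAMPLE_HTML)
print(len(direct_rows))
print(headers)
```

Checking for a missing comment or an empty row list before indexing avoids the IndexError from the question if the page layout ever changes.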
