How to scrape billboard using find_all?

Question

I have a pretty specific question if anyone could help that would be awesome. I am trying to scrape the songs from https://www.billboard.com/charts/hot-100/ and I am stuck on this code. What should I put for tag = soup.find_all(&#8216;???&#8217;) to get the title &#8216;About Damn Time. Answer Try to select t…

Accepted Answer

Try to select the row containers, iterate the ResultSet get your expected text from the <h3>:for e in soup.find_all(attrs={'class':'o-chart-results-list-row-container'}):    print(e.h3.get_text(strip=True))ExampleIf you try to store different information, avoid a bunch of differnet lists. Instead use one list with dict of each item and its information.import requestsimport bs4url = 'https://www.billboard.com/charts/hot-100/'res = requests.get(url)soup = bs4.BeautifulSoup(res.text)data = []for e in soup.find_all(attrs={'class':'o-chart-results-list-row-container'}):    data.append({        'title':e.h3.get_text(strip=True),        'author':e.h3.find_next('span').get_text(strip=True)    })data

Advertisement

Answer

Example