How to scrape billboard using find_all?

Question

I have a pretty specific question if anyone could help that would be awesome. I am trying to scrape the songs from https://www.billboard.com/charts/hot-100/ and I am stuck on this code. What should I put for tag = soup.find_all('???') to get the title 'About Damn Time. Answer Try to select the row containers, iterate the ResultSet get your expected text from

Accepted Answer

Try to select the row containers, iterate the ResultSet get your expected text from the <h3>:for e in soup.find_all(attrs={'class':'o-chart-results-list-row-container'}):    print(e.h3.get_text(strip=True))ExampleIf you try to store different information, avoid a bunch of differnet lists. Instead use one list with dict of each item and its information.import requestsimport bs4url = 'https://www.billboard.com/charts/hot-100/'res = requests.get(url)soup = bs4.BeautifulSoup(res.text)data = []for e in soup.find_all(attrs={'class':'o-chart-results-list-row-container'}):    data.append({        'title':e.h3.get_text(strip=True),        'author':e.h3.find_next('span').get_text(strip=True)    })data

Advertisement

Answer

Example