
Code scrapes first webpage twice, but then scrapes the next six as it’s meant to

I’m trying to scrape football scores from 8 pages online. For some reason my code scrapes the results from the first page twice; it then goes on to scrape the next 6 pages as it should, but leaves out the final page.

Here is my code:

from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.support.ui import WebDriverWait

import time
import requests
import numpy as np

chrome_options = Options()
chrome_options.add_argument('headless')

driver = webdriver.Chrome(options=chrome_options)
wait = WebDriverWait(driver, 10)

scores = []

for i in range(1,9,1):
    url = 'https://www.oddsportal.com/soccer/england/premier-league-2020-2021/results/#/page/' + str(i) + '/'
    time.sleep(5)
    driver.get(url)
    
    soup = BeautifulSoup(driver.page_source, 'lxml')
    main_table = soup.find('table', class_ ='table-main')
    rows_of_interest = main_table.find_all('tr', class_ = ['odd deactivate', 'deactivate'])

    for row in rows_of_interest:
        score = row.find('td', class_ = 'center bold table-odds table-score').text
        scores.append(score)

Help would be much appreciated

EDIT:

I fixed it by shifting the loop range up by 1:

for i in range(2,10,1):

I still have no idea why this works, because the page numbers are 1-8.


Answer

You should put a delay between driver.get(url) and soup = BeautifulSoup(driver.page_source, 'lxml') to let the new page load.
Without it, the first iteration reads the first page correctly, since that page has already loaded by the time driver.page_source is read, but the second iteration reads the content of the first page again because the second page has not finished loading yet.
In its current (wrong) location, the time.sleep(5) still lets all the following pages be scraped, but each with a delay of one iteration, which is why the last page is never scraped.
With the delay in the correct place it works as intended:

for i in range(1,9,1):
    url = 'https://www.oddsportal.com/soccer/england/premier-league-2020-2021/results/#/page/' + str(i) + '/'
    driver.get(url)
    time.sleep(5)
    
    soup = BeautifulSoup(driver.page_source, 'lxml')
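
As a side note, the question already creates a WebDriverWait that is never used, and an explicit wait could replace the fixed time.sleep(5). The sketch below is only a suggestion under assumptions: it assumes the site replaces the table.table-main element when the #/page/N/ fragment changes, so waiting for the old element to go stale signals that the new page has rendered.

from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

chrome_options = Options()
chrome_options.add_argument('headless')

driver = webdriver.Chrome(options=chrome_options)
wait = WebDriverWait(driver, 10)

scores = []
old_table = None

for i in range(1, 9):
    url = 'https://www.oddsportal.com/soccer/england/premier-league-2020-2021/results/#/page/' + str(i) + '/'
    driver.get(url)

    if old_table is not None:
        # Only the URL fragment changes between requests, so the site is assumed
        # to swap the results table via JavaScript; wait for the previous table
        # element to go stale before trusting the new page source.
        wait.until(EC.staleness_of(old_table))

    # Wait until the (new) results table is present, then parse the page.
    old_table = wait.until(
        EC.presence_of_element_located((By.CSS_SELECTOR, 'table.table-main'))
    )
    soup = BeautifulSoup(driver.page_source, 'lxml')

If the site updates the table in place rather than replacing the element, the staleness check would time out, and a different wait condition (for example, one tied to the expected page's rows) would be needed.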