Skip to content
Advertisement

Webscaping table data with drop down menu help in either Pandas, Beautiful Soup or Selenium

I am trying to scrape data from this website:

https://www.shanghairanking.com/rankings/grsssd/2021

Initially pandas gets me out the gates and I can scrape the table but I am struggling with the drop down menus. I want to select the options next to the total score box which are PUB, CIT, etc. When I inspect the element it looks like maybe Javascript and the usual methods of interating over these options don’t work. I have tried Beutifalsoup and most recently Selenium to select the drop downs by hand. This works for the default table data ”’

JavaScript

”’ Doesn’t get me anywhere.

Advertisement

Answer

Your code would not work as you first have to click the dropdown open and then traverse through the options in the dropdown. Here is the refactored code.

Note that I have used time.sleep for instant purposes but for a robust code and good practice, use explicit wait such as WebdriverWait

JavaScript
User contributions licensed under: CC BY-SA
3 People found this is helpful
Advertisement