How to extract the URL of a webpage without knowing beforehand?

Question

I&#8217;m trying to make an iterative web search that pulls up a google search page ONLY when it needs to. Therefore, I don&#8217;t know the URLs ahead of time. I am aware of the .current_url argument from Selenium but it does not give me what I want. When I do print(driver.current_url) I only get https://www…

Accepted Answer

Actually there is no need to go to the google home page to do a regular search. You can directly go on the page of your search like here:def search(driver, text):    driver.get("https://www.google.com/search?q={}".format(text))But if you want to add several other parameters to your search I advise you to look at the module google. It will directly give you the links of the first results of your search like that:>>> import googlesearch>>> query = "A computer science portal">>> for j in googlesearch.search(query, tld="co.in", num=10, stop=10, pause=2):    print(j)    https://www.geeksforgeeks.org/page/4/https://www.geeksforgeeks.org/https://en.wikipedia.org/wiki/Portal:Computer_programminghttps://en.wikiversity.org/wiki/Portal:Computer_Sciencehttp://www.pearltrees.com/u/17097488-geeksforgeeks-computer-sciencehttps://studentportal.gu.se/english/my-studies/csehttps://www.computerscienceonline.org/https://portal.cs.nuim.ie/https://www.quora.com/What-are-the-top-websites-computer-science-students-must-visitIf you do not want to use it directly you can look at the code of the module. As it is not on github you can read the code at the location pip installed it. The code is not very complicated and the interesting part concerning how to produce google search urls is not more than 100 lignes.

Advertisement

Answer