I am trying to scrape a website whose candidate URLs are stored in a CSV file. I read the CSV, then loop over the URLs and pass each one to a method that opens the URL and scrapes the contents of the site.
But for some reason I am unable to loop over and open the URLs.
Here is my code:
from selenium import webdriver
import time
import pandas as pd

chrome_options = webdriver.ChromeOptions()
chrome_options.add_argument('--user-agent="Mozilla/5.0 (Windows Phone 10.0; Android 4.2.1; Microsoft; Lumia 640 XL LTE) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/42.0.2311.135 Mobile Safari/537.36 Edge/12.10166"')
driver = webdriver.Chrome(chrome_options=chrome_options)

csv_data = pd.read_csv("./data_new.csv")
df = pd.DataFrame(csv_data)
urls = df['url']
print(urls[:5])

def scrap_site(url):
    print("Recived URL ---> ", url)
    driver.get(url)
    time.sleep(5)
    driver.quit()

for url in urls:
    print("URL ---> ", url)
    scrap_site(url)
This is the error I am getting on the console:
Traceback (most recent call last):
  File "/media/sf_shared_folder/scrape_project/course.py", line 56, in <module>
    scrap_site(url)
  File "/media/sf_shared_folder/scrape_project/course.py", line 35, in scrap_site
    driver.get(url)
  File "/home/mujib/anaconda3/envs/spyder/lib/python3.9/site-packages/selenium/webdriver/remote/webdriver.py", line 333, in get
    self.execute(Command.GET, {'url': url})
  File "/home/mujib/anaconda3/envs/spyder/lib/python3.9/site-packages/selenium/webdriver/remote/webdriver.py", line 321, in execute
    self.error_handler.check_response(response)
  File "/home/mujib/anaconda3/envs/spyder/lib/python3.9/site-packages/selenium/webdriver/remote/errorhandler.py", line 242, in check_response
    raise exception_class(message, screen, stacktrace)
InvalidArgumentException: invalid argument
  (Session info: chrome=96.0.4664.45)
The CSV file has the following format:
url
http://www.somesite.com
http://www.someothersite.com
Answer
You need to put driver = webdriver.Chrome(chrome_options=chrome_options) inside the loop (or inside the function the loop calls). Once driver.quit() is called, the browser session is closed, so the next driver.get(url) has no valid session to work with; you have to create the driver again before opening the next URL.
from selenium import webdriver
import time
import pandas as pd

chrome_options = webdriver.ChromeOptions()
chrome_options.add_argument('--user-agent="Mozilla/5.0 (Windows Phone 10.0; Android 4.2.1; Microsoft; Lumia 640 XL LTE) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/42.0.2311.135 Mobile Safari/537.36 Edge/12.10166"')
# driver = webdriver.Chrome(chrome_options=chrome_options)

csv_data = pd.read_csv("./data_new.csv")
df = pd.DataFrame(csv_data)
# urls = df['url']
urls = ['https://stackoverflow.com/', 'https://www.yahoo.com/']
print(urls[:5])

def scrap_site(url):
    ############## OPEN THE DRIVER HERE ##############
    driver = webdriver.Chrome(chrome_options=chrome_options)
    ############## OPEN THE DRIVER HERE ##############
    print("Recived URL ---> ", url)
    driver.get(url)
    time.sleep(5)
    driver.quit()

for url in urls:
    print("URL ---> ", url)
    scrap_site(url)
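If starting a new Chrome instance for every URL is too slow, another option is to keep a single driver alive for the whole loop and call driver.quit() only once at the end. The sketch below is a minimal illustration of that layout, assuming the same data_new.csv file with a url column as in the question; it is not the only way to structure it:

from selenium import webdriver
import time
import pandas as pd

chrome_options = webdriver.ChromeOptions()
driver = webdriver.Chrome(chrome_options=chrome_options)  # created once, before the loop

# assumes data_new.csv has a 'url' column, as shown in the question
urls = pd.read_csv("./data_new.csv")['url']

def scrap_site(url):
    print("Received URL ---> ", url)
    driver.get(url)    # reuse the same browser session for every URL
    time.sleep(5)      # note: no driver.quit() here

for url in urls:
    scrap_site(url)

driver.quit()  # close the browser only after all URLs have been processed

Either way, the key point is the same: driver.get() can only be called on a session that has not yet been quit.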