I am trying to scrape a website like this: https://seeksophie.com/options/1-5hr-basic-candle-workshop. From this website, I'd like to get all date schedules (for 1 year) for the activity; all of the dates on the website are rendered as span elements. It is important for me to get the notAllowed and flatpickr-disabled classes from those elements, because I will have to filter the available dates from all of them using those attributes. While I'm at it, I also have to get all the times available for a given date (help with that would be very much appreciated), but I think getting the spans is the first priority.
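For reference, the kind of filter I have in mind looks roughly like this (just a sketch; is_available is a hypothetical helper, and it assumes span is a BeautifulSoup Tag for one day cell):

def is_available(span):
    # Sketch only: treat a day span as available when it carries
    # neither of the disabling classes mentioned above.
    classes = span.get("class", [])
    return "notAllowed" not in classes and "flatpickr-disabled" not in classes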
My approach for this is to iteratively click the next-month button and collect all the spans along the way. Something like this:
def find_all_span(self, soup):
    new_soup = soup.__copy__()
    all_spans = []
    for i in range(12):
        days_container = new_soup.find_all("div", {"class": "dayContainer"})
        spans = days_container[2].find_all("span")
        all_spans.extend(spans)
        next_month_clicker = self.page_loader.driver.find_element_by_id(
            "js-placeholder-booking-form-accommodation-date")
        self.page_loader.driver.execute_script("arguments[0].click();", next_month_clicker)
        next_month_clicker = self.page_loader.driver.find_elements_by_class_name("flatpickr-next-month")
        self.page_loader.driver.execute_script("arguments[0].click();", next_month_clicker[2])
        page_response = self.page_loader.driver.page_source
        new_soup = BeautifulSoup(page_response, 'html.parser')
        for span in spans:
            print(span["aria-label"])
    return list(set(all_spans))
Note that soup is exactly the page response parsed by BeautifulSoup with the HTML parser. This only collects the spans for roughly one month, and the click doesn't change the page response, so I never get the spans for the following months. What can I do to solve this? Any other approach would also be okay.
Answer
Finally, after 3 hours :) I am not going to explain everything that is wrong in your script; instead, I will explain my code.
I have to execute all those JavaScript snippets because the website does not let me click the next-month button otherwise (i.e., if it works fine without executing those scripts, you can delete those JavaScript lines). You are using html.parser as the parser, but I am using lxml because it is faster than html.parser. Everything else is straightforward: just click the next-month button and scrape the spans from the page source. You can then do other things with these spans.
Here's the code:
from bs4 import BeautifulSoup
from selenium import webdriver

driver = webdriver.Chrome()  # any Selenium WebDriver works here; Chrome is just an example

driver.get('https://seeksophie.com/options/1-5hr-basic-candle-workshop')
# Remove the fixed bottom booking bar so it does not block the click on the next-month button.
driver.execute_script("""document.querySelector("#js-booking-bottom-bar").remove()""")

# The "next month" arrow of the booking calendar.
n = driver.find_element_by_xpath(
    "/html/body/div[3]/div[4]/div[5]/div/div/div[2]/div/div[1]/div/div[2]/div[1]/span[2]")

all_spans = []
for i in range(12):
    # Parse the currently displayed month and collect its day spans.
    page = driver.page_source
    soup = BeautifulSoup(page, "lxml")
    all_spans.extend(soup.find_all("div", class_="dayContainer")[1].find_all("span"))
    try:
        # If the first-order bonus modal pops up, remove it and its backdrop,
        # otherwise it blocks the click on the next-month button.
        driver.execute_script("""document.querySelector("#js-modal-first-order-bonus").remove()""")
        driver.execute_script("""document.querySelector(".modal-backdrop").remove()""")
    except Exception:
        pass
    n.click()

print(all_spans)
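For example, to pull only the available dates out of all_spans, you could drop the spans that carry the notAllowed or flatpickr-disabled class and read the aria-label of the rest (a rough sketch, not tested against the live page):

# Rough sketch: keep only day spans without the disabling classes and
# de-duplicate them by their aria-label (the human-readable date).
available_dates = []
seen = set()
for span in all_spans:
    classes = span.get("class", [])
    if "notAllowed" in classes or "flatpickr-disabled" in classes:
        continue
    label = span.get("aria-label")
    if label and label not in seen:
        seen.add(label)
        available_dates.append(label)
print(available_dates)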
And finally, if this helps with your problem, don't forget to mark it as the answer.