Getting first sibling of second instance using Xpath

Question

I'm trying to extract information from this page: I'm trying to extract the time (6:30 PM). My strategy is to find the second instance of the date (Mar. 31st, 2022), and then get the first sibling of that. Photo here (I want the part boxed in yellow): Here's what I've tried: However, this is not getting me what I want.

Accepted Answer

It seems it is first element with PM / AM so I would use find_element with'//div[contains(text(), " PM") or contains(text(), " AM")]'like thisitem = driver.find_element(By.XPATH, '//div[contains(text(), " PM") or contains(text(), " AM")]')print(item.text)I use space before PM/AM to make sure it is not inside word.Your xpath works when I add ( ) so it first gets divs and later select by index.Without () it may treats [text()="..."][1] like [text()="..." and 1].And it needs [2] instead of [1] because xpath start counting at 1, not 0"(//div[text()='" + first_date + "'])[2]/following-sibling::div"Full working examplefrom selenium import webdriverfrom selenium.webdriver.common.by import By#from webdriver_manager.chrome import ChromeDriverManagerfrom webdriver_manager.firefox import GeckoDriverManagerimport timeurl = 'https://www.bandsintown.com/e/103275458-nayo-jones-at-promise-of-justice-initiative?came_from=253&utm_medium=web&utm_source=city_page&utm_campaign=event'#driver = webdriver.Chrome(executable_path=ChromeDriverManager().install())driver = webdriver.Firefox(executable_path=GeckoDriverManager().install())driver.get(url)time.sleep(5)item = driver.find_element(By.XPATH, '//div[contains(text(), " PM") or contains(text(), " AM")]')print(item.text)print('---')first_date = driver.find_elements(By.CSS_SELECTOR, 'a[href^="https://www.bandsintown.com/a/"] + div + div')first_date = first_date[0].text        event_time = driver.find_elements(By.XPATH, "(//div[text()='" + first_date + "'])[2]/following-sibling::div")print(event_time[0].text)

Advertisement

Answer