Unable to find element BeautifulSoup

Question

I am trying to parse a specific href link from the following website: https://www.murray-intl.co.uk/en/literature-library. Element i seek to parse: However, using BeautifulSoup I am unable to obtain the desired element, perhaps due to cookies acceptance. I am still new at BS4, and hope someone can help me on …

Accepted Answer

To get correct tags, remove "focus-within" class (it&#8217;s added later by JavaScript):import requestsfrom bs4 import BeautifulSoupurl = "https://www.murray-intl.co.uk/en/literature-library"soup = BeautifulSoup(requests.get(url).content, "html.parser")links = soup.find_all("a", class_="btn btn--naked btn--icon-left btn--block")for u in links:    print(u.get_text(strip=True), u.get("href", ""))Prints:...Portfolio Holding Summarylibrary_books https://www.aberdeenstandard.com/docs?editionId=9123afa2-5318-4715-9783-e07d08e2e7cc...EDIT: To get only the specified link you can use for example CSS selector:link = soup.select_one('a:-soup-contains("Portfolio Holding Summary")')print(link["href"])Prints:https://www.aberdeenstandard.com/docs?editionId=9123afa2-5318-4715-9783-e07d08e2e7cc

Advertisement

Answer