Web scraping a particular element of HTML

Question

I’m having trouble scraping information from government travel advice websites for a research project I’m doing in Python. I’ve picked the Turkey page, but the logic could extend to any country. The site is "https://www.gov.uk/foreign-travel-advice/turkey/safety-and-security". The code I'm using is: At the moment this is extracting all the HTML of the page. Having inspected the website, the information I am

Accepted Answer

If you are only interested in the data inside the element with the classes govuk-govspeak and direction-ltr, you can try the following steps.

Beautiful Soup supports the most commonly used CSS selectors. Just pass a string into the .select() method of a Tag object or of the BeautifulSoup object itself. Use . for a class and # for an id.

    # soup is the BeautifulSoup object already built from the page HTML
    data = soup.select('.govuk-govspeak.direction-ltr')

    # extract h3 tags
    h3_tags = data[0].select('h3')
    print(h3_tags)
    [Local travel - Syrian border, Local travel – eastern provinces, Political situation, ...]

    # extract p tags
    p3_tags = data[0].select('p')
    [The FCO advise against all travel to within 10 ...]
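As a minimal, self-contained sketch of the same selector approach, the snippet below runs against a small inline HTML string rather than the live page. The sample markup, the extract_sections helper, and the use of select_one and get_text are illustrative assumptions, not part of the original answer. Only the two class names come from the answer itself.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the downloaded page;
# only the two class names are taken from the answer above.
SAMPLE_HTML = """
<html><body>
  <nav><h3>Site navigation</h3><p>Not part of the advice text.</p></nav>
  <div class="govuk-govspeak direction-ltr">
    <h3>Local travel - Syrian border</h3>
    <p>First paragraph of advice.</p>
    <h3>Political situation</h3>
    <p>Second paragraph of advice.</p>
  </div>
</body></html>
"""


def extract_sections(html):
    """Return (headings, paragraphs) as text from the govspeak container."""
    soup = BeautifulSoup(html, "html.parser")
    # Chaining the classes without a space matches one element carrying both.
    container = soup.select_one(".govuk-govspeak.direction-ltr")
    if container is None:
        # The container is missing, e.g. because the page layout changed.
        return [], []
    headings = [h.get_text(strip=True) for h in container.select("h3")]
    paragraphs = [p.get_text(strip=True) for p in container.select("p")]
    return headings, paragraphs


headings, paragraphs = extract_sections(SAMPLE_HTML)
print(headings)
print(paragraphs)
```

For the live page, the HTML string would come from whatever download step is already in use, for example the text of an HTTP response. The class names on the real site may differ from those shown here, so the empty-result case is worth checking.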


Answer