How to find a tag within the same parent that has the child I want?

Question

I'm scraping a website that has multiple products on the same page, some of which I don't want the prices of. So I wanted to first check the product category and then get the price listed. The website code looks like this: I already know how to get to the category part with my own code, but I'm compl…

Accepted Answer

You are close to your goal, but be aware that products.text gives you the text of the whole section. Use products.span.text to get only the category text.

To get the price, find the span with class="price" and check that it exists before reading its text, to avoid errors:

    price = products.find(class_='price').text if products.find('span', class_='price') else None

Example

    from bs4 import BeautifulSoup

    html = '''
    ...
    Clothes
    ...
    ... 149.99
    '''
    soup = BeautifulSoup(html, 'html.parser')

    for products in soup.find_all('section', class_='category'):
        category = products.span.text
        if category == 'Clothes':
            price = products.find(class_='price').text if products.find('span', class_='price') else None
            print(price)

Output

    149.99

As an alternative, here is a leaner approach that produces structured output that is easy to process and works with a list of permitted categories:

    from bs4 import BeautifulSoup

    html = '''
    ...
    Clothes
    ...
    ... 149.99
    ...
    Shoes
    ...
    ... 90.99
    '''
    soup = BeautifulSoup(html, 'html.parser')

    data = []
    c_list = ['Clothes', 'Shoes']

    for products in soup.select(f"section.category:-soup-contains({','.join(c_list)})"):
        data.append({
            'category': products.span.text,
            'price': products.find(class_='price').text if products.find('span', class_='price') else None
        })

    data

Output

    [{'category': 'Clothes', 'price': '149.99'}, {'category': 'Shoes', 'price': '90.99'}]
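For anyone who wants to run the idea end to end, here is a minimal self-contained sketch. The markup in HTML is hypothetical: the real page's HTML is not available here, so the tag layout is only an assumption chosen to fit the selectors used in the answer (a section with class "category", a first span holding the category name, and a span with class "price"). The function name prices_by_category and the extra "Toys" section are also made up for illustration.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: an assumption that matches the selectors in the answer,
# not the real website's HTML.
HTML = '''
<section class="category">
  <span>Clothes</span>
  <div><span class="price">149.99</span></div>
</section>
<section class="category">
  <span>Shoes</span>
  <div><span class="price">90.99</span></div>
</section>
<section class="category">
  <span>Toys</span>
</section>
'''


def prices_by_category(html, allowed):
    """Return one dict per section whose category name is in `allowed`."""
    soup = BeautifulSoup(html, 'html.parser')
    rows = []
    for section in soup.find_all('section', class_='category'):
        label = section.find('span')  # first span is assumed to hold the category
        if label is None:
            continue
        category = label.get_text(strip=True)
        if category not in allowed:
            continue
        # The price span may be missing, so check before reading its text.
        price_tag = section.find('span', class_='price')
        rows.append({
            'category': category,
            'price': price_tag.get_text(strip=True) if price_tag else None,
        })
    return rows


result = prices_by_category(HTML, {'Clothes', 'Toys'})
print(result)
```

This version compares the category span's text exactly against the allowed set, whereas :-soup-contains matches a substring anywhere in the section's text. Which of the two fits better depends on how the real page is structured.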
