Skip to content
Advertisement

Tag: beautifulsoup

Scraping search results off of Sportchek with Beautiful Soup 4 to find prices

So I’m trying to web scrape search results from Sportchek with BS4, specifically this link “https://www.sportchek.ca/categories/men/footwear/basketball-shoes.html?page=1”. I want to get the prices off of the shoes here and put them all into a system to sort it, however, to do this I need to get the prices first and I cannot find a way to do that. In the HTML,

Using Decompose to remove empty tag

I am trying to search for emails in HTML elements. I want to run the code so that when there are no emails found in the HTML, to search in another element in the HTML and in the end if it is not found to set email as “N/A”. I am new to writing code and I am trying to

Can’t stratify output based on different headings and their corresponding paragraphs

I’m trying to fetch each heading and their corresponding paragraphs from the html elements below. The results should be stored within a dictionary. Whatever I’ve tried so far produces ludicrously haphazard output. I intentionally did not paste the current output only because of brevity of space. I’ve tried with (producing messy output): Output I wish to get: Answer Tricky problem.

The python parser does not read information from the site, but returns None

I’m making a python parser for the site: https://www.kinopoisk.ru/lists/series-top250/ The task is to pick film genres from films (displayed on the page as: ‘span’, class _ = ‘selection-film-item-meta__meta-additional-item’) I can’t understand why it gives the result: [{‘title’: None}, {‘title’: None}, {‘title’: None}, … {‘title’: None}] Answer I’m definitely getting some captcha blocks from my local machine https://www.kinopoisk.ru/**showcaptcha**?cc=1&retpath=https%3A//www.kinopoisk.ru/lists/series-top250%3F_ea4584… but running from

python web scraping issues with mechanize

I am trying to scrape web results from the website: https://promedmail.org/promed-posts/ I have followed beutifulsoup. mechanical soup and mechanize so far unable to scrape the search results. The content does not show the search results when typed in US. Any idea on what am I doing wrong here? Answer As you mention bs4 you can mimic the POST request the

screen scrape text values from span based on other text values from corresponding span with beautiful soup

I have some beautiful soup code, like the example code below. I’m using it to screen scrape financial data from yahoo finance about mutual funds. In this piece of code I’m trying to scrape the “Bond Ratings” percentages, and save them in a dictionary. I’ve been trying to select element values based on the span class=”Fl(end)”, but I’m finding that

Advertisement