Tag: web-scraping

Python Scrape Forum for Title for each Post

I’m new to Web scraping and new to Python. I want to scrape for the title of each Posting at the Forum of the URL, so then a new Post is created with 1 of the Titles below i’d like to receive a Mail with that Link of the Post. With searching for the div structItem-title i receive the 23

How to extract element from a webpage with special class name?

beautifulsoup python web-scraping

I have a txt file filed with multiple urls, each url is an article with text and their corresponding SDG (example of one article 1) The text parts of an article are in balises ‘div.text.-normal.content’ and then in ‘p’ And the SDGs are in ‘div.tax-section.text.-normal.small&#8217…

Python – Scraping a PDF file from a URL

pdf python web-scraping

I want to scrape pdf files from this site https://www.sigmaths.net/Reader.php?var=manuels/ph/physique_pilote_7b.pdf I tried this code for that but it doesn’t work. Can anybody tell me why, please? Answer your url is pointing to a reader https://www.sigmaths.net/Reader.php?var=manuels/ph/physique_7b.pdf,…

Delete unwanted elements of python webscraping loop results

beautifulsoup data-cleaning python web-scraping

I’m currently trying to extract text and labels (Topics) from a webpage with the following code : No code problem, but here is an extract of what I’ve obtained with the previous code : As I’m looking for a “clean” text result I tried to add the following code line in my loops in …

Python Beautiful Soup html.parser returns none

beautifulsoup html python web-scraping

I have a feeling the information is somewhere on stack overflow, but I can’t find it :-/ I’m looking to get the text from this website: https://www.uniprot.org/uniprot/P28653.fasta but my code returns ‘None.’ All help is super appreciated! returns Thanks!! Answer There is no html in th…

Beautiful Soup and requests problem it doesn’t show any text output

beautifulsoup python python-2.7 python-3.x web-scraping

I am using beautiful soup and requests to print full text of the article of this wedsite https://www.vanityfair.com/style/society/2014/06/monica-lewinsky-humiliation-culture This is my code: My code run without any error but it does’t show any text in output plz help me find my error Answer What happens…

BeautifulSoup choosing a line from output

beautifulsoup python web-scraping

I try to make a python script that gets the dam occupancy rates from a website. Here is the code: The output is like this: Genel Doluluk Oranı means occupancy rate. I need the value of occupancy rate which writes in next line like 29,48. How can I get this value from output? Answer Cause question and expected…

Can’t get tags when scraping data

beautifulsoup python web-scraping

I am trying to scrape all tr tags using BeautifulSoup, but it returns none. Code: Even though there are tr tags in this url, it returns none and throws an IndexError. Why is this happening? Answer In page source table is located inside comment. You need to extract comment content and then parse it as HTML:

Message: element click intercepted: Element … is not clickable at point (657, 594). Other element would receive the click with Selenium

python selenium selenium-webdriver web-scraping webdriverwait

I was making a webscraper to get gpu stocks from https://www.nvidia.com/en-us/shop/geforce/?page=1&limit=9&locale=en-us to get my hands on a 30 series card, I am using python with bs4 and selenium for this. I want to load more shopping items, on the website it has this load more button. So I grabbed i…

Why does linkextractor skip link?

hyperlink python scrapy web-scraping

I am Scraping some pages and am trying to use the LinkExtractor to get the URLs from the response. In general that is going quite ok, but the LinkExtractor is not able to extract the relative link to a pdf file that is found at line 111 of the html I have tried a lot, but haven’t been able to