Tag: web-scraping

Why does my Selenium URL_to_be statement not work?

Selenium does not seem to register that I manually go to the publish0x.com page. Does anyone know a solution? My goal is to manually do the captcha at the login page and afterwards, when I log in and land on the main page I want the script to resume. Answer There are two ways I can think of, one by

Webscraping Dynamic Website to Pull Recent News Article URLs

finance html json python web-scraping

I am attempting to pull investing news articles from a dynamic website using Python. I have tried a couple of tutorials that worked for static websites, but I have had issues pulling the URL to a specific article. The code I am working with is as follows: Which gets me a list of the links within the page in an

Stale element: iterating through webelement list python

python selenium web-crawler web-scraping

I’m using selenium for the first time to get some information about a fantasy soccer game I play with my friends (we have a competition). I’m facing issues iterating through a list of webelements. Apparently they become stale. Here’s some code and details: I was able to get to the competition’s page by myself. This page has cards for every

Getting no data when scraping a table

beautifulsoup python web-scraping

I am trying to scrape historical data from a table in coinmarketcap. However, the code that I run gives back “no data.” I thought it would be fairly easy, but not sure what I am missing. Output: Answer You don’t need to scrape the data, you can get request it: Output:

using Selenium to get texts inside ‘ul’ tag?

python selenium selenium-chromedriver selenium-webdriver web-scraping

Please help me to find the solution to get the text inside ‘ul’ tag. I want to get the information which is separated with commas like: ‘Contains Enzymatically Active B-Vitamins, Dietary Supplement, Non-GMO LE Certified’ website link: https://ca.iherb.com/pr/Life-Extension-BioActive-Complete-B-Complex-60-Vegetarian-Capsules/67051 picture: enter image description here This is the HTML code: Answer This should do it: Output:

How to select all tags HTML

beautifulsoup python web-scraping

From this webpage I need to select all tags <b> </b> with BeautifulSoup4. I have tried using find_all() and select() but they fail to show all <b> tags when used in the array Answer There are different parsers used in parsing a html document, the most used one is ‘html.parser’. I have used lxml here which uses both xml and

Pb to select a text from a dropdown list

beautifulsoup python selenium web web-scraping

I started learning Python/Selenium. After several attempts, I can’t find a way to extract the text from a dropdown list, and I would like to put a for loop after the first extraction. this is the image: enter image description here the link : enter link description here Answer You can do like this

How do I put a space between every data got from the FOR LOOP

for-loop python web-scraping

I am a new learner in python and was trying to get some data from 2 webpages using Beautiful Soup and FOR LOOP to loop over it and print that. From the above code you can see that I am able to get output from both the webpages in a sequence but I want to insert a space or some

Python – trying to get beautifulsoup to find words in a list, but it’s unable to find them

beautifulsoup if-statement list python web-scraping

I’m working on my first project that isn’t straight out of a book but I’m having trouble getting a function to work. The function receives a list of strings and a BeautifulSoup object and attempts to find each word in the soup.text. However, the code seems unable to find any words/strings at all even when I am certain it should

Recaptcha missing the element “data-sitekey” making it impossible to use any captcha service apis

javascript python recaptcha web-scraping

As the above title says, I’m attempting to use a captcha solving service but all of them request a code that’s attached to an element called “data-sitekey” that should be in the HTML of a page with recaptcha on it. My problem is that the webpage I’m looking at doesn’t have a “data-sitekey” element, meaning I can’t use any of

Advertisement

Posts navigation

Prev 1 … 21 22 23 … 31 Next