Skip to content
Advertisement

How can I scrape all the images from a website?

I have a website where I’d like to get all the images from the website.

The website is kind of a dynamic in nature, I tried using google’s Agenty Chrome extension and followed the steps:

  • I Choose one image that I want to extract using CSS selector, this will make the extension select the same other images automatically.
  • Viewed the Show button and select ATTR(attribute).
  • Changed src as an ATTR field.
  • Gave a name field name option.
  • Saved it & ran it in using Agenty platform/API.

This should yield me the result but it’s not, it is returning an empty output.

Is there any better option? Will BS4 a better option for this? Any help is appreciated.

Advertisement

Answer

I am assuming you want to download all images in the website. It is actually very easy to do this effectively using beautiful soup 4 (BS4).

JavaScript

You can also download the image with this tacked-on to the end:

JavaScript

Everything in two lines:

JavaScript

The new image should be in the same directory as the python file, but can be moved with:

JavaScript

In the case of the McMaster website, the images are linked differently, so the above methods won’t work. The following code should get most of the images on the website:

JavaScript

UPDATE: I found from some github post the below code that is MUCH more accurate:

JavaScript

Hope this helps!

User contributions licensed under: CC BY-SA
3 People found this is helpful
Advertisement