Tag: scrapy

XMLFeedSpider not Producing an Output CSV

I'm having an issue with XMLFeedSpider. The parsing works in the Scrapy shell, so the problem seems to lie either in the request or in how the spider is engaged. Whether or not I add a start_request() method, I get the same error. No output_file.csv is produced after running th…

How to renew a Scrapy session

EDIT: Rewrote the topic and content based on previous findings. I am scraping through a proxy service that rotates my IP. To obtain a new IP, the connection to the proxy service needs to be closed and a new one opened with the new request. For instance, if I go to http://ipinfo.io…

Following links and crawling them

I was trying to make a crawler that follows links. With this code I was able to get the links, but the part that enters each link and gets the information I need was not working, so a friend helped me come up with this code. It gets the JSON with the page items, but in loop number 230