I am trying to crawl a website, looking for all JS files so I can download them. I am new to Scrapy, and I found that I can use CrawlSpider, but I seem to have an issue with LinkExtractor, as my parse callback is never executed.
import scrapy
from scrapy.spiders import CrawlSpider, Rule
from scrapy.linkextractors import LinkExtractor

class JSDownloader(CrawlSpider):
    name = 'jsdownloader'
    allowed_domains = ['example.com']
    start_urls = ['http://example.com']

    rules = (
        Rule(LinkExtractor(allow=('.js', )), callback='parse_item'),
    )

    def parse_item(self, response):
        self.logger.info('JS File %s', response.url)
        item = scrapy.Item()
        # Process Item here
        yield item
Answer
I found that LinkExtractor has tags and attrs parameters, and their defaults only cover the 'a' and 'area' tags and the href attribute. See the LinkExtractor documentation.
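For reference, the relevant part of the constructor's defaults looks roughly like this (a sketch, not the full signature):

LinkExtractor(
    tags=('a', 'area'),    # only <a> and <area> elements are scanned
    attrs=('href',),       # only the href attribute is read
    deny_extensions=None,  # None falls back to Scrapy's IGNORED_EXTENSIONS list
)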
So the solution is to add the 'script' tag (and the src attribute):
Rule(LinkExtractor(tags=('a', 'script'), attrs=('href', 'src')), callback='parse_item'),
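Putting it together, here is a minimal end-to-end sketch. The save-to-disk logic and the ./js/ output directory are my own choices for illustration, and the deny_extensions override is defensive: LinkExtractor silently drops links whose file extension is on its deny list, so removing 'js' from that list (if your Scrapy version's IGNORED_EXTENSIONS includes it) makes sure .js URLs get through. The .js rule is listed first because CrawlSpider hands each link to the first rule whose extractor matches it:

import os

from scrapy.linkextractors import IGNORED_EXTENSIONS, LinkExtractor
from scrapy.spiders import CrawlSpider, Rule

class JSDownloader(CrawlSpider):
    name = 'jsdownloader'
    allowed_domains = ['example.com']
    start_urls = ['http://example.com']

    rules = (
        # .js links first: CrawlSpider deduplicates links across rules,
        # so this rule must see them before the generic follow rule does.
        Rule(
            LinkExtractor(
                allow=(r'\.js$',),
                tags=('a', 'script'),
                attrs=('href', 'src'),
                deny_extensions=[e for e in IGNORED_EXTENSIONS if e != 'js'],
            ),
            callback='parse_item',
        ),
        # Keep crawling ordinary page links to find more <script> tags.
        Rule(LinkExtractor(), follow=True),
    )

    def parse_item(self, response):
        self.logger.info('JS File %s', response.url)
        # Save the body under ./js/, named after the last URL segment.
        os.makedirs('js', exist_ok=True)
        filename = response.url.split('/')[-1] or 'unnamed.js'
        with open(os.path.join('js', filename), 'wb') as f:
            f.write(response.body)

For a real project, Scrapy's built-in FilesPipeline is the more idiomatic way to persist downloaded files, but writing the response body directly keeps the example self-contained.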