Scrape information about Visit Page and App Name on google play store

Question

I have created the code below to scrape the app name and Visit Page Url from the google play store page. ASOS &#8211; Get ASOS (Line 1120) Visit website &#8211; Get http://www.asos.com &#8211; (q=)(Line 1121 source code) I can&#8217;t seem to print the HTML here. Please open edits for that. But Please help me…

Accepted Answer

BeautifulSoup provides a great deal of functions that you should be taking advantage of.For starters, your script can be cut down to the following:import requestsfrom bs4 import BeautifulSoupurl = 'https://play.google.com/store/apps/details?id=com.asos.app'r = requests.get(url)soup = BeautifulSoup(r.content, "html.parser")for a in soup.find_all('a', {'class': 'dev-link'}):    print "Found the URL:", a['href']BS4 can parse the raw HTML content and you can iterate through it via the data type.  In this scenario, you want a particular href link of class name dev-link.  Doing so, gets you the following output:Found the URL: https://www.google.com/url?q=http://www.asos.com&sa=D&usg=AFQjCNGl4lHIgnhUR3y414Q8idAzJvASqwFound the URL: mailto:androiddev@asos.comFound the URL: https://www.google.com/url?q=http://www.asos.com/infopages/pgeprivacy.aspx&sa=D&usg=AFQjCNH-hW1H0fYlsCjp4ERbVh29epqaXAI&#8217;m sure you can tweak it a bit more to get the results you want but please refer to BS4 for more information ==> https://www.crummy.com/software/BeautifulSoup/bs4/doc/

Advertisement

Answer