Skip to content
Advertisement

Python: Beautiful Soup’s “find_all” does not extract any content from HTML

I am currently trying to webscrape googles playstore. More specifically I want to create a dataset, that contains ratings of the disney+ app.

Based a tutorial on webscraping (Building a dataset of Graphic Cards on “Newegg.com”) I had no troubles in extracting the necesary information from the website. I did so by finding the correct container within the html code of the site. Here is the code that works percectly fine:

JavaScript

which gives me the desired output of html chunks to extract more specific information:

JavaScript

However if I try to replicate the code for disney+ on googles playstore, the function does not extract anything. Of course I also searched for the container within the html code that contains the review of a person. The Code:

JavaScript

Which produces the error:

JavaScript

I understand the error message, however I am puzzled on how to extract the review information the same way as in the example with graphic cards.

Any help is highly appreciated.

Best regards Neil

Advertisement

Answer

One simple option is HtmlSession

JavaScript

Output:-

JavaScript
User contributions licensed under: CC BY-SA
4 People found this is helpful
Advertisement