Skip to content
Advertisement

Why do I run into trouble webscraping this website in Python?

I am new to Python and I am trying to webscrape this website. What I am trying to do is to get just dates and articles’ titles from this website. I follow a procedure I found on SO which is as follows:

JavaScript

I got .title a , .date using SelectorGadget in the url I shared. However, print(movies) is empty. What am I doing wrong?

Can anyone help me?

Thanks!

Advertisement

Answer

The content is not part of index.en.html but is loaded in by js from

JavaScript

Then you can’t select pairs afaik, so you need to select for titles and dates separately:

JavaScript

Then you can print them out like this:

JavaScript

Result:

JavaScript

Full code:

JavaScript
User contributions licensed under: CC BY-SA
3 People found this is helpful
Advertisement