Skip to content
Advertisement

Extracting specific string matches from a Stock Website page

I am trying webscrape stock market caps, using this below code. At first I traditionally tried to fetch the list of market cap values using bs4. When I used print(x.find('span',{'class': 'Trsdu(0.3s)'}).text) to do this, I got AttributeError: 'NoneType' object has no attribute 'text' error.

JavaScript

I did not know how to resolve the above error specific to my code. So I took an alternative using regex to simply extract the required values and tried this below.

Main Code

JavaScript

Results

This is what print(str(marketCapArray) would output. (pasted only some part)

JavaScript

This is the output of print(pi). Also the final output.

JavaScript

Question

How can I avoid using regex replace(re.sub) in the above Main Code to achieve the given final output pi ? or Suggest me the right approach to do this. I feel my regex is unpleasant.

Advertisement

Answer

You can iterate row by row inside the <table>, where all information is stored. For example:

JavaScript

Prints:

JavaScript
User contributions licensed under: CC BY-SA
8 People found this is helpful
Advertisement