How to access a table in HTML without any name with Beautifulsoup

Question

I am trying to scrape a table from url. The table seems to have no name. I have scraped the links and its text to csv using below code. What I need is to scrape the table as it is. currently I am able to get just links. I need other columns too. I have tried following code but failed.

Accepted Answer

To save data to CSV, you can use this example:import requestsimport pandas as pdfrom bs4 import BeautifulSoupurl = 'https://www.sbp.org.pk/smefd/circulars/2020/index.htm'soup = BeautifulSoup(requests.get(url).content, 'html.parser')table = soup.select_one('table[width="95%"]:not(:has(table))')all_data = []for row in table.select('tr:not(:has(td[colspan]))'):    tds = [td.get_text(strip=True).replace('n', ' ').replace('t', ' ') for td in row.select('td') if td.get_text(strip=True)]    tds += [row.find_previous('td', {'colspan': '4'}).get_text(strip=True).replace('n', ' '), row.a['href']]    all_data.append(tds)df = pd.DataFrame(all_data)print(df)df.to_csv('data.csv', index=False, header=False)Saves data.csv (screenshot from LibreOffice):

Advertisement

Answer