How do I export a read_html df to Excel, when it related to table ID rather than data in the code?

Question

I am experiencing this error with the code below: I want to save the table I am scraping from wikipedia to an Excel file &#8211; but I can&#8217;t work out how to adjust the code to get the data list from the terminal to the Excel file using to_excel. I can see it works for a similar problem when a

Accepted Answer

pandas.read_html() creates a list of tables respectiv dataframe objects, so you have to pick one by index in your case [0] &#8211; You also do not need requests and BeautifulSoup, separatly, just go with pandas.read_html()pd.read_html(wiki_url,attrs={'id': table_id})[0]Exampleimport pandas as pdwiki_url = 'https://en.wikipedia.org/wiki/List_of_current_members_of_the_United_States_House_of_Representatives'table_id = 'votingmembers'congress_table = soup.find('table', )df = pd.read_html(wiki_url,attrs={'id': table_id})[0]df.to_excel (r'C:UsersnameOneDriveCode.vscodeTest.xlsx', index = False, header=True)

Advertisement

Answer

Example