Beautifulsoup: extracting td list in table

Question

I&#8217;m stuck with a BeautifulSoup problem that I think is simple but I can&#8217;t seem to solve. It is about extracting each td from the following table to create a loop and a list: What I need is to create a dictionary with some elements of each tr to create a dataframe later. I would like to have a list

Accepted Answer

If you feel like learning something new, you don’t even need bs4 (well, sort of). All you need is pandas (you get a dataframe out of the box) to get this:- ----------- -------- -- ---------------- ----------------------------------------------- -- --------------0 Barcelona Player 1 16 Tarjeta Amarilla Derribar a un contrario en la disputa del balón 88 Segundo tiempo1 Real Madrid Player 2 8 Tarjeta Amarilla Sujetar a un adversario impidiendo su avance. 12 Primer tiempo- ----------- -------- -- ---------------- ----------------------------------------------- -- --------------With this:import pandas as pdfrom tabulate import tabulatesample_html = """

Team	Name	Number	Tipo	Motivo	Minute	Bloque
Barcelona	Player 1	16	Tarjeta Amarilla	Derribar a un contrario en la disputa del balón	88	Segundo tiempo
Real Madrid	Player 2	8	Tarjeta Amarilla	Sujetar a un adversario impidiendo su avance.	12	Primer tiempo

"""df = pd.read_html(sample_html, flavor="bs4")df = pd.concat(df)print(tabulate(df))df.to_csv("your_table.csv", index=False)The code also dumps your table to a .csv file:

Advertisement

Answer