I am trying to scrape a website with Python and write the results to a CSV file, but the printed output does not match the CSV format I expect.
My current output looks like this: [screenshot of output]
How can I get the expected result instead? [screenshot of expected result]
Thanks
Below is my script
import urllib.request as req
import bs4
import csv
import pandas as pd
import re
from datetime import date, timedelta

def daterange(start_date, end_date):
    for n in range(int((end_date - start_date).days)):
        yield start_date + timedelta(n)

start_date = date(2021, 12, 10)
end_date = date(2021, 12, 15)
url = "https://hkgoldprice.com/history/"

with open('gprice.csv', 'w', newline="") as f1:
    for single_date in daterange(start_date, end_date):
        udate = single_date.strftime("%Y/%m/%d")
        urld = url + single_date.strftime("%Y/%m/%d")
        writer = csv.writer(f1, delimiter='\t', lineterminator='\n')
        writer.writerows(udate)
        print(udate)
        with req.urlopen(urld) as response:
            data = response.read().decode("utf-8")
        root = bs4.BeautifulSoup(data, "html.parser")
        prices = root.find_all("div", class_="gp")
        gshops = root.find_all("div", class_="gshop")
        gpdate = root.find_all("div", class_="gp_date")
        for price in prices:
            print(price.text)
            row = price
            writer.writerows(row)
Answer
The first problem is that you use "writerows", which treats its argument as a collection of rows and writes as many rows as it can. When your text is "2021/12/23", it is iterated character by character into ['2', '0', '2', '1', '/', '1', '2', '/', '2', '3'], and each character is written out as its own row. The same thing happens with the prices. So we use "writerow" instead and collect the row data in a list, which stops csv from splitting our data across multiple rows.
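As a quick standalone illustration of the difference (the file name 'demo.csv' and the price numbers below are made-up placeholders, not values from the scraper):

import csv

# Standalone demo of writerows vs writerow.
with open('demo.csv', 'w', newline='') as f:
    writer = csv.writer(f)

    # writerows() expects an iterable of rows; a string is iterated character by
    # character, so every character ends up as its own one-field row.
    writer.writerows("2021/12/23")

    # writerow() writes exactly one row; each list element becomes one column.
    writer.writerow(["2021/12/23", "15500", "15600"])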
The second problem is that .text in BeautifulSoup returns all of the text, including whitespace, which makes the csv output unpredictable. So I delete all whitespace and the '#' character first to prevent this.
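For example, a value read via .text might contain a '#' plus surrounding newlines and spaces; here is a small sketch of the clean-up step (the sample string is invented):

# Invented sample of what price.text could look like before cleaning.
raw = "\n  #15,500  \n"
cleaned = "".join(raw.replace('#', '').split())  # drop '#' and all whitespace
print(cleaned)  # -> 15,500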
Here is the modified code
with open('gprice.csv', 'w', newline="") as f1:
    for single_date in daterange(start_date, end_date):
        udate = single_date.strftime("%Y/%m/%d")
        urld = url + single_date.strftime("%Y/%m/%d")
        # we append row by row, so the default csv.writer settings are fine
        writer = csv.writer(f1)
        # start an empty row list
        row_list = []
        # append the date
        row_list.append(udate)
        with req.urlopen(urld) as response:
            data = response.read().decode("utf-8")
        root = bs4.BeautifulSoup(data, "html.parser")
        prices = root.find_all("div", class_="gp")
        gshops = root.find_all("div", class_="gshop")
        gpdate = root.find_all("div", class_="gp_date")
        for price in prices:
            # get the inner text and delete '#'
            row = price.text.replace('#', '')
            # delete all whitespace and append the price
            row_list.append("".join(row.split()))
        # we only write one row of data per date, so use "writerow" instead of "writerows"
        writer.writerow(row_list)
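If you want to double-check the file afterwards, one option (not part of the fix itself) is to read it back with pandas, which your script already imports; header=None because no header row was written:

import pandas as pd

# Read the generated file back for a quick sanity check; header=None since we
# never wrote a header row.
df = pd.read_csv('gprice.csv', header=None)
print(df.head())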