insert line break in python file after a fix number of elements to delimit columns in a csv file

Question

I´ve been struggling a bit to find a way in python to force this file to create a jump to a new line after some number of elements (equal to the number of columns I will need to add which is 12) the CSV currently looks like this. the text of the first row looks like this . D276",31386,10610,12122021 00:00:47840

Accepted Answer

The first thing I would try is simply split the line on commas, and write records using csv.writer, calling .writerow() with twelve elements at a time. I notice you have a double quote at the beginning, but not later, so this approach might be good enough, you would just have to remove that double quote. Of course, if any field in your file has commas within its text, my suggestion will fall appart, but it&#8217;s a place to start, since you seem to be trying to fix one specific file, rather than solving a general problem.Here&#8217;s my implementation of that suggestion:import csvout_f = open("fixed-csv.txt", mode="w")writer = csv.writer(out_f)with open("bad-csv.txt") as in_f:    for line in in_f:        fields = line.strip("nr").split(",")        for position in range(0, len(fields), 12):            writer.writerow(fields[position:position+12])Now, I noticed running that code that you don&#8217;t actually have exactly 12 columns per row, it&#8217;s more like 10 or 11, and it&#8217;s not constant.Here&#8217;s a variant that looks for D276 and makes it the first column of each row:import csvout_f = open("fixed-csv-2.txt", mode="w")writer = csv.writer(out_f)with open("bad-csv.txt") as in_f:    for line in in_f:        fields = line.strip("nr").split(",")        d276_positions = [            i            for i, value in enumerate(fields)            if i == 0 or value == "D276"        ]        d276_positions.append(len(fields))        for start, end in zip(d276_positions, d276_positions[1:]):            writer.writerow(fields[start:end])I don&#8217;t imagine all your data will have D276 as the first value in the row, so you might have to change if i == 0 or value == "D276" to something that more generally locates the field that flags a new row, but this code should set you up with that you need to solve your problem, assuming, as I said at the beginning, that you don&#8217;t have commas inside any fields in your whole data file.If you do have commas in some of your fields, I would manually edit the output file with a text editor and patch the problems by hand. If there aren&#8217;t too many of them, it shouldn&#8217;t be a lot of work.

Advertisement

Answer