Python csv: Split column to columns and then to rows by delimiter

Question

I have a column in a csv file which contains a person's details in this format: Actual csv format: I want to split them in a new csv file like this: Splitting details: Split Row Delimiter: ' O&-' where & can be only 'K' or 'Z'. Split Column Delimiter: ':&…

Accepted Answer

Based on @DeepSpace's answer, but with a fixed regex and the new requirements added:

    import csv
    import re

    # One member record: rank (OK-<n> or OZ-<n>) followed by six ':'-separated fields.
    # The lookahead ends a record at the next ' OK'/' OZ' or at the end of the string.
    members_split_regex = re.compile(r'(O[KZ]-\d+):([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+)(?= O[KZ]|$)')

    with open('test.csv') as input_file, open('output_csv', 'w', newline='') as output_file:
        csv_reader = csv.DictReader(input_file)
        # Keep every input column except 'Members', then append the new member columns.
        fieldnames = csv_reader.fieldnames.copy()
        fieldnames.remove('Members')
        csv_writer = csv.DictWriter(output_file, extrasaction='ignore', fieldnames=fieldnames + ['Member_Rank', 'Member_Name', 'Member_Surname', 'Member_ID_Method', 'Member_ID_Num', 'Member_Gender', 'Member_Notes'])
        csv_writer.writeheader()
        for row in csv_reader:
            # Write one output row per member found in the 'Members' cell.
            for member_tuple in members_split_regex.findall(row['Members']):
                member_dict = {}
                (
                    member_dict['Member_Rank'],
                    member_dict['Member_Name'],
                    member_dict['Member_Surname'],
                    member_dict['Member_ID_Method'],
                    member_dict['Member_ID_Num'],
                    member_dict['Member_Gender'],
                    member_dict['Member_Notes']
                ) = member_tuple
                # Copy all remaining input columns; 'Members' is skipped by extrasaction='ignore'.
                member_dict.update(row)
                csv_writer.writerow(member_dict)

The main difference is that I'm removing the "Members" column from the output field names (and passing extrasaction='ignore' so the writer skips it), so that we can use the whole input row to update our new dictionary. This way we copy not only the "Team" column but any other column that is not "Members". To do so, the reader's fieldnames are copied, the "Members" item is removed, and the new ones are added to the writer's fieldnames.

The regex used doesn't hardcode any field, and it allows spaces in names and surnames, capital Os in the notes, and ID fields that are not just 8-digit numbers.
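To see how the row splitting behaves without touching any files, here is a minimal in-memory sketch of the same approach. The sample data (team "Alpha" and its two members) is made up for illustration, since the original input was not preserved; the `MEMBER_FIELDS` constant and the use of `io.StringIO` are also assumptions of this sketch, not part of the answer.

```python
import csv
import io
import re

# Same pattern as in the answer, split over two string literals for readability.
members_split_regex = re.compile(
    r'(O[KZ]-\d+):([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+)'
    r':([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+):([a-zA-Z0-9 ]+)(?= O[KZ]|$)'
)

# Hypothetical helper constant: the seven new output columns.
MEMBER_FIELDS = [
    'Member_Rank', 'Member_Name', 'Member_Surname', 'Member_ID_Method',
    'Member_ID_Num', 'Member_Gender', 'Member_Notes',
]

# Hypothetical input: one team whose "Members" cell holds two records.
sample_csv = (
    "Team,Members\n"
    "Alpha,OK-1:John:Smith:Passport:12345678:M:Captain "
    "OZ-2:Mary Ann:Lee:ID:987:F:Out Of Office\n"
)

reader = csv.DictReader(io.StringIO(sample_csv))
# Keep every input column except "Members".
fieldnames = [name for name in reader.fieldnames if name != 'Members']

output = io.StringIO()
writer = csv.DictWriter(
    output, fieldnames=fieldnames + MEMBER_FIELDS, extrasaction='ignore'
)
writer.writeheader()

for row in reader:
    # findall returns one 7-tuple per member record in the cell.
    for member in members_split_regex.findall(row['Members']):
        new_row = dict(zip(MEMBER_FIELDS, member))
        new_row.update(row)  # "Members" is still present but ignored on write
        writer.writerow(new_row)

print(output.getvalue())
```

With this sample, the single input row becomes two output rows, both carrying the "Team" value. The space in "Mary Ann" and the capital Os in "Out Of Office" do not split a record, because the lookahead only ends one at " OK", " OZ", or the end of the string.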


Answer