Function failing to update spacing after comma

Question

I have a csv file that has inconsistent spacing after the comma, like this:

    534323, 93495443,34234234, 3523423423, 2342342,236555, 6564354344

I have written a function that tries to read in the file and makes the spacing consistent, but it doesn't appear to update anything. After opening the new file cr…

Accepted Answer

The original code has a couple of bugs:

1. The if "," in data condition never evaluates to true. data is a list, where each item is a string representing one entire line of the file. No single line in the file is exactly ",", so the condition is always false. To fix it, use if "," in item. That way it checks whether each line contains a comma.

2. item.index returns only the position of the first comma, so if there's inconsistent spacing twice in one line, the algorithm does not catch the second occurrence.

A simple solution that doesn't require regular expressions, sed, or indexing and looking at each word character by character is:

    with open(dirpath + orig_filename, "r") as f:
        for line in f:
            # Strip every space, then put exactly one space after each comma.
            new_line = line.replace(" ", "").replace(",", ", ")
            # Append the cleaned line to the output file.
            with open(dirpath + cleaned_filename, "a") as cleaned_data:
                cleaned_data.write(new_line)

What this is doing is:

for line in f reads each line of the file.

line.replace(" ", "").replace(",", ", ") first removes all spaces from the line entirely (thanks to @megakarg for the suggestion), and then makes sure there's a single space after each comma to meet the spec.
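To make both the bug and the fix easy to try out, here is a minimal sketch. The names normalize_spacing and clean_file are hypothetical helpers, not part of the original question or answer, and the sketch assumes no field contains spaces that need to be preserved. Unlike the snippet in the answer, it opens the output file once in "w" mode, so re-running it overwrites the cleaned file rather than appending to it.

```python
def normalize_spacing(line: str) -> str:
    """Remove every space, then put exactly one space after each comma."""
    return line.replace(" ", "").replace(",", ", ")


def clean_file(src_path: str, dst_path: str) -> None:
    """Write a copy of src_path with consistent comma spacing to dst_path."""
    # Both files are opened once; "w" replaces any earlier output.
    with open(src_path, "r") as src, open(dst_path, "w") as dst:
        for line in src:
            dst.write(normalize_spacing(line))


# The membership pitfall: a list contains "," only if one of its items
# is exactly the string ",". A string contains "," as a substring.
data = ["534323, 93495443,34234234\n"]
print("," in data)     # False
print("," in data[0])  # True

# Every comma on the line is handled, not just the first one.
print(normalize_spacing("534323, 93495443,34234234, 3523423423"))
```

Calling clean_file(dirpath + orig_filename, dirpath + cleaned_filename) would then mirror the loop in the answer, with dirpath, orig_filename, and cleaned_filename taken from the original code.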

Answer