Remove specific string char at the beginning of each lines of a txt file using python

Question

I&#8217;m currently working on a script in python. I want to convert an xls file into a txt file but I also want to clean and manage the data. In the xls files, there&#8217;s 4 columns which does interest me. Here is a sample of the txt I get from the conversion : To get this result I used this

Accepted Answer

Please find below the updated code.This is the input I took, added a couple of rows to ensure it works fine..Few changes:read_excel has header=None as your data did not have headerWhile writing to my_txt, I added index=False, header=False so that there is no index added to the txt file and no headers. This way your output file will only have dataI used the dataframe to clean the OPEN and &#8216;0&#8217; when they are in columns A:B or C:D ONLY. Otherwise, leave them alone.import re# Function to convert  def listToStringOpen(s):         str1 = ""         for ele in s:         str1 += ele        str1 += "n"                return str1import pandas as pddf = pd.read_excel('my_xls.xlsm', sheet_name='Sheet1', usecols="A:D", header=None)with open('my_txt.txt', 'w', encoding ='utf-8', errors='ignore') as outfile:    df.to_string(outfile, index=False, header=False)with open('my_txt.txt', 'r', encoding ='utf-8', errors='ignore') as f:    data = f.readlines()df = df.astype(str)cleaned_lines = []row = 0for row in range(len(df)):    print(df.iloc[row,:][0], df.iloc[row,:][1], df.iloc[row,:][2], df.iloc[row,:][3])    if ((df.iloc[row,:][0].strip() == 'OPEN') and (df.iloc[row,:][1].strip() == "0")) :        df.iloc[row,:][0] = ""        df.iloc[row,:][1] = ""    elif ((df.iloc[row,:][2].strip() == 'OPEN') and (df.iloc[row,:][3].strip() == "0")):        df.iloc[row,:][2] = ""        df.iloc[row,:][3] = ""with open('result_open.txt', 'w', encoding ='utf-8', errors='ignore') as f:    df.to_string(f, index=False, header=False)

Advertisement

Answer