Read through a text file and output to a dataframe by Python

Question

I have a text file in the format below. I want to read through all the records in the file and output them to a dataframe. Expected output: There will be two types of transaction description. The code I am trying is below, but it only works for one line of the text file. How can I modify it to read through all the

Accepted Answer

Try this:

```python
import pandas as pd
import numpy as np
from io import StringIO

t = """NEW ACCOUNT       ABC COMPANY  00123
                  CCY/BALANCE  USD 3,600
ACCOUNT APPROVAL  ABC COMPANY  00123
NEW ACCOUNT       BBC COMPANY  00124
                  CCY/BALANCE  USD 5,600"""

names = ['TRAN DESCRIPTION', 'CUSTOMER NAME', 'A/C NO.']
df = pd.read_fwf(StringIO(t), header=None, names=names)
# or df = pd.read_fwf(r'path_to_your_textfile.txt', header=None, names=names)

# Copy the balance from each CCY/BALANCE continuation row into a new column
df['CCY/BALANCE'] = np.where(df['CUSTOMER NAME'] == 'CCY/BALANCE', df['A/C NO.'], np.nan)
# Shift it up one row so it lines up with the transaction it belongs to
df['CCY/BALANCE'] = df['CCY/BALANCE'].shift(-1)
# Keep only the transaction rows (continuation rows have no description)
out = df[df['TRAN DESCRIPTION'].notna()].reset_index(drop=True)

print(out)  # or display(out) in a Jupyter notebook
```
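To see why copying the balance and then calling `shift(-1)` works, here is a minimal sketch on a hand-built frame that mimics what `read_fwf` is expected to return for the sample text in the answer. The frame contents and the helper name `attach_balance` are assumptions made for illustration, and the sketch uses `Series.where` in place of `np.where`, which should give the same result here.

```python
import numpy as np
import pandas as pd


def attach_balance(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: move each CCY/BALANCE continuation row
    onto the transaction row directly above it."""
    out = df.copy()
    is_balance = out["CUSTOMER NAME"] == "CCY/BALANCE"
    # Keep the balance text only on continuation rows, NaN elsewhere.
    out["CCY/BALANCE"] = out["A/C NO."].where(is_balance)
    # Shift up by one so the balance sits on the preceding transaction row.
    out["CCY/BALANCE"] = out["CCY/BALANCE"].shift(-1)
    # Drop the continuation rows, which have no transaction description.
    return out[out["TRAN DESCRIPTION"].notna()].reset_index(drop=True)


# Assumed shape of the parsed file: continuation rows have NaN in the first column.
raw = pd.DataFrame({
    "TRAN DESCRIPTION": ["NEW ACCOUNT", np.nan, "ACCOUNT APPROVAL", "NEW ACCOUNT", np.nan],
    "CUSTOMER NAME": ["ABC COMPANY", "CCY/BALANCE", "ABC COMPANY", "BBC COMPANY", "CCY/BALANCE"],
    "A/C NO.": ["00123", "USD 3,600", "00123", "00124", "USD 5,600"],
})

result = attach_balance(raw)
print(result)
```

Rows without a following balance line (such as the ACCOUNT APPROVAL record) end up with a missing value in `CCY/BALANCE`. The approach assumes a balance line always comes immediately after the transaction it belongs to.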
