Python Pandas – Read csv with commented header line

Question

I want to read and process a csv file with pandas. The file (as seen below) contains multiple header lines which are indicated by a # tag. Importing a single file is easy. However, I have a lot of such files with different header names and I don't want to name them (Time Cd Cs …) manually. A…

Accepted Answer

What about extracting the header before you read the file?

We only assume that your header lines start with #. Extraction of the header as well as its position in the file is automated. We also ensure that no more lines than necessary are read (except the first data line).

    import pandas as pd

    # file is the path to the csv file
    with open(file) as f:
        line = f.readline()
        cnt = 0
        # Walk through the leading comment lines, remembering the last one
        while line.startswith('#'):
            prev_line = line
            line = f.readline()
            cnt += 1
            # print(prev_line)

    # The last comment line holds the column names
    header = prev_line.strip().lstrip('# ').split()

    # Skip the cnt comment lines; columns are separated by whitespace
    df = pd.read_csv(file, delimiter=r"\s+",
                     names=header,
                     skiprows=cnt)

With this, you can also process the other header lines. It also gives you the position of the header in the file.
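Since the question mentions many such files, the header extraction from the answer could be wrapped in a small reusable helper. The following is only a sketch under stated assumptions: the function name extract_commented_header, its comment parameter, and the sample lines are hypothetical and not part of the original answer. Unlike the snippet in the answer, it raises an error when a file has no leading comment line instead of failing on an undefined variable.

```python
from typing import Iterable, List, Tuple


def extract_commented_header(
    lines: Iterable[str], comment: str = "#"
) -> Tuple[List[str], int]:
    """Return (column names, number of leading comment lines).

    The column names are taken from the last leading comment line.
    Hypothetical helper, not part of the original answer.
    """
    last_comment = None
    count = 0
    for line in lines:
        if not line.startswith(comment):
            break  # first data line reached; stop reading
        last_comment = line
        count += 1
    if last_comment is None:
        raise ValueError("no leading comment lines found")
    # Strip the comment marker and surrounding spaces, then split on whitespace
    names = last_comment.strip().lstrip(comment + " ").split()
    return names, count


# Made-up sample content, for illustration only.
sample = [
    "# Forces\n",
    "# Time Cd Cs\n",
    "0.0 1.5 0.2\n",
    "0.1 1.6 0.3\n",
]
header, skip = extract_commented_header(sample)
print(header, skip)
```

For a real file, an open file object can be passed in place of the sample list, and the two returned values can then be handed to pd.read_csv as names= and skiprows=, as in the answer.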

Answer