Skip to content
Advertisement

Read specific columns with pandas or other python module

I have a csv file from this webpage. I want to read some of the columns in the downloaded file (the csv version can be downloaded in the upper right corner).

Let’s say I want 2 columns:

  • 59 which in the header is star_name
  • 60 which in the header is ra.

However, for some reason the authors of the webpage sometimes decide to move the columns around.

In the end I want something like this, keeping in mind that values can be missing.

JavaScript

This will prevent my program to malfunction when the columns are changed again in the future, if they keep the name correct.

Until now I have tried various ways using the csv module and resently the pandas module. Both without any luck.

EDIT (added two lines + the header of my datafile. Sorry, but it’s extremely long.)

JavaScript

Advertisement

Answer

An easy way to do this is using the pandas library like this.

JavaScript

The problem here was the skipinitialspace which remove the spaces in the header. So ‘ star_name’ becomes ‘star_name’

Advertisement