Using Python pandas, how can we select very specific rows and the associated column?

Question

I am still learning Python, so kindly excuse me if the question looks trivial to some. I have a CSV file with the following format, and I want to extract a small segment of it and write it to another CSV file. This is what I want to do: just extract the entries under actor_list2 and the corresponding id column and writ…

Accepted Answer

As Nour-Allah has pointed out, the formatting here is not very regular, to say the least. If your data comes out like this every time, the best you can do is skip some rows of the file:

    import pandas as pd

    # Skip the first 17 lines; the next line becomes the header,
    # and the 8 rows after it are read as data.
    df = pd.read_csv('blabla.csv', skiprows=list(range(17)), nrows=8)
    df_res = df.loc[:, ['actor_list2', 'ID']]

This should get you the result, but given how erratic the formatting is, this is no way to automate it. What if next time there's another actor? Or one fewer? Even Nour-Allah's solution would not help there.

Honestly, you should just get better data.
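If better data is not an option, one possible workaround is to look for the header line instead of hard-coding row counts. The following is only a minimal sketch under assumptions that the question does not confirm: that the block of interest starts with a line containing the field `actor_list2`, and that it ends at the first line that is blank or contains only commas. The helper name `extract_block` and the sample text are made up for illustration and are not the asker's real file.

```python
import io

import pandas as pd


def extract_block(lines, header_token="actor_list2"):
    """Return the table that starts at the header line containing
    header_token and ends at the next empty line (hypothetical helper)."""
    # Find the first line that has header_token as one of its fields.
    start = next(
        (i for i, line in enumerate(lines) if header_token in line.split(",")),
        None,
    )
    if start is None:
        raise ValueError(f"no header line containing {header_token!r}")

    # Walk forward until a line that is empty or holds only commas.
    end = start + 1
    while end < len(lines) and lines[end].strip(", \t\r\n"):
        end += 1

    return pd.read_csv(io.StringIO("\n".join(lines[start:end])))


# Made-up sample that only imitates an irregular layout.
sample = """report,2021
,
actor_list1,ID
Alice,1
,
actor_list2,ID
Bob,7
Carol,8
Dave,9
,
footer,x
"""

df = extract_block(sample.splitlines())
df_res = df.loc[:, ["actor_list2", "ID"]]

# With no path argument, to_csv returns the CSV text instead of writing a file.
csv_text = df_res.to_csv(index=False)
```

For a real file, the lines could come from `open('blabla.csv').read().splitlines()`, and `df_res.to_csv('out.csv', index=False)` would write the result to disk (`out.csv` is a placeholder name). If the real file separates its blocks differently, the end-of-block test has to change accordingly.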


Answer