How to combine rows that have the same values in two columns (Python)?

Question

I currently have a CSV file as follows. The first part just shows the column names. The g column values are the same for every f value. The only unique part is p. Using Python, how could I combine this as follows: One thing to note is that the CSV file is much larger and that some f values might

Accepted Answer

I hope I've understood your question right. You can group by the "f" and "g" columns and then aggregate the rows:

    x = df.groupby(["f", "g"], as_index=False)["p"].agg(list)
    for vals in x.apply(lambda x: [x["f"], *x["p"], x["g"]], axis=1):
        print(vals)

Prints:

    ['foo', 'in', 'out', 'length', 'void']
    ['goo', 'a', 'b', 'c', 'd', 'e', 'f', 'int']
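For anyone who wants to try the grouping end to end, here is a minimal, self-contained sketch. The sample DataFrame is an assumption: the original CSV is not shown, so the rows are reconstructed from the printed output above. The names `grouped`, `rows`, and `combined_csv` are illustrative only. The sketch passes `sort=False`, which keeps groups in order of first appearance rather than sorting the keys, and it writes with `csv.writer` because the combined rows have different lengths.

```python
import csv
import io

import pandas as pd

# Assumed sample data, reconstructed from the printed output in the answer;
# the real CSV from the question is not available.
df = pd.DataFrame(
    {
        "f": ["foo"] * 3 + ["goo"] * 6,
        "p": ["in", "out", "length", "a", "b", "c", "d", "e", "f"],
        "g": ["void"] * 3 + ["int"] * 6,
    }
)

# Collect every p value for each (f, g) pair into a list.
# sort=False keeps the groups in the order they first appear.
grouped = df.groupby(["f", "g"], as_index=False, sort=False)["p"].agg(list)

# Build one flat row per group: f, then all p values, then g.
rows = [[f, *p, g] for f, g, p in zip(grouped["f"], grouped["g"], grouped["p"])]

# The rows are ragged (different lengths), so write them with csv.writer.
buffer = io.StringIO()
csv.writer(buffer).writerows(rows)
combined_csv = buffer.getvalue()
print(combined_csv)
```

To write to disk instead, the same `csv.writer` call can be pointed at a file opened with `open(path, "w", newline="")`, where `path` is whatever output location you choose.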
