How to split a column on a character and repeat the remaining columns for each split

Question

Suppose I have a dataframe. First, how do I print all the rows that have "|" in column 1? I am trying the following, but it prints all rows of the frame. Second, how do I split column 1 and column 2 on "|", so that each split in column 1 gets its corresponding split from column 2 and

Accepted Answer

You can apply a custom lambda function that combines Series.str.split and Series.explode to the columns listed in splitter, then add all the other columns back with DataFrame.join:

    splitter = ['1','2']
    cols = df1.columns.difference(splitter)
    f = lambda x: x.str.split('|').explode()
    df1 = df1[splitter].apply(f).join(df1[cols]).reset_index(drop=True)
    print (df1)
       1    2   3   4
    0  A  def   2   3
    1  B  xyz  56   3
    2  C  abc  56   3
    3  X  uiu  65  34
    4  Y   oi  65  34
    5  Z  kji  65  34
    6  K  rsq  98  12

To filter the rows that contain |, keep in mind that | is a special regex character (alternation), so an unescaped "|" matches every row. Either pass regex=False to Series.str.contains:

    print(df1[df1[1].str.contains("|", regex=False)])

Or escape it as \|:

    print(df1[df1[1].str.contains(r"\|")])

(Use df1['1'] instead of df1[1] if the column labels are strings, as in splitter above.)
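For reference, here is a minimal, self-contained sketch of both steps. The question's original dataframe is not shown, so the input below is a hypothetical frame chosen to reproduce the output printed in the answer, and the column labels are assumed to be strings. The names `df`, `with_pipe`, `exploded` and `result` are illustrative only.

```python
import pandas as pd

# Hypothetical input (an assumption, not the asker's actual data).
# Column labels are assumed to be the strings "1".."4".
df = pd.DataFrame({
    "1": ["A", "B|C", "X|Y|Z", "K"],
    "2": ["def", "xyz|abc", "uiu|oi|kji", "rsq"],
    "3": [2, 56, 65, 98],
    "4": [3, 3, 34, 12],
})

# Step 1: keep only rows whose column "1" contains a literal "|".
# regex=False stops "|" from being read as regex alternation.
with_pipe = df[df["1"].str.contains("|", regex=False)]

# Step 2: split both columns on "|" and explode them in parallel,
# then join the untouched columns back on the original row index.
splitter = ["1", "2"]
cols = df.columns.difference(splitter)
exploded = df[splitter].apply(lambda s: s.str.split("|").explode())
result = exploded.join(df[cols]).reset_index(drop=True)

print(with_pipe)
print(result)
```

Note that this pattern relies on columns "1" and "2" having the same number of "|"-separated parts in every row; if they differ, the exploded columns no longer line up.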

Answer