Split column to multiple columns by another column value (complicated separator)

Question

I have dataframe like: len of column1 value may be different &#8211; from 2 to 5 words, so split with space not an option. Output should be like: That topic &#8211; How to split a dataframe string column into two columns? &#8211; didn&#8217;t help coz of separator UPD. Left &#8220;side&#8221; may have 2-5 wor…

Accepted Answer

option 1Splitting on spaces is an option, if you have a single word for the last two columns. Use rsplit:df['column1'].str.rsplit(n=2, expand=True)output:        0    1      20  abc 33  aaa  9g98f1     cde  aaa  95fwf2  12 faf  bbb  92gcs3     faf  bbb  7t87fNB. this doesn&#8217;t work with the updated exampleoption 2Alternatively, to split on the provided delimiter:df[['new_column1', 'new_column2']] = [a.split(f' {b} ') for a,b in                                      zip(df['column1'], df['column2'])]output:                column1 column2 new_column1 new_column20  abc 33 aaa 9g98f 333     aaa      abc 33   9g98f 3331         cde aaa 95fwf     aaa         cde       95fwf2      12 faf bbb 92gcs     bbb      12 faf       92gcs3         faf bbb 7t87f     bbb         faf       7t87foption 3Finally, if you have many time the same delimiters and many rows, it might be worth using vectorial splitting per group:(df .groupby('column2') .apply(lambda g: g['column1'].str.split(f's*{g.name}s*', expand=True)) )output:        0          10  abc 33  9g98f 3331     cde      95fwf2  12 faf      92gcs3     faf      7t87f

Advertisement

Answer

option 1

option 2

option 3