How to find all columns contains string and put in a new columns?

Question

I was wondering how could I find all values that start with 'orange' from all the columns and parse it into new columns. expected output : Answer Let's try stack then filter by str.contains: df1: Or melt for same order as OP: df1: regex ^orange: ^ asserts position at start of a line orange matches the characters orange literally (case

Accepted Answer

Let&#8217;s try stack then filter by str.contains:df1 = data.stack()df1 = (    df1[df1.str.contains('^orange', regex=True)]        .reset_index(drop=True)        .to_frame('category'))df1:       category0   orange 13451  orange 411342   orange 22223   orange 90874      orange 15   orange 24566  orange 221457   orange 23418   orange 00219      orange 2Or melt for same order as OP:df1 = data.melt()['value']df1 = (    df1[df1.str.contains('^orange', regex=True)]        .reset_index(drop=True)        .to_frame('category'))df1:       category0   orange 13451   orange 24562  orange 411343  orange 221454   orange 22225   orange 23416   orange 90877   orange 00218      orange 19      orange 2regex ^orange:^ asserts position at start of a lineorange matches the characters orange literally (case sensitive)

Advertisement

Answer