How to filter for columns where the first row (not header) starts with a string

Question

I'm trying to filter a dataframe by the first row, but can't seem to figure out how to do it. Here's a sample version of the data I'm working with: What I want to do is filter for all columns that start with "Response" in the first non-header row. So in this case, just have…

Accepted Answer

The first step is to select the values of the first row:

    df.iloc[0]  # selects the values in the first row

Then, use pandas's .str accessor, which applies string methods to the data values rather than to the column names:

    df.iloc[0].str.startswith('Response')  # test each value in the first row

This gives you a Series of True/False values indexed by column name. Finally, use it as a boolean mask to select the matching columns from your dataframe:

    df.loc[:, df.iloc[0].str.startswith('Response')]  # select columns based on the test

This should do the trick! See the pandas docs on Indexing and Selecting Data and the .str string methods for more help.
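For reference, here is a minimal end-to-end sketch of that approach. The question's sample data is not shown, so the dataframe below (column names Q1, Q2, Q3, Q4 and their values) is purely hypothetical. The na=False argument is an addition not in the answer above: it makes missing values in the first row count as non-matches, so the mask stays strictly boolean.

```python
import pandas as pd

# Hypothetical stand-in for the question's data (the original sample is not shown).
df = pd.DataFrame(
    {
        "Q1": ["Response", "yes", "no"],
        "Q2": ["Response - other", "a", "b"],
        "Q3": ["Start Date", "2020-01-01", "2020-01-02"],
        "Q4": [None, "x", "y"],  # missing value in the first row
    }
)

# Boolean Series indexed by column name: True where the first row's value
# starts with "Response". na=False treats missing values as non-matches.
mask = df.iloc[0].str.startswith("Response", na=False)

# Keep every row, but only the columns where the mask is True.
filtered = df.loc[:, mask]

print(filtered)
```

If the first row is really a second header rather than data, you could follow up with filtered.iloc[1:] to drop it after filtering.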


Answer