
Delete rows that do not contain specific text

I have a tabular file that looks like this:

JavaScript

I’m trying to write a script that goes through the file and deletes the entire row if column 2 (‘KEGG_KOs’) does not begin with ‘K0’. The output I want is:

JavaScript

Previous responses have referred people to pandas DataFrame, but I’ve had no luck adapting those answers. Any help would be greatly appreciated, cheers.

I had tried the following, but it only isolates one specific K0 line:

JavaScript


Answer

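Use boolean indexing with str.startswith, or with str.contains and a regex anchored at the start of the string (^). Pass na=False in either case, because the column contains missing values, which would otherwise produce NaN in the mask: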

JavaScript
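
As a minimal sketch of the startswith approach: the sample data below is invented for illustration (only the KEGG_KOs column name comes from the question, and query_name is a hypothetical column), so adjust it to the real file.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data; only the KEGG_KOs column name is from the question.
df = pd.DataFrame({
    "query_name": ["gene_1", "gene_2", "gene_3", "gene_4"],
    "KEGG_KOs": ["K00001", np.nan, "K02030", "map00010"],
})

# True where the value starts with "K0".
# na=False makes missing values count as non-matches instead of NaN.
mask = df["KEGG_KOs"].str.startswith("K0", na=False)

# Boolean indexing keeps only the rows where the mask is True.
filtered = df[mask]
print(filtered)
```

Without na=False the mask would contain NaN for the missing row, and indexing with it would likely raise an error rather than silently drop the row.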

Or:

JavaScript
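
A sketch of the regex variant, this time reading tab-separated text and writing the result back out. The inline text stands in for the real file, and the query_name column is made up; in practice a file path would be passed to read_csv and to_csv instead.

```python
import io

import pandas as pd

# Hypothetical tab-separated content standing in for the file in the question.
raw = (
    "query_name\tKEGG_KOs\n"
    "gene_1\tK00001\n"
    "gene_2\t\n"
    "gene_3\tK02030\n"
    "gene_4\tmap00010\n"
)
df = pd.read_csv(io.StringIO(raw), sep="\t")  # the empty field is read as NaN

# "^" anchors the pattern at the start of the string; na=False treats NaN as no match.
mask = df["KEGG_KOs"].str.contains(r"^K0", na=False)
filtered = df[mask]

# With no path argument, to_csv returns the tab-separated text as a string.
out = filtered.to_csv(sep="\t", index=False)
print(out)
```

Both variants select the same rows here; the regex form is mainly useful if the prefix rule later becomes more complex than a fixed string.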
User contributions licensed under: CC BY-SA