Skip to content

Tag: dataframe

Filter pyspark DataFrame by string match

i would like check substring match between comments and keyword column and find if anyone of the keywords present in that particular row. input expected output Answer The most efficient here is to loop, you can use set intersection: Output: Used input: With a minor variation you could check for substring matc…

All cells getting updated in pandas df using loc

So I create an empty pandas df, where I initialize all the cell values to empty lists, except the diagonals, which are set to math.inf The indexes are the start position, and the column headers are the end position I want to get the start and end positions, and the difference between the days to get from star…