Skip to content
Advertisement

Test of one dataframe in another

I have a pyspark dataframe df:

JavaScript

and another smaller pyspark dataframe but with 3 rows with the same values, df2:

JavaScript

Is there a way in pyspark to create a third boolean dataframe from the rows in df2 are in df? Such as:

JavaScript

Many thanks in advance.

Advertisement

Answer

You can do a left join and assign False if all columns joined from df2 are null:

JavaScript
User contributions licensed under: CC BY-SA
2 People found this is helpful
Advertisement