Matching Two Pandas DataFrames based on values in columns

Question

I'm trying to match job candidates to mentors based on different several variables that would hopefully create a good match. There are two Pandas DataFrames (one for candidates and one for mentors) that I'm trying to connect based on experience, location, desired job, etc. For example I have a mentor DataFrame that might look something like the below: Along with

Accepted Answer

@Henry is on the right path. You&#8217;ll need to modify your candidate dataframe to a) make sure all arrays are the same length (or add NaNs if you don&#8217;t have them, and b) tweak a bit to make sure you actually have some matches.I used your mentor_df, and the following candidate_df:    candidate_df = pd.DataFrame({          "Candidate":["Candidate 1", "Candidate 2", "Candidate 3", "Candidate 4"],          "Experience":[4, 4, 5, 4],          "Location": ["US", "FR", "JP", "US"],          "Industry": ["Tech", "Media", "Medicine", "Medicine"]        })Then the merge works fine:merged = mentor_df.merge(candidate_df, how='left')Output: Mentor  Experience Location  Industry    Candidate0    Bob           3       US      Tech          NaN1   Kate           4       FR      Tech          NaN2    Joe           5       JP     Media          NaN3   Mark           4       US  Medicine  Candidate 4Note you need to get to the last row before you have both a candidate and a mentor since this is matching on experience, location, and industry, and unless all three match you get a NaN either in candidate or mentor.Good luck!

Advertisement

Answer