Is there a way to merge on Interval Index and another Column Value in pandas?

Question

So I currently have 2 dataframes. These have different columns and what I have been trying to figure out is how to merge on an interval index as well as a unique ID value. Below are 2 different examples of the dataframes I have: Creating the dataframe: Creating the dataframe: What I want to do is to be able to

Accepted Answer

Merge your dataframe on your UniqueID column then check if Trip_Date is between Start_Date and End_date. Finally, set to nan all rows where the condition is not met:# Only if this columns have not datetime64 dtypedf1['Start_Date'] = pd.to_datetime(df1['Start_Date'], dayfirst=True)df1['End_Date'] = pd.to_datetime(df1['End_Date'], dayfirst=True)df2['Trip_Date'] = pd.to_datetime(df2['Trip_Date'], dayfirst=True)out = pd.merge(df1, df2, on='UniqueID', how='left')m = out['Trip_Date'].between(out['Start_Date'], out['End_Date'])out.loc[~m, ['Trip_Date', 'Value']] = np.NaNOutput:>>> out  UniqueID Start_Date   End_Date  Trip_Date  Value0      ID1 2020-01-01 2020-08-01 2020-02-10    1.01      ID1 2020-01-01 2020-08-01 2020-02-15  207.02      ID2 2020-02-01 2020-04-01 2020-03-06   10.03      ID3 2020-03-01 2020-05-01        NaT    NaN4      ID4 2020-04-01 2020-09-01        NaT    NaN5      ID5 2020-05-01 2020-10-01        NaT    NaN6      ID6 2020-06-01 2020-11-01        NaT    NaN

Advertisement

Answer