I have two dataframes which are:
JavaScript
x
11
11
1
Value
2
Date
3
2010-06-29 3
4
2010-06-30 1
5
2010-07-01 5
6
2010-07-02 4
7
2010-07-03 9
8
2010-07-04 7
9
2010-07-05 2
10
2010-07-06 3
11
JavaScript
1
6
1
Value
2
Date
3
2010-06-29 6
4
2010-07-03 1
5
2010-07-06 4
6
The first dataframe could be created with the Python code:
JavaScript
1
12
12
1
import pandas as pd
2
3
df = pd.DataFrame(
4
{
5
'Date': ['2010-06-29', '2010-06-30', '2010-07-01', '2010-07-02', '2010-07-03', '2010-07-04', '2010-07-05', '2010-07-06'],
6
'Value': [3, 1, 5, 4, 9, 7, 2, 3]
7
}
8
)
9
10
df['Date'] = pd.to_datetime(df['Date']).dt.date
11
df = df.set_index('Date')
12
and the second dataframe:
JavaScript
1
10
10
1
df2 = pd.DataFrame(
2
{
3
'Date': ['2010-06-29', '2010-07-03', '2010-07-06'],
4
'Value': [6, 1, 4]
5
}
6
)
7
8
df2['Date'] = pd.to_datetime(df2['Date']).dt.date
9
df2 = df2.set_index('Date')
10
I want to create a second column in the first dataframe and the value of each Date in the new column will be the value of the first Date in the second dataframe equal to or earlier than the Date in the first dataframe.
So, the output is:
JavaScript
1
11
11
1
Value Value_2
2
Date
3
2010-06-29 3 6
4
2010-06-30 1 6
5
2010-07-01 5 6
6
2010-07-02 4 6
7
2010-07-03 9 1
8
2010-07-04 7 1
9
2010-07-05 2 1
10
2010-07-06 3 4
11
Also, it is my priority not to use any for-loops for the code.
How can I do this?
Advertisement
Answer
pd.merge_asof
should suffice for this
JavaScript
1
16
16
1
df.index = pd.to_datetime(df.index)
2
3
df2.index = pd.to_datetime(df2.index)
4
5
pd.merge_asof(df, df2, on='Date')
6
7
Date Value_x Value_y
8
0 2010-06-29 3 6
9
1 2010-06-30 1 6
10
2 2010-07-01 5 6
11
3 2010-07-02 4 6
12
4 2010-07-03 9 1
13
5 2010-07-04 7 1
14
6 2010-07-05 2 1
15
7 2010-07-06 3 4
16