Say I have a pandas Series:
JavaScript
x
15
15
1
index | value
2
-------------
3
0 | 2
4
1 | 0
5
2 | 8
6
3 | 0
7
4 | 1
8
5 | 2
9
6 | 7
10
7 | 4
11
8 | 2
12
9 | 9
13
10 | 0
14
11 | 0
15
I have to get a series (or array) of subrange maximum values. For example, a subrange of 5. For the first element, the value should be max{2, 0, 8, 0, 1} = 8. The second value should be max{0, 8, 0, 1, 2} = 8.
Starting from the 8th element, there are less than 5 elements in the subrange. The value should just be the maximum of the remaining elements.
It should be like:
JavaScript
1
15
15
1
index | value
2
-------------
3
0 | 8
4
1 | 8
5
2 | 8
6
3 | 7
7
4 | 7
8
5 | 9
9
6 | 9
10
7 | 9
11
8 | 9
12
9 | 9
13
10 | 0
14
11 | 0
15
I know we can simply do this by iterating the Series. But as I know, that’s not quite efficient if we use iloc
or iterate by using iterrows()
. Is there any more efficient and elegant way to do this? I heard that vector operation should be very quick. But I haven’t found out how to use that.
Advertisement
Answer
You can check rolling
JavaScript
1
16
16
1
df['value'] = df['value'].iloc[::-1].rolling(5,min_periods=1).max()
2
Out[158]:
3
0 8.0
4
1 8.0
5
2 8.0
6
3 7.0
7
4 7.0
8
5 9.0
9
6 9.0
10
7 9.0
11
8 9.0
12
9 9.0
13
10 0.0
14
11 0.0
15
Name: value, dtype: float64
16