I have a data-frame
JavaScript
x
7
1
ID P_1 P_2
2
1 NaN NaN
3
2 124 342
4
3 NaN 234
5
4 123 NaN
6
5 2345 500
7
I want to make a new column titled P_3 such that:
JavaScript
1
7
1
ID P_1 P_2 P_3
2
1 NaN NaN NaN
3
2 124 342 342
4
3 NaN 234 234
5
4 123 NaN 123
6
5 2345 500 500
7
My conditions are:
JavaScript
1
4
1
if P_1 = Nan , then P_3 == P_2
2
if P_1 != Nan and P_2 != Nan, then P_3 == P_2
3
if P_2 = Nan , then P_3 == P_1
4
I have applied the following codes:
JavaScript
1
11
11
1
conditions = [
2
(df['P_1'] == float('NaN')),
3
(df['P_1'] != float('NaN')) & (df['P_2'] != float('NaN')),
4
(df['P_1'] != float('NaN')) & (df['P_2'] == float('NaN'))
5
]
6
7
values = [df['P_2'], df['P_2'], df['P_1']]
8
9
df['P_3'] = np.select(conditions, values)
10
11
But it gives me the following error:
JavaScript
1
2
1
Length of values does not match length of index
2
Advertisement
Answer
In summary, your unique condition is:
JavaScript
1
2
1
P_3 = P_2 if P_2 != NaN else P_1
2
combine_first
: update null elements with value in the same location in other (ref: Pandas doc.)
JavaScript
1
8
1
>>> df["P_2"].combine_first(df["P_1"])
2
ID
3
1 NaN
4
2 342.0
5
3 234.0
6
4 123.0
7
5 500.0
8