I need to do some coding in python and I can’t do this code:
JavaScript
x
7
1
import pandas as pd
2
3
df = pd.DataFrame({'Interview':['1','1','2','2','3','3','4','4','5','5'],
4
'Sequence':['1st','2nd','1st','2nd','1st','2nd','1st','2nd','1st','2nd'],
5
'Product':['Orange','Banana','Banana','Orange','Apple','Banana','Apple','Apple','Orange','Apple'],
6
'Value':[6,8,5,4,3,5,7,6,7,9]})
7
I need to do something like this as result:
JavaScript
1
4
1
sequence = pd.DataFrame({'1st':['Orange','Orange','Orange','Banana','Banana','Banana','Apple','Apple','Apple'],
2
'2nd':['Orange','Banana','Apple','Orange','Banana','Apple','Orange','Banana','Apple'],
3
'Value':[0,14,16,9,0,0,0,8,13]})
4
For me the sequence matters most in my analysis. It’s a sum of the results in interviews.
Thanks guys for the help!
Advertisement
Answer
Here is another approach using reindex
and unstack
:
JavaScript
1
11
11
1
df2 = df.set_index(['Interview', 'Sequence']).unstack()
2
result = df2.Product.join(df2.Value.sum(1).rename("Value"))
3
4
# 1st 2nd Value
5
# Interview
6
# 1 Orange Banana 14
7
# 2 Banana Orange 9
8
# 3 Apple Banana 8
9
# 4 Apple Apple 13
10
# 5 Orange Apple 16
11