I have this dataframe:
JavaScript
x
5
1
A B
2
0 [0, 1, 2] 1
3
1 foo 1
4
2 [3, 4] 1
5
I would like to use explode function for column “A” and then to keep right and fair proportion for each exploded row in case with column “B” . So the result should look like this:
JavaScript
1
8
1
A B
2
0 0 0.33
3
0 1 0.33
4
0 2 0.33
5
1 foo 1
6
2 3 0.5
7
2 4 0.5
8
Would this be possible with the explode function? I would manage to come to this result with for row in data.itertuples():
but the for loop is so slow in case with large dataframe. So do you have idea how to solve this with explode or with some other fast way?
I would be very grateful with any help.
Advertisement
Answer
You can explode
“A”; then groupby
the index and transform
count
method (to count the number of each index) and divide the elements in 'B'
by their corresponding index count.
JavaScript
1
3
1
out = df.explode('A')
2
out['B'] /= out['B'].groupby(level=0).transform('count')
3
Output:
JavaScript
1
8
1
A B
2
0 0 0.333333
3
0 1 0.333333
4
0 2 0.333333
5
1 foo 1.000000
6
2 3 0.500000
7
2 4 0.500000
8