I don’t know the difference of two ways that I let columns of DataFrame. the codes are here:
JavaScript
x
9
1
# let X be any 2-d array.
2
A=pd.DataFrame(data=X, columns=['ftr1', 'ftr2'])
3
A['ftr3']=1
4
print(A['ftr3'])
5
6
B=pd.DataFrame(data=X, columns=[['ftr1', 'ftr2']])
7
B['ftr3']=1
8
print(B['ftr3'])
9
when I printed A[‘ftr3’] to see elements of ftr3 of A, there was no problem.
But when I printed B[‘ftr3’], the problem occured:
JavaScript
1
2
1
TypeError: only integer scalar arrays can be converted to a scalar index
2
Moreover, the reason I’m confused with this result was that print(A) and print(B) prints exactly same results.
JavaScript
1
7
1
A=pd.DataFrame(data=X, columns=['ftr1', 'ftr2'])
2
A['ftr3']=1
3
print(A)
4
B=pd.DataFrame(data=X, columns=[['ftr1', 'ftr2']])
5
B['ftr3']=1
6
print(B)
7
the results are here:
I don’t get the points..plz help and thank for your reply.
Advertisement
Answer
Use print(B[['ftr3']])
instead of print(B['ftr3'])
The architecture of the columns is not the same : use A.columns
and B.columns
to compare them.
A.columns
will return : Index(['ftr1', 'ftr2', 'ftr3'], dtype='object')
B.columns
will return : MultiIndex([('ftr1',),('ftr2',),('ftr3',)],)