I am trying to convert a dict to Pandas DataFrame as the following: And when I print out the DataFrame, I see the following output: I expect to see 1 row only in the DataFrame but it gives 5. And I cannot understand why. What am I doing wrong here? Answer You're not doing anything wrong. Since tags is a

Converting dict to DataFrame gives too many rows

I am trying to convert a dict to Pandas DataFrame as the following:

dff = pd.DataFrame(
{
'CEO': 'ucMMe Mhll', 
'address': 'vs5dlt3 B Se1kC eve0nre', 
'address2': '-', 
'city': 'a CSatanral', 
'companyName': 'Agilent Technologies Inc.', 
'country': 'nUatei tdetSs', 
'description': "tns oo el' yty", 
'employees': 17124, 
'exc': 'gdgdgd', 
'industry': 'sgeiTeotiroaLbtans r', 
'issueType': 'abc', 
'phone': '14087832319', 
'primarySicCode': 4008, 
'sector': ',atnSii Scilcofe,nnse TecisaPliinafs cedorhv cre', 
'securityName': 'elooIne.nen htc iisTcgAgl', 
'state': 'ailairofnC', 
'symbol': 'A', 
'tags': ['nllh he', 'gth', 'acsl', 'isiad', 'nr aitT'], 
'website': 'win.gcm.', 
'zip': '0752501-19'} )

JavaScript
​x
 
dff = pd.DataFrame(
{
'CEO': 'ucMMe Mhll', 
'address': 'vs5dlt3 B Se1kC eve0nre', 
'address2': '-', 
'city': 'a CSatanral', 
'companyName': 'Agilent Technologies Inc.', 
'country': 'nUatei tdetSs', 
'description': "tns oo el' yty", 
'employees': 17124, 
'exc': 'gdgdgd', 
'industry': 'sgeiTeotiroaLbtans r', 
'issueType': 'abc', 
'phone': '14087832319', 
'primarySicCode': 4008, 
'sector': ',atnSii Scilcofe,nnse TecisaPliinafs cedorhv cre', 
'securityName': 'elooIne.nen htc iisTcgAgl', 
'state': 'ailairofnC', 
'symbol': 'A', 
'tags': ['nllh he', 'gth', 'acsl', 'isiad', 'nr aitT'], 
'website': 'win.gcm.', 
'zip': '0752501-19'} )
​

And when I print out the DataFrame, I see the following output:

print(dff)

JavaScript
 
print(dff)
​

I expect to see 1 row only in the DataFrame but it gives 5. And I cannot understand why. What am I doing wrong here?

Answer

You’re not doing anything wrong. Since tags is a list, Pandas broadcasts all other fields to same size as tags and make a dataframe. You can do:

pd.Series(your_dict).to_frame().T

JavaScript
 
pd.Series(your_dict).to_frame().T
​

Or wrap your dict around [] indicating it’s a row (record orient):

pd.DataFrame([your_dict])

JavaScript
 
pd.DataFrame([your_dict])
​

Advertisement

Answer