Skip to content
Advertisement

add_datepart( ) produces KeyError

I’m trying to split a date column in a pandas data frame using add_datepart( ).

JavaScript

However, running trainingSetFirstCycle = add_datepart(trainingSetFirstCycle, trainingSetFirstCycle.date, drop=True) returns this error message:

JavaScript

I checked the documentation to see what I’d done wrong.

image

In the example shown, the definition of the data frame includes a dictionary consisting of the column name “date” and a list containing its first four values. So I reproduced this in my own data frame:

JavaScript

The result was this error message:

AttributeError: 'dict' object has no attribute 'encode'.

So, do you have an idea what it is I’m missing here? Thanks in advance.

Advertisement

Answer

Solved by rewording the parameter “date”

This has been a semantic bug. Python failed to compile trainingSetFirstCycle = add_datepart(trainingSetFirstCycle, trainingSetFirstCycle.date, drop=True) because it didn’t realize that trainingSetFirstCycle.date was supposed to be the column’s name. I changed the parameter to simply "date", which has solved the problem.

Apparently, my misconception was that whenever you give a data frame column as a parameter to a function, you have to refer to it as df.column , because that was the syntax of the function numpy.log(trainingSetFirstCycle.rent_count), where the data frame column “rent_count”, is the parameter of the function numpy.log.

Advertisement