How can I get automatic features with dfs, using Featuretools, when I have only one DataFrame?

Question

I am trying to figure out how Featuretools works, and I am testing it on the Housing Prices dataset from Kaggle. Because the dataset is huge, I'll work here with only a subset of it. The dataframe is: I set the dataframe properties: Then call the dfs method: I get the following warning: UnusedPrimitiveWarning: Some specified primitives were not used

Accepted Answer

Aggregation primitives cannot create features on an EntitySet with a single DataFrame. This is because the aggregation they perform occurs over the one-to-many relationship that exists when you have a parent-child relationship between DataFrames in an EntitySet. The Featuretools guide on primitives has a section that explains the difference here.

With your data, that might look like a child DataFrame with a non-unique house_id column. Then, running dfs on your train DataFrame would aggregate the desired information for each Id, using every row in which that Id appears in the child DataFrame.

To get automated feature generation with a single DataFrame, you should use Transform features. The available Transform Primitives can be found here.

Answer