Skip to content
Advertisement

Is it possible to create an additional pct_change column for each column of the dataframe?

I tried to solve this on my own and have searched other topics for help, yet, my problem remains. If anyone can help me or point me to the right direction I would appreciate

I’m fairly new to python and I’m trying to perform some changes on a pandas dataframe. To summarize, I want to verify percental changes over sales data.

I’m aware of the pct_change method and below is what I’ve tried.

This is a sample data that looks like my original dataframe:

JavaScript

I made a function to calculate variations over rows. it looks like this:

JavaScript

Then, using:

JavaScript

it gave me something like the following:

JavaScript

That’s not really what I want. I need to see changes for each store alone (store_id), where this type of configuration calculates over rows, no matter from which store it is.

Moving forward I tried this:

JavaScript

Finally I get my actual dataframe, the one I’m stuck with and it’s kind of like this:

JavaScript

Now the dataframe is how I need it to be, but I haven’t found a way to implement pct_change the way I want, which would be something like this, adding a percental change column for each existing column (these are dummy numbers, it’s just a visual representation of how I’d like it to be):

JavaScript

Is is even possible to do so?

Advertisement

Answer

You can use the below:

sales value should be converted to a numeric and date should be changed to datetime, then the data should be sorted. If all of these are already done , you can skip this block:

JavaScript

Calculate pct_change for each group and then unstack

JavaScript

JavaScript
day 1 pct 1 sales_value 6 pct 6 sales_value
2020-12-22 0.0 54141.0 0.0 25802.0
2020-12-23 46.0 78921.0 -11.0 22991.0
2020-12-24 0.0 0.0 -74.0 5894.0
User contributions licensed under: CC BY-SA
10 People found this is helpful
Advertisement