I am trying to develop some code that extracts the power price when a power plant starts up. To give an example refer to the following data frame.
data = {
'Power_Price': [10, 11,15, 33, 50, 10, 12, 20, 17],
'Plant_Ops_1': [0, 0, 10, 10, 10, 0, 0, 10, 10],
'Plant_Ops_2': [0, 0, 0, 50, 50, 0, 0, 0, 0]
}
df = pd.DataFrame (data, columns = ['Power_Price','Plant_Ops_1','Plant_Ops_2'])
Based on this I aiming to develop some code that would store in a dataframe the power price when the plant ops columns transitions from 0 to a number greater than 0 (i.e. when the power plant starts). In the case of the data above the output would look something along the lines of:
data_out = {
'Plant': ['Plant_Ops_1', 'Plant_Ops_1', 'Plant_Ops_2'],
'Power_price': [15, 20, 33]
}
df_out = pd.DataFrame (data_out, columns = ['Plant','Power_price'])
Hopefully this makes sense. Certainly welcome any advice or guidance you are able to provide.
Advertisement
Answer
Use DataFrame.melt with filter rows with shifted per groups equal 0 and also greater like 0 in boolean indexing:
df = df.melt('Power_Price', var_name='Plant')
df = df[df.groupby('Plant')['value'].shift().eq(0) & df['value'].gt(0)].drop('value',axis=1)
print (df)
Power_Price Plant
2 15 Plant_Ops_1
7 20 Plant_Ops_1
12 33 Plant_Ops_2
Last if necessary change order of columns:
df = df[["Plant", "Power_Price"]]