How to include future values in a time series prediction of a RNN in Keras

Question

I currently have a RNN model for time series predictions. It uses 3 input features "value", "temperature" and "hour of the day" of the last 96 time steps to predict the next 96 time steps of the feature "value". Here you can see a schema of it: and here you have the current code: Here you have some test data

Accepted Answer

The standard approach is to use an encoder-decoder architecture (see 1 and 2 for instance):The encoder takes as input the past values of the features and of the target and returns an output representation.The decoder takes as input the encoder output and the future values of the features and returns the predicted values of the target.You can use any architecture for the encoder and for the decoder and you can also consider different approaches for passing the encoder output to the decoder (e.g. adding or concatenating it to the decoder input features, adding or concatenating it to the output of some intermediate decoder layer, or adding it to the final decoder output), the code below is just an example.import pandas as pdimport numpy as npimport matplotlib.pyplot as pltfrom sklearn.preprocessing import StandardScalerfrom tensorflow.keras.layers import Input, Dense, LSTM, TimeDistributed, Concatenate, Addfrom tensorflow.keras.models import Modelfrom tensorflow.keras.optimizers import Adam# define the inputstarget = ['value']features = ['temperatures', 'hour of the day']sequence_length = 96# import the datadf = pd.read_csv('TestData.csv', sep=';', header=0, low_memory=False, infer_datetime_format=True, parse_dates={'datetime': [0]}, index_col=['datetime'])# scale the datatarget_scaler = StandardScaler().fit(df[target])features_scaler = StandardScaler().fit(df[features])df[target] = target_scaler.transform(df[target])df[features] = features_scaler.transform(df[features])# extract the input and output sequencesX_encoder = []  # past features and target valuesX_decoder = []  # future features valuesy = []          # future target valuesfor i in range(sequence_length, df.shape[0] - sequence_length):    X_encoder.append(df[features + target].iloc[i - sequence_length: i])    X_decoder.append(df[features].iloc[i: i + sequence_length])    y.append(df[target].iloc[i: i + sequence_length])X_encoder = np.array(X_encoder)X_decoder = np.array(X_decoder)y = np.array(y)# define the encoder and decoderdef encoder(encoder_features):    y = LSTM(units=100, return_sequences=True)(encoder_features)    y = TimeDistributed(Dense(units=1))(y)    return ydef decoder(decoder_features, encoder_outputs):    x = Concatenate(axis=-1)([decoder_features, encoder_outputs])    # x = Add()([decoder_features, encoder_outputs])     y = TimeDistributed(Dense(units=100, activation='relu'))(x)    y = TimeDistributed(Dense(units=1))(y)    return y# build the modelencoder_features = Input(shape=X_encoder.shape[1:])decoder_features = Input(shape=X_decoder.shape[1:])encoder_outputs = encoder(encoder_features)decoder_outputs = decoder(decoder_features, encoder_outputs)model = Model([encoder_features, decoder_features], decoder_outputs)# train the modelmodel.compile(optimizer=Adam(learning_rate=0.001), loss='mse')model.fit([X_encoder, X_decoder], y, epochs=100, batch_size=128)# extract the last predicted sequencey_true = target_scaler.inverse_transform(y[-1, :])y_pred = target_scaler.inverse_transform(model.predict([X_encoder, X_decoder])[-1, :])# plot the last predicted sequenceplt.plot(y_true.flatten(), label='actual')plt.plot(y_pred.flatten(), label='predicted')plt.show()In the example above the model takes two inputs, X_encoder and X_decoder, so in your case when generating the forecasts you can use the past observed temperatures in X_encoder and the future temperature forecasts in X_decoder.

Advertisement

Answer