PyTorch: Dataloader for time series task

Question

I have a Pandas dataframe with n rows and k columns loaded into memory. I would like to get batches for a forecasting task where the first training example of a batch should have shape (q, k) with q referring to the number of rows from the original dataframe (e.g. 0:128). The next example should be (128:256, …

Accepted Answer

You can write your analog of the TensorDataset. To do this you need to inherit from the Dataset class.from torch.utils.data import Dataset, DataLoaderclass MyDataset(Dataset):    def __init__(self, data_frame, q):        self.data = data_frame.values        self.q = q    def __len__(self):        return self.data.shape[0] // self.q    def __getitem__(self, index):        return self.data[index * self.q: (index+1) * self.q]

Advertisement

Answer