loading EMNIST-letters dataset

Question

I have been trying to find a way to load the EMNIST-letters dataset but without much success. I have found interesting stuff in the structure and can&#8217;t wrap my head around what is happening. Here is what I mean: I downloaded the .mat format in here I can load the data using it is a dictionnary with the …

Accepted Answer

Because of the way the dataset is structured, the array of image arrays can be accessed with mat['dataset'][0][0][0][0][0][0] and the array of label arrays with mat['dataset'][0][0][0][0][0][1]. For instance, print(mat['dataset'][0][0][0][0][0][0][0]) will print out the pixel values of the first image, and print(mat['dataset'][0][0][0][0][0][1][0]) will print the first image&#8217;s label. For a less&#8230;convoluted dataset, I&#8217;d actually recommend using the CSV version of the EMNIST dataset on Kaggle: https://www.kaggle.com/crawford/emnist, where each row is a separate image, there are 785 columns where the first column = class_label and each column after represents one pixel value (784 total for a 28 x 28 image).

Advertisement

Answer