I’m currently training my first neural network on a larger dataset. I have split my training data into several .npy binary files, each containing a batch of 20k training samples. I load the data from the .npy files, apply some simple pre-processing operations, and then train my network by calling the partial_fit method several times in a loop:
for i in range(50):
    nnmodel.partial_fit(X_tr, Y_tr)
I have already read that the regular .fit() method cannot train incrementally on multiple batches, but that partial_fit should be able to. My first training run always goes well: the loss decreases and I get nice fitting results, so I save my model using the joblib.dump method.
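Stripped down, the first run looks roughly like this (the file names and the pre-processing step are simplified placeholders):

import numpy as np
from joblib import dump
from sklearn.neural_network import MLPRegressor

# Hypothetical file names -- my real batch files are named differently
X_tr = np.load("batch_00_X.npy")  # one batch of 20k samples
Y_tr = np.load("batch_00_Y.npy")

# ... simple pre-processing of X_tr / Y_tr here ...

nnmodel = MLPRegressor(verbose=True)
for i in range(50):
    nnmodel.partial_fit(X_tr, Y_tr)

dump(nnmodel, "nnmodel.joblib")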
For the next run I use exactly the same script: it loads my data from the .npy files (it doesn’t matter whether I feed the same batch or a different one), pre-processes it, this time loads my pre-trained model with joblib.load, and starts the partial_fit loop again.
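The second run differs only in loading the saved model instead of creating a new one (again simplified):

import numpy as np
from joblib import load

# Same or a different batch -- the result is the same either way
X_tr = np.load("batch_01_X.npy")
Y_tr = np.load("batch_01_Y.npy")

# ... same pre-processing as in the first run ...

nnmodel = load("nnmodel.joblib")
for i in range(50):
    nnmodel.partial_fit(X_tr, Y_tr)  # loss stays constant from here on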
What I always get in the second run is a constant loss over all iterations; the error does not decrease anymore, no matter which dataset I use:
Iteration 51, loss = 3.93268978
Iteration 52, loss = 3.93268978
Iteration 53, loss = 3.93268978
Iteration 54, loss = 3.93268978
What am I doing wrong here? Thanks in advance!
Answer
There are several possibilities.
- The model may have converged
- There may not be enough passes over the batches (in the example below the model doesn’t converge until ~500 iterations)
- (Need more info) joblib.dump and joblib.load may be saving or loading the model in an unexpected way; see the round-trip check after this list
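For the third point, one way to narrow it down (a minimal sketch, not part of the original setup) is to verify that a dump/load round trip preserves the weights and that training can continue afterwards:

import numpy as np
from joblib import dump, load
from sklearn.datasets import make_regression
from sklearn.neural_network import MLPRegressor

X, y = make_regression(n_samples=1000, random_state=42)
regr = MLPRegressor().partial_fit(X, y)

dump(regr, "checkpoint.joblib")      # hypothetical checkpoint file
restored = load("checkpoint.joblib")

# The weights should survive the round trip bit-for-bit
for w_orig, w_restored in zip(regr.coefs_, restored.coefs_):
    assert np.array_equal(w_orig, w_restored)

# Training should continue from the saved state: the loss should
# keep moving, not freeze at a constant value
before = restored.loss_
restored.partial_fit(X, y)
print(before, "->", restored.loss_)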
Instead of calling a script multiple times and dumping the results between iterations, it might be easier to debug if initializing/preprocessing/training/visualizing all happens in one script. Here is a minimal example:
import matplotlib.pyplot as plt
from sklearn.neural_network import MLPRegressor
from sklearn.datasets import make_regression
from sklearn.model_selection import train_test_split
X, y = make_regression(n_samples=10000, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y)
regr = MLPRegressor()
losses = []
test_performance = []
for _ in range(100):
    # Make 100 passes over the batches
    for batch in range(500, 7501, 500):
        # Perform partial fits on batches of 500 examples
        # Simulate batches; these could also be loaded from `.npy`
        X_train_batch = X_train[batch-500:batch]
        y_train_batch = y_train[batch-500:batch]
        regr.partial_fit(X_train_batch, y_train_batch)
        losses.append(regr.loss_)
        test_performance.append(regr.score(X_test, y_test))
# Plotting results:
fig, (ax1, ax2) = plt.subplots(1, 2)
ax1.title.set_text("Training Loss")
ax2.title.set_text("Score on test set")
ax1.plot(range(len(losses)), losses)
ax2.plot(range(len(test_performance)), test_performance)
plt.show()
Output: