Compare two dataframe columns on a histogram

Question

I have a dataframe that looks similar to:

[sample dataframe not shown]

I am required to give a visual comparison of true and estimated distances. My actual df shape is:

[shape not shown]

How do I show true_distance side by side with estimated_distance on a plot, where one can easily see the difference in each row, considering the size of my df_actual?

Accepted Answer

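Here are some ways to do it.

Method 1

Plot both columns against the row index, with the true distances as a line and the estimated distances as markers.

    import matplotlib.pyplot as plt

    # Line for the true values, 'o' markers for the estimates
    plt.plot(df.true_distance)
    plt.plot(df.estimated_distance, 'o')
    plt.show()

Method 2

Draw both columns as scatter plots over the row position, in different colours.

    import matplotlib.pyplot as plt
    import numpy as np

    def plotGraph(y_test, y_pred, regressorName):
        # Blue points: true values; red points: predicted values
        plt.scatter(range(len(y_test)), y_test, color='blue')
        plt.scatter(range(len(y_pred)), y_pred, color='red')
        plt.title(regressorName)
        plt.show()

    plotGraph(df.true_distance, df.estimated_distance, "test")

    # To try the function without a dataframe, pass synthetic data instead:
    # y_test = range(10)
    # y_pred = np.random.randint(0, 10, 10)
    # plotGraph(y_test, y_pred, "test")

Method 3

Plot the estimates against the true values on log-log axes. Points on the diagonal line are rows where the estimate equals the true value.

    import matplotlib.pyplot as plt

    plt.figure(figsize=(10, 10))
    plt.scatter(df.true_distance, df.estimated_distance, c='crimson')
    plt.yscale('log')
    plt.xscale('log')

    # Diagonal reference line spanning the full range of both columns
    p1 = max(max(df.estimated_distance), max(df.true_distance))
    p2 = min(min(df.estimated_distance), min(df.true_distance))
    plt.plot([p1, p2], [p1, p2], 'b-')

    plt.xlabel('True Values', fontsize=15)
    plt.ylabel('Predictions', fontsize=15)
    plt.axis('equal')
    plt.show()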
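Since the title asks about a histogram, one further option is to overlay the two distributions and, next to them, plot the per-row difference. The following is only a minimal sketch, not part of the original answer: the helper name compare_columns, the bin count, and the randomly generated stand-in dataframe are all assumptions made for illustration, since the real data is not shown.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; remove this line to get a plot window
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd


def compare_columns(df, true_col="true_distance", est_col="estimated_distance", bins=20):
    """Hypothetical helper: overlaid histograms plus a per-row difference plot."""
    diff = df[est_col] - df[true_col]
    fig, (ax_hist, ax_diff) = plt.subplots(1, 2, figsize=(12, 5))

    # Shared bin edges so the two histograms are directly comparable
    lo = min(df[true_col].min(), df[est_col].min())
    hi = max(df[true_col].max(), df[est_col].max())
    edges = np.linspace(lo, hi, bins + 1)
    ax_hist.hist(df[true_col], bins=edges, alpha=0.5, label=true_col)
    ax_hist.hist(df[est_col], bins=edges, alpha=0.5, label=est_col)
    ax_hist.set_xlabel("distance")
    ax_hist.set_ylabel("count")
    ax_hist.legend()

    # Estimated minus true for every row; points far from zero are large errors
    ax_diff.plot(df.index, diff, ".", markersize=3)
    ax_diff.axhline(0, color="black", linewidth=1)
    ax_diff.set_xlabel("row")
    ax_diff.set_ylabel(f"{est_col} - {true_col}")
    return fig, diff


# Stand-in data (an assumption): true distances plus Gaussian noise as estimates
rng = np.random.default_rng(0)
true = rng.uniform(1, 100, size=1000)
df = pd.DataFrame({
    "true_distance": true,
    "estimated_distance": true + rng.normal(0, 5, size=1000),
})

fig, diff = compare_columns(df)
```

With a real dataframe, call compare_columns(df_actual) and then plt.show() or fig.savefig(...) to view the result. Small markers are used in the difference panel so that it stays readable when there are many rows.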
