Plotting Pandas DataFrame from pivot

Question

I am trying to plot a line graph comparing the Murder Rates of particular States through the years 1960-1962 using Pandas in a Jupyter Notebook. A little context about where I am now, and how I arrived here: I'm using a crime csv file, which looks like this: I'm only interested in 3 columns for the time being: State, Year,

Accepted Answer

Given a dataframe in a long (tidy) format, pandas.DataFrame.pivot is used to transform to a wide format, which can be plotted directly with pandas.DataFrame.plotTested in python 3.8.11, pandas 1.3.3, matplotlib 3.4.3import numpy as npimport pandas as pdcontrol_1960_to_1962 = pd.DataFrame({    'State': np.repeat(['Alaska', 'Maine', 'Michigan', 'Minnesota', 'Wisconsin'], 3),    'Year': [1960, 1961, 1962]*5,    'Murder Rate': [10.2, 11.5, 4.5, 1.7, 1.6, 1.4, 4.5, 4.1, 3.4, 1.2, 1.0, .9, 1.3, 1.6, .9]})df = control_1960_to_1962.pivot(index='Year', columns='State', values='Murder Rate')# display(df)State  Alaska  Maine  Michigan  Minnesota  WisconsinYear                                                1960     10.2    1.7       4.5        1.2        1.31961     11.5    1.6       4.1        1.0        1.61962      4.5    1.4       3.4        0.9        0.9The plotsYou can tell Pandas (and through it the matplotlib package that actually does the plotting) what xticks you want explicitly:ax = df.plot(xticks=df.index, ylabel='Murder Rate')Output:ax is a matplotlib.axes.Axes object, and there are many, many customizations you can make to your plot through it.Here&#8217;s how to plot with the States on the x axis:ax = df.T.plot(kind='bar', ylabel='Murder Rate')Output:

Advertisement

Answer

The plots