I’m just starting using Matplotlib, and I’m trying to learn how to draw a box plot in Python using Colab.
My problem is: I’m not able to put the median on the graph. The graph just showed the quartiles, mean, and outliers. Can someone help me?
My code is the following.
JavaScript
x
28
28
1
from google.colab import auth
2
auth.authenticate_user()
3
import gspread
4
import numpy as np
5
from oauth2client.client import GoogleCredentials
6
gc = gspread.authorize(GoogleCredentials.get_application_default())
7
8
import pandas as pd
9
import seaborn as sns
10
import matplotlib.pyplot as pl
11
sns.set_theme(style="ticks", color_codes=True)
12
13
wb = gc.open_by_url('URL_JUST_FOR_EXAMPLE')
14
boxplot = wb.worksheet('control-Scale10to100')
15
boxplotData = boxplot.get_all_values()
16
df = pd.DataFrame(boxplotData[1:], columns=boxplotData[0])
17
df.drop(df.columns[0], 1, inplace=True)
18
df = df.apply(pd.to_numeric, errors='ignore')
19
df.dtypes
20
21
df.describe()
22
23
dfBoxPlotData = df.iloc[:,4:15]
24
dfBoxPlotData.apply(pd.to_numeric)
25
dfBoxPlotData.head()
26
props = dict(whiskers="Black", medians="Black", caps="Black")
27
ax = df.plot.box(rot=90, fontsize=14, figsize=(15, 8), color=props, patch_artist=True, grid=False, meanline=True, showmeans=True, meanprops=dict(color='red'))
28
Advertisement
Answer
I tried running your code with a sample data set where the mean and median are distinct, and like @tdy showed, as long as the parameters showmeans=True
and meanline=True
are being passed to the df.plot.box
method, the mean and median should both show up. Is it possible that in your data set, the mean and median are close enough together that they’re hard to distinguish?
JavaScript
1
14
14
1
import numpy as np
2
import pandas as pd
3
import seaborn as sns
4
import matplotlib.pyplot as pl
5
6
mu, sigma = 50., 10. # mean and standard deviation
7
np.random.seed(42)
8
s = np.random.normal(mu, sigma, 30)
9
10
df = pd.DataFrame({'values':s})
11
props = dict(whiskers="Black", medians="Black", caps="Black")
12
ax = df.plot.box(rot=90, fontsize=14, figsize=(15, 8), color=props, patch_artist=True, grid=False, meanline=True, showmeans=True, meanprops=dict(color='red'))
13
pl.show()
14