I have a dataset of 12 values, with the index being a datetime64
type and I want to plot a bar graph of the data with the x-ticks
showing the Month in English. I have used the MonthLocator
and DateFormatter
functions of matplotlib. These are working for one dataset but not with the other one. The x-ticks months are labelled wrongly. January should be the first index.
Dataset –> full_corr
JavaScript
x
15
15
1
corr
2
timestamp
3
2010-01-31 0.367613
4
2010-02-28 0.178960
5
2010-03-31 0.217788
6
2010-04-30 0.146214
7
2010-05-31 0.201297
8
2010-06-30 0.609486
9
2010-07-31 0.659257
10
2010-08-31 0.397254
11
2010-09-30 0.729701
12
2010-10-31 0.916465
13
2010-11-30 0.533646
14
2010-12-31 0.893937
15
Code used –>
JavaScript
1
7
1
plt.bar(full_corr.index, full_corr['corr'], width=10) # some bugs are there
2
ax = plt.gca()
3
locator = mdates.MonthLocator()
4
month_fmt = mdates.DateFormatter('%b')
5
ax.xaxis.set_major_locator(locator)
6
ax.xaxis.set_major_formatter(month_fmt)
7
Output is –> Output Plot
But when I plot the dataframe directly by using df.plot(kind="bar")
, the x-ticks are showed properly in the full datetime format.
Advertisement
Answer
The problem is that 2010-01-31 is too near to 2010-02-01. So when you set width to 10, it overlays Feb.
i.stack.imgur.com/EDewS.png
A soution to solve this is to convert 2010-01-31 to 2010-01.
pd.to_datetime
is used to convert series todatetime
.pd.Series.dt.strftime
convertsdatetime
series to a string in our desired format.
JavaScript
1
24
24
1
import pandas as pd
2
import datetime as datetime
3
import matplotlib.pyplot as plt
4
import matplotlib.dates as mdates
5
6
full_corr = pd.read_csv("1.csv")
7
8
# Below two lines are same with full_corr['timestamp'] = pd.to_datetime(full_corr['timestamp']).dt.strftime('%Y-%m')
9
full_corr['timestamp'] = pd.to_datetime(full_corr['timestamp'])
10
full_corr['timestamp'] = full_corr['timestamp'].apply(lambda x: datetime.datetime.strftime(x, '%Y-%m'))
11
12
full_corr['timestamp'] = pd.to_datetime(full_corr['timestamp'])
13
14
plt.bar(full_corr['timestamp'], full_corr['corr'], width=10) # some bugs are there
15
ax = plt.gca()
16
17
locator = mdates.MonthLocator()
18
month_fmt = mdates.DateFormatter('%b')
19
20
ax.xaxis.set_major_locator(locator)
21
ax.xaxis.set_major_formatter(month_fmt)
22
23
plt.show()
24