Problem with plotting peaks using find_peaks from SciPy to detect drastic up/down turns or global outliers

Question

Let&#8217;s say I have following dataframe contains value over time or date: I inspired from this answer to detect peaks and valleys via below code: This is the output: The problems: I can&#8217;t figure out how I can configure find_peaks() documentation to reach meaningful/drastic peaks & valley with res…

Accepted Answer

You need to specify height in the same domain as your dataUpper thresohld is not missing, it is on the plot, just all those lines are close to 0 and clutter on the bottom.thresh_top = np.median(x) + 1 * np.std(x)thresh_bottom = np.median(x) - 1 * np.std(x)# (you may want to use std calculated on 10-90 percentile data, without outliers)# Find indices of peakspeak_idx, _ = find_peaks(x, height=thresh_top)# Find indices of valleys (from inverting the signal)valley_idx, _ = find_peaks(-x, height=-thresh_bottom)# Plot signalplt.figure(figsize=(14,12))plt.plot(t, x   , color='b', label='data')plt.scatter(t, x, s=10,c='b',label='value')# Plot thresholdplt.plot([min(t), max(t)], [thresh_top, thresh_top],   '--',  color='r', label='peaks-threshold')plt.plot([min(t), max(t)], [thresh_bottom, thresh_bottom], '--',  color='g', label='valleys-threshold')# Plot peaks (red) and valleys (blue)plt.plot(t[peak_idx], x[peak_idx],     "x", color='r', label='peaks')plt.plot(t[valley_idx], x[valley_idx], "x", color='g', label='valleys')plt.xticks(rotation=45)plt.ylabel('value')plt.xlabel('timestamp')plt.title(f'data over time for username=target')plt.legend( loc='upper left')plt.gcf().autofmt_xdate()plt.show()

Advertisement

Answer