Smoothing Categorical Output

Question

I have a list of outputs obtained from a cow behavior detection model. Even in a video when a cow is laying, often time it identifies as standing and vice versa. In each video frame, a classification result is given by the model and we are appending it into a list. Let's assume after 20 frames, we have a series

Accepted Answer

I used the following solution &#8211;import scipy.statswindow_length = 7behave = ["stand","stand","stand","stand","lying","lying", "eating"]most_freq_val = lambda x: scipy.stats.mode(x)[0][0]smoothed = [most_freq_val(behave[i:i+window_length]) for i in range(0,len(behave)-window_length+1)]I tried the solution posted by Hugolmn but it broke at a point. In the rolling mode, the window width is provided by the user (7 here). In a certain width, if more than one values are present in the same number of times, the code does not work. It&#8217;s more like &#8211; you tried to find the statistical mode (most common item) of a list but it got more than one item with the same highest frequency.

Advertisement

Answer