How to slice and calculate the Pearson correlation coefficient between a large array and a small array using “overlapping” windows

Question

Suppose I have two very simple arrays with numpy: I would like to find which slice of array reference has the highest Pearson's correlation coefficient with array probe. To do that, I would like to slice the array reference using some sort of sub-arrays that are overlapped in a for loop, which means I s…

Accepted Answer

You can use sliding_window_view to get the successive overlapping windows. For a vectorized computation of the correlation, use a custom function:

    import numpy as np
    from numpy.lib.stride_tricks import sliding_window_view as swv

    def np_corr(X, y):
        # adapted from https://stackoverflow.com/a/71253141
        # X holds one window per row; y is the 1D array compared against every row
        denom = (np.sqrt((len(y) * np.sum(X**2, axis=-1) - np.sum(X, axis=-1) ** 2)
                         * (len(y) * np.sum(y**2) - np.sum(y)**2)))
        # where= skips zero-variance windows (denom == 0); out= gives those
        # positions a defined value (0) instead of uninitialized memory
        return np.divide((len(y) * np.sum(X * y[None, :], axis=-1) - (np.sum(X, axis=-1) * np.sum(y))),
                         denom, out=np.zeros_like(denom), where=denom!=0
                        )

    corr = np_corr(swv(reference, len(probe)), probe)

Output:

    array([ 1.        ,  1.        , -0.65465367, -0.8660254 ,  0.        ,
            0.8660254 ,  0.91766294,  1.        ,  1.        ])
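To answer the original question of which slice correlates best, the per-window correlations can be passed to np.argmax. The following is a minimal sketch under assumptions, not the answerer's code: the reference and probe arrays are made-up example data (the question's own arrays are not reproduced here), and the correlation is computed with the mean-centered form of Pearson's r, which is algebraically equivalent to the raw-sums formula above. It requires NumPy 1.20 or later for sliding_window_view.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

# Hypothetical example data: reference[3:7] is 2 * probe + 1, so that
# window should correlate perfectly with the probe.
probe = np.array([1.0, 3.0, 2.0, 6.0])
reference = np.array([5.0, 1.0, 4.0, 3.0, 7.0, 5.0, 13.0, 2.0, 8.0, 6.0])

# One row per overlapping window:
# shape (len(reference) - len(probe) + 1, len(probe)).
windows = sliding_window_view(reference, len(probe))

# Pearson r of every window against the probe (mean-centered form).
wc = windows - windows.mean(axis=-1, keepdims=True)
pc = probe - probe.mean()
denom = np.sqrt((wc**2).sum(axis=-1) * (pc**2).sum())
# Zero-variance windows (denom == 0) are reported as 0.
corr = np.divide(wc @ pc, denom, out=np.zeros_like(denom), where=denom != 0)

# Start index of the best-matching window, and the slice itself.
best = int(np.argmax(corr))
best_slice = reference[best:best + len(probe)]

# Cross-check against an explicit loop over the windows with np.corrcoef.
loop_corr = np.array([np.corrcoef(w, probe)[0, 1] for w in windows])
assert np.allclose(corr, loop_corr)

print(best, best_slice)
```

The loop at the end is only a sanity check; for long reference arrays the vectorized version avoids calling np.corrcoef once per window.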

Answer