Identify an embedded image within an image in Python [closed]

Question

I have, for example, the following image: Is there any way

Accepted Answer

Here's my attempt. This is how it works:

1. To detect text, check how different pixels are from their neighbors. This is done using an absolute difference.
2. The previous step only detects the edges of text. Expand this with a Gaussian blur.
3. Threshold this, and remove text.
4. Crop remaining whitespace.

It uses numpy, opencv, and scipy to do it.

Full code:

    import numpy as np
    import cv2 as cv
    import matplotlib.pyplot as plt
    import scipy.ndimage

    img_orig = cv.imread('image_extract.png')

    def find_text(gray, gaussian_size_px=10, text_threshold=10):
        # int16 so the subtraction below cannot wrap around as it would in uint8
        flat = gray.flatten().astype('int16')
        # difference each pixel against the previous pixel in the flattened
        # array (np.roll wraps around at the row ends)
        differenced_image = np.abs(flat - np.roll(flat, 1)).reshape(gray.shape)
        differenced_image = scipy.ndimage.gaussian_filter(differenced_image, sigma=gaussian_size_px)
        is_text = differenced_image > text_threshold
        return is_text

    def remove_text(img, minpool_size=3):
        is_text = find_text(img)
        image_only = np.where(is_text, 0, img)
        # filter out small bright pixels with convolved minimum
        image_only = scipy.ndimage.minimum_filter(image_only, size=minpool_size)
        return image_only

    def find_subimage(img_orig):
        gray = cv.cvtColor(img_orig, cv.COLOR_BGR2GRAY)
        # invert colors so white = 0 and black = 255
        gray = np.max(gray) - gray

        image_only = remove_text(gray)

        # bounding box of everything that is left after text removal
        coords = cv.findNonZero(image_only)
        x, y, w, h = cv.boundingRect(coords)
        cropped = img_orig[y:y+h, x:x+w]
        return cropped

    # OpenCV loads images as BGR; matplotlib expects RGB
    plt.imshow(cv.cvtColor(find_subimage(img_orig), cv.COLOR_BGR2RGB))

Output: [image]
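If you want to see what the detection step does without the original picture, here is a minimal sketch on a synthetic image. Everything in it is made up for illustration: `text_mask` is a hypothetical helper that mirrors `find_text`, the stripes stand in for text, and the flat block stands in for the embedded image. One deliberate difference from the answer's code is that the blur runs in float rather than int16.

```python
import numpy as np
import scipy.ndimage


def text_mask(gray, sigma=10, threshold=10):
    """Hypothetical helper mirroring find_text: True where pixels look like text."""
    # int16 so the subtraction cannot wrap around as it would in uint8
    flat = gray.flatten().astype("int16")
    # absolute difference to the previous pixel in row-major order
    diff = np.abs(flat - np.roll(flat, 1)).reshape(gray.shape)
    # blur in float (an assumption of this sketch; the answer's code keeps int16)
    blurred = scipy.ndimage.gaussian_filter(diff.astype(float), sigma=sigma)
    return blurred > threshold


# Synthetic, already inverted test image: background is 0, content is bright.
img = np.zeros((120, 160), dtype=np.uint8)
img[10:20, 20:140:2] = 255   # thin vertical stripes standing in for text
img[70:110, 60:100] = 200    # flat block standing in for the embedded picture

mask = text_mask(img)
print("stripe centre flagged as text:", bool(mask[14, 80]))
print("block centre flagged as text: ", bool(mask[90, 80]))
```

With these values the stripes should be flagged and the interior of the block should not, because only the block's left and right borders produce a nonzero difference. On a real image with strong edges close together, `sigma` and `threshold` will probably need tuning.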


Answer