I have images that need to be cropped to perfect passport-size photos. I have thousands of images that need to be cropped and straightened automatically like this. If an image is too blurry to crop, I need it to be copied to a rejected folder. I tried doing this with a Haar cascade, but that approach only gives me the face. I need the face together with the cropped photo background. Can anyone tell me how I can code this in OpenCV or anything else?
import cv2

# img, x1, y1, y2, path and rejectedCount are defined elsewhere in my script

gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
faceCascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")
faces = faceCascade.detectMultiScale(
    gray, scaleFactor=1.3, minNeighbors=3, minSize=(30, 30))

if len(faces) == 1:
    for (x, y, w, h) in faces:
        if x - w < 100 and y - h < 100:
            ystart = int(y - y * int(y1) / 100)
            xstart = int(x - x * int(x1) / 100)
            yend = int(h + h * int(y1) / 100)
            xend = int(w + w * int(y2) / 100)
            roi_color = img[ystart:y + yend, xstart:x + xend]
            cv2.imwrite(path, roi_color)
        else:
            rejectedCount += 1
            cv2.imwrite(path, img)
Before / after example images
Answer
If all photos have that thin white-black border around them, you can just
- threshold the pictures
- get all contours
- select those contours that
  - have the correct gradient
  - are large enough
  - reduce to 4 corners when passed through approxPolyDP
- get an oriented bounding box
- construct an affine transformation
- apply the affine transformation
If those photos aren’t scans but taken with a camera from an angle (not top-down), you’ll need to use a perspective transformation calculated from the corner points themselves.
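For that perspective case, here is a minimal sketch, assuming the four corner points from approxPolyDP have already been ordered top-left, top-right, bottom-right, bottom-left and the output size has been estimated (e.g. from the bounding box). The helper name and its parameters are placeholders, not part of the code further down:

import numpy as np
import cv2 as cv

def warp_perspective_crop(im, corners, bwidth, bheight):
    # corners: 4 points from cv.approxPolyDP, ordered TL, TR, BR, BL (assumption)
    src = np.asarray(corners, dtype=np.float32).reshape(4, 2)
    dst = np.float32([[0, 0],
                      [bwidth - 1, 0],
                      [bwidth - 1, bheight - 1],
                      [0, bheight - 1]])
    H = cv.getPerspectiveTransform(src, dst)  # 3x3 homography from 4 point pairs
    return cv.warpPerspective(im, H, (int(bwidth), int(bheight)), flags=cv.INTER_CUBIC)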
If the photos aren’t flat but warped, that’s an entirely different problem.
import numpy as np
import cv2 as cv

im = cv.imread("Zh8QV.jpg")
gray = cv.cvtColor(im, cv.COLOR_BGR2GRAY)
gray = 255 - gray  # invert so findContours' implicit black border doesn't bother us

height, width = gray.shape
minarea = (height * width) * 0.20

# (th_level, thresholded) = cv.threshold(gray, thresh=128, maxval=255, type=cv.THRESH_OTSU)

# threshold relative to estimated brightness of "white"
th_level = 255 - (255 - np.median(gray)) * 0.98
(th_level, thresholded) = cv.threshold(gray, thresh=th_level, maxval=255, type=cv.THRESH_BINARY)

(contours, hierarchy) = cv.findContours(thresholded, mode=cv.RETR_LIST, method=cv.CHAIN_APPROX_SIMPLE)

# black-to-white contours have negative area...
#areas = sorted([cv.contourArea(c, oriented=True) for c in contours])

large_areas = [
    c for c in contours
    if cv.contourArea(c, oriented=True) <= -minarea
]

quads = [
    c for c in large_areas
    if len(cv.approxPolyDP(c, epsilon=0.02 * cv.arcLength(c, True), closed=True)) == 4
]

# if there is no quad, or multiple, that's an error (for this example)
assert len(quads) == 1, quads
[quad] = quads

bbox = cv.minAreaRect(quad)
(bcenter, bsize, bangle) = bbox
bcenter = np.array(bcenter)
bsize = np.array(bsize)

# keep orientation upright, fix up bbox size
(rot90, bangle) = divmod(bangle + 45, 90)
bangle -= 45
if rot90 % 2 != 0:
    bsize = bsize[::-1]

# construct affine transformation
M1 = np.eye(3)
M1[0:2, 2] = -bcenter

R = np.eye(3)
R[0:2] = cv.getRotationMatrix2D(center=(0, 0), angle=bangle, scale=1.0)

M2 = np.eye(3)
M2[0:2, 2] = +bsize * 0.5

M = M2 @ R @ M1

bwidth, bheight = np.ceil(bsize)
dsize = (int(bwidth), int(bheight))

output = cv.warpAffine(im, M[0:2], dsize=dsize, flags=cv.INTER_CUBIC)

cv.imshow("output", output)
cv.waitKey(-1)
cv.destroyWindow("output")
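For the batch-and-reject part of your question, here is a minimal sketch of the surrounding loop, assuming the pipeline above is wrapped in a hypothetical crop_photo(im) function that raises when it can't find exactly one quad. The folder names, the Laplacian-variance blur check and its threshold are my own assumptions, not part of the code above:

import shutil
from pathlib import Path

import cv2 as cv

INPUT_DIR = Path("input")       # assumed folder layout
OUTPUT_DIR = Path("cropped")
REJECTED_DIR = Path("rejected")
BLUR_THRESHOLD = 100.0          # assumed value; tune it on your own scans

OUTPUT_DIR.mkdir(exist_ok=True)
REJECTED_DIR.mkdir(exist_ok=True)

for path in INPUT_DIR.glob("*.jpg"):
    im = cv.imread(str(path))
    gray = cv.cvtColor(im, cv.COLOR_BGR2GRAY)
    # variance of the Laplacian as a rough sharpness measure (assumption)
    sharpness = cv.Laplacian(gray, cv.CV_64F).var()
    try:
        if sharpness < BLUR_THRESHOLD:
            raise ValueError("too blurry")
        output = crop_photo(im)  # hypothetical wrapper around the pipeline above
        cv.imwrite(str(OUTPUT_DIR / path.name), output)
    except Exception:
        shutil.copy(path, REJECTED_DIR / path.name)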