How can I read a full sequence of digits using tesseract instead of first digit only

Question

I have the following clear binary image, and I want to read the digits using tesseract. My problem is that tesseract is reading only the first digit (5)! How can I make tesseract read the full sequence?

Output: 5

Accepted Answer

You will have to do some amount of preprocessing before you push the image directly to pytesseract for extraction of text. One thing that comes to mind is using binary_fill_holes to fill the area inside edges. Here is an example of what you can do.

    from skimage import io, util, feature
    from scipy import ndimage as ndi
    import matplotlib.pyplot as plt
    import pytesseract
    import numpy as np

    # Import image
    img = io.imread('jbAsM.jpg', as_gray=True)

    # Preprocessing
    imginv = util.invert(img)  # Invert image

    # Loop and fill holes iteratively
    for i in range(2):
        edges = feature.canny(imginv)          # find edges
        imginv = ndi.binary_fill_holes(edges)  # fill holes in edges

    fill_inv = util.invert(imginv)  # invert again
    plt.imshow(fill_inv, cmap='gray')

    # Image to text
    text = pytesseract.image_to_string(fill_inv, config='outputbase digits')
    print('Extracted Text ->', text)

Output:

    Extracted Text -> 5113

EDIT: No idea why pytesseract is predicting the last digit as 3 (weird!!)

You will have to find your own preprocessing pipeline that suits the other images. I would recommend looking at image segmentation and edge filling methods.
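As a complement to the preprocessing, here is a minimal sketch of a digits-only read. It is not part of the answer above: it assumes the digits sit on a single line, so it passes Tesseract's `--psm 7` (treat the image as a single text line) together with the `tessedit_char_whitelist` variable. The helper names `digit_config`, `keep_digits` and `read_digits` are hypothetical, and whether the whitelist is honoured can depend on the Tesseract version and engine, which is why the output is filtered again in Python.

```python
import re


def digit_config(psm: int = 7) -> str:
    """Build a Tesseract config string for a digits-only read.

    --psm 7 asks Tesseract to treat the image as a single text line
    (an assumption about the input image, not something the answer states).
    tessedit_char_whitelist restricts recognition to the characters 0-9.
    """
    return f"--psm {psm} -c tessedit_char_whitelist=0123456789"


def keep_digits(raw: str) -> str:
    """Drop everything except digits (spaces, newlines, form feeds, stray symbols)."""
    return re.sub(r"\D", "", raw)


def read_digits(image, psm: int = 7) -> str:
    """Hypothetical helper: OCR an already preprocessed image, return only digits."""
    # Imported lazily so the two helpers above can be used without pytesseract.
    import pytesseract

    raw = pytesseract.image_to_string(image, config=digit_config(psm))
    return keep_digits(raw)
```

With the preprocessing from the answer in place, this could be tried as `read_digits(fill_inv)`. If the result is still wrong, the preprocessing is the more likely culprit than the config.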

Answer