Image pre-processing for Tesseract
Posted: 2016-10-28T11:02:46-07:00
Hello Everybody! I'm new in the forum, I mean, I have been using it but never posted before.
I'm trying to find the best way of pre-processing an image/photo of handwritting text to then give it to tesseract.
My objective is the background to be white and the characters to be black, but with a nice shape, and without thouse black dots/pixels in random places that shouldn't be there.
I have tried some basics thresholdings but the problem is that the binary one (this gave me the best result) some of the times adds a lot of black pixels where it shouldn't, and also only doing thresholding does not enhace the characters shape.
I have found this android market application: OCR Instantly Free. This app does a very pretty good enhace of the image and what I would love to know how does it does it! Any ideas how can I achive something like it?
The pretty interesting tool is that the user can change dinamically and in real-time two values named by the app like "Exposure" and "Noise reduction" but I'm not sure witch image magick parameters this are.
Hope you can help me!
Regards,
A new user (:
I'm trying to find the best way of pre-processing an image/photo of handwritting text to then give it to tesseract.
My objective is the background to be white and the characters to be black, but with a nice shape, and without thouse black dots/pixels in random places that shouldn't be there.
I have tried some basics thresholdings but the problem is that the binary one (this gave me the best result) some of the times adds a lot of black pixels where it shouldn't, and also only doing thresholding does not enhace the characters shape.
I have found this android market application: OCR Instantly Free. This app does a very pretty good enhace of the image and what I would love to know how does it does it! Any ideas how can I achive something like it?
The pretty interesting tool is that the user can change dinamically and in real-time two values named by the app like "Exposure" and "Noise reduction" but I'm not sure witch image magick parameters this are.
Hope you can help me!
Regards,
A new user (: