I'm trying to find the best way to pre-process an image/photo of handwritten text before passing it to Tesseract.
My goal is a white background with black characters that have a clean shape, without those stray black dots/pixels in random places where they shouldn't be.
I have tried some basic thresholding, but the problem is that binary thresholding (which gave me the best result) sometimes adds a lot of black pixels where it shouldn't, and thresholding alone does not enhance the shape of the characters.
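To make the thresholding problem concrete, here is a minimal sketch in plain Python (no imaging library) of a global binary threshold next to a local-mean (adaptive) threshold, which is one common alternative when the lighting is uneven. The function names, the `radius` and `offset` parameters, and the tiny test image are all made up for illustration; this is an assumption about what might help, not a confirmed fix.

```python
def global_threshold(img, t=128):
    """Binary threshold: black (0) where the pixel is darker than t, else white (255)."""
    return [[0 if p < t else 255 for p in row] for row in img]


def adaptive_threshold(img, radius=1, offset=10):
    """Local-mean threshold: a pixel turns black only if it is clearly darker
    (by more than `offset`) than the mean of its surrounding window."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = [
                img[j][i]
                for j in range(max(0, y - radius), min(h, y + radius + 1))
                for i in range(max(0, x - radius), min(w, x + radius + 1))
            ]
            mean = sum(window) / len(window)
            row.append(0 if img[y][x] < mean - offset else 255)
        out.append(row)
    return out


# Hypothetical grayscale image: the paper gets darker from left to right
# (a shadow), with one ink pixel in the bright part and one in the dark part.
img = [
    [220, 200, 180, 160, 140, 120, 100],
    [220,  90, 180, 160, 140,  30, 100],
    [220, 200, 180, 160, 140, 120, 100],
]

global_result = global_threshold(img)      # shadowed paper also turns black
adaptive_result = adaptive_threshold(img)  # only the two ink pixels turn black
```

With the global threshold, the two rightmost columns of paper come out black because they are darker than 128, which looks like the unwanted black pixels described above. The local-mean version keeps only the two ink pixels, although on a hard shadow edge it can still produce artifacts.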
I have found this Android Market application: OCR Instantly Free. This app does a very good job of enhancing the image, and I would love to know how it does it! Any ideas on how I can achieve something similar?
A particularly interesting feature is that the user can change two values dynamically and in real time, which the app calls "Exposure" and "Noise reduction", but I'm not sure which ImageMagick parameters these correspond to.
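I don't know what the app does internally. As a guess at what a "Noise reduction" control could be doing, here is a minimal sketch of a median filter in plain Python, which removes isolated specks from an already binarized image. The function name, the `radius` parameter, and the sample image are hypothetical, and the mapping to the app's slider is purely an assumption.

```python
def median_filter(img, radius=1):
    """Replace each pixel with the median of its (2*radius+1)-wide window.
    Windows are clipped at the image border."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = sorted(
                img[j][i]
                for j in range(max(0, y - radius), min(h, y + radius + 1))
                for i in range(max(0, x - radius), min(w, x + radius + 1))
            )
            row.append(window[len(window) // 2])
        out.append(row)
    return out


W, B = 255, 0  # white paper, black ink

# Hypothetical binarized image: a 3-pixel-wide vertical stroke on the right
# and one isolated black speck on the left edge (row 2, column 0).
noisy = [
    [W, W, W, B, B, B],
    [W, W, W, B, B, B],
    [B, W, W, B, B, B],
    [W, W, W, B, B, B],
    [W, W, W, B, B, B],
]

cleaned = median_filter(noisy)  # the speck disappears, the stroke stays
```

The tradeoff is that a larger `radius` removes bigger specks but also eats away thin strokes, so a one-pixel-wide pen line would not survive this filter.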
Hope you can help me!
Regards,
A new user (: