Grayscale background removal for OCR
Posted: 2019-03-19T12:34:02-07:00
Hello everyone!
I'm having the following issue:
I need to prepare a huge quantity of images for OCR reading. Problem is, the information is written in a part of a document that has a grayscale background. Also, the scanner that produced this image did so in color, so when the OCR tries to read it, it make a mess out of it.
I tried the following commands, but the result came with a lot of noise, where the grayscale background used to be.
The original image: https://drive.google.com/open?id=1ugHSQ ... zOYsahLVE9
The result: https://drive.google.com/open?id=1P1j9B ... k3284sFBVk
I've also tried converting to B&W and then blurring a bit to avoid too much pixalated images, but the results were inconsistent.
Any suggestions in order to make it cleaner for OCR reading ?
Thanks!
IM version: 7.0.8-27-Q16-x64
OS: Windows 7 Pro 64-bits.
Version date: 2019-01-27.
I'm having the following issue:
I need to prepare a huge quantity of images for OCR reading. Problem is, the information is written in a part of a document that has a grayscale background. Also, the scanner that produced this image did so in color, so when the OCR tries to read it, it make a mess out of it.
I tried the following commands, but the result came with a lot of noise, where the grayscale background used to be.
Code: Select all
convert 0068_example.jpg -type Grayscale -brightness-contrast +15x100 0068_result.jpg
The result: https://drive.google.com/open?id=1P1j9B ... k3284sFBVk
I've also tried converting to B&W and then blurring a bit to avoid too much pixalated images, but the results were inconsistent.
Any suggestions in order to make it cleaner for OCR reading ?
Thanks!
IM version: 7.0.8-27-Q16-x64
OS: Windows 7 Pro 64-bits.
Version date: 2019-01-27.