Removing the background
Posted: 2015-07-07T04:37:03-07:00
Hello
I am scanning a library card catalogue and there has been a problem with the scanner. The result has been that the card has been scanned but it is a small part of the resulting scanned image.
To further complicate matters the card does not always appear in the same spot in the scanned image.
The problem we have is that we are aiming to run these through an OCR so that we can produce a text file. When i have attempted to run them through an OCR the results are rubbish. If i manually remove the unwanted area and then re read them the resulting text file is far better.
I don't really want to re scan the cards if i can help it so was wondering is there are any ways i can remove the unwanted areas in batch mode using imagemagick.
The background is not a uniform colour, it is generally darker than the card image, which is almost always white, with a few blue and purple cards thrown in.
Any ideas? Or do i go back to the scanner and try and sort out why it is producing the images this way?
I am scanning a library card catalogue and there has been a problem with the scanner. The result has been that the card has been scanned but it is a small part of the resulting scanned image.
To further complicate matters the card does not always appear in the same spot in the scanned image.
The problem we have is that we are aiming to run these through an OCR so that we can produce a text file. When i have attempted to run them through an OCR the results are rubbish. If i manually remove the unwanted area and then re read them the resulting text file is far better.
I don't really want to re scan the cards if i can help it so was wondering is there are any ways i can remove the unwanted areas in batch mode using imagemagick.
The background is not a uniform colour, it is generally darker than the card image, which is almost always white, with a few blue and purple cards thrown in.
Any ideas? Or do i go back to the scanner and try and sort out why it is producing the images this way?