Convert PDF to TIFF large file size
Posted: 2018-09-18T08:28:40-07:00
My company is moving away from using an application called SimpleIndex that could OCR files for scanned images. I'm testing ImageMagicK/Tesseract OCR (hopefully with PHP to get the job done). Initially, we have a PDF that has several scanned images combined together. I then use this command line to use ImageMagicK to convert the PDF file to a TIF.
For more info, next, I use tesseract to OCR the TIF file and output it as a PDF.
Code: Select all
magick.exe convert -strip -alpha off -density 300 100492.PDF -depth 2 -quality 100 -compress zip 100492.TIF
- The original PDF size is at 2,573 KB.
- After ImageMagicK it goes up 4,219 KB.
For more info, next, I use tesseract to OCR the TIF file and output it as a PDF.
- The end result is a 7,208 KB PDF.
- This is more than double the size of the SimpleIndex file which is at 3,589 KB.