what is the best format for scanned documents
Posted by komar77r on June 23rd, 2005


that contain text and a lot of pictures?
i heard there is such a thing that can automatically choose
the best algorithm for different parts of an image, that is,
text is compressed in grayscale in a way similar to gif and
pictures are compressed in a way similar to jpeg, and all this is
contained in one file

i'm especially interested in these formats that are supported
by IrfanView:
ECW - Enhanced Compressed Wavelet
JP2 - Jpeg 2000
JPM
LDF - Lura Document Format
LWF - Lura Wave Format

so... which is the best?
[[ i don't need a lossless format, i simply want to pack as much
data as possible with a high compression rate, regardless
of time and processing power ]]

Posted by Gerard Bok on June 23rd, 2005


On Thu, 23 Jun 2005 14:57:51 +0200, komar77r <komar77r@o2.pl>
wrote:

Why not use TIFF with fax group 4 compression ?
More or less the industry standard :-)

--
Kind regards,
Gerard Bok

Posted by komar77r on June 23rd, 2005


On Thu, 23 Jun 2005 15:30:31 GMT, bok118@zonnet.nl (Gerard
Bok) wrote:

but tiffs are (as far as i know) veeeery huge...

Posted by Gerard Bok on June 23rd, 2005


On Thu, 23 Jun 2005 20:05:50 +0200, komar77r <komar77r@o2.pl>
wrote:

No. Tiff is just a name for a way to write files.

They can be uncompressed and huge, or well compressed, as with fax
Group 4: A4 or letter-size business documents average about 35 KB at
300 dpi B/W.
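
As a rough sanity check on those numbers, here is a small Python sketch that works out how big the same page would be as an uncompressed 1-bit bitmap. The 35 KB value is the average quoted above; the function name and the row padding to whole bytes are assumptions made for the illustration, not something taken from the TIFF specification.

```python
# Rough size estimate for a 1-bit (B/W) A4 scan at 300 dpi.
# 35 KB is the Group 4 average quoted in this post; the rest is arithmetic.
A4_MM = (210, 297)   # A4 page size in millimetres
DPI = 300
MM_PER_INCH = 25.4

def uncompressed_bilevel_bytes(width_mm, height_mm, dpi):
    """Bytes for a 1-bit-per-pixel page, each row padded to a whole byte."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    bytes_per_row = (width_px + 7) // 8
    return bytes_per_row * height_px

raw = uncompressed_bilevel_bytes(*A4_MM, DPI)
g4 = 35 * 1024                      # quoted Group 4 average, in bytes
ratio = raw / g4
print(f"uncompressed: {raw / 1024:.0f} KB, Group 4: 35 KB, ratio about {ratio:.0f}:1")
```

So the uncompressed page is roughly 1 MB, and a 35 KB Group 4 file is about a 30:1 reduction. Keep in mind this only applies to pure black-and-white pages.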

--
Kind regards,
Gerard Bok

Posted by spoon2001 on June 24th, 2005


What you are looking for is a format that stores text as a bilevel
(1-bit) image and graphics in 24-bit color, all in the same file. This is
called "mixed raster content" (MRC).

This is supported in PDF files using a method called "Adaptive Compression".

It is also supported by DjVu and in JPM files. JPM is "JPEG 2000 - Part 6".
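
To make the mixed raster content idea concrete, here is a toy Python sketch of the split into a 1-bit text mask and a color picture layer. It is only an illustration under simple assumptions: the function names, the luminance threshold of 64, and the tiny sample image are all made up, and real encoders segment the page far more carefully (they also usually fill the text holes with nearby colors rather than plain white).

```python
# Toy mixed raster content (MRC) split: 1-bit text mask + color background.
# Threshold and sample pixels are hypothetical values for illustration only.

def luminance(rgb):
    """Approximate perceived brightness of an (R, G, B) pixel, 0..255."""
    r, g, b = rgb
    return 0.299 * r + 0.587 * g + 0.114 * b

def split_mrc(pixels, threshold=64):
    """Split rows of RGB tuples into (mask, background).

    mask[y][x] is 1 where the pixel is dark enough to count as text.
    background keeps the color pixels, with text pixels replaced by white
    so the picture layer has no sharp text edges left in it.
    """
    mask, background = [], []
    for row in pixels:
        mask_row = [1 if luminance(p) < threshold else 0 for p in row]
        mask.append(mask_row)
        background.append(
            [(255, 255, 255) if m else p for p, m in zip(row, mask_row)]
        )
    return mask, background

sample = [
    [(0, 0, 0), (255, 255, 255), (200, 30, 30)],
    [(10, 10, 10), (30, 120, 200), (250, 250, 250)],
]
mask, background = split_mrc(sample)
print(mask)
print(background)
```

In a real file each layer would then go through its own compressor: a bilevel coder for the mask and a lossy photographic coder for the background. That is where the size savings come from.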

Check out these webpages:
http://www.planetdjvu.com/djvu_vs__p...rt_6__jpm_.htm
http://www.planetdjvu.com/pdf_adapti...se_to_djvu.htm
http://www.searchpdf.com/presentatio...e thods_t.htm
http://www.planetdjvu.com/the_mrc__m...l_and_djvu.htm

It's been a while since I looked into this, so I'm sure there have been
new developments.
