I would like to compress scanned text (monochrome or few colours) and store it in pdf (maybe djvu) files. I remember that I got very good results with Windows/Acrobat and "ZRLE" compressed monochrome tiff embedded into pdf. The algorithm was loossless as far as I remember. Now I search a way to obtain good results on linux. It should be storage saving and avoid loss (I do not mind loosing colours, but I do not want e.g. jpeg compression which would create noisy results for text scans). I need it for batch conversion, so I was thinking of the ImageMagick convert command. But which output format should I use so I get good results and to be able to embed it into pdf files (for example using pdflatex)? Or is it generally better to use djvu files?
efficient image compression for pdf embedding with linux
2.6k Views Asked by highsciguy At
2
There are 2 best solutions below
1
yms
On
DJVU is not a bad choice, but if you want to stay in PDF for better compatibility you may want to look into lossless JBIG2 compression.
Quote from Wikipedia:
Overall, the algorithm used by JBIG2 to compress text is very similar to the JB2 compression scheme used in the DjVu file format for coding binary images.
Related Questions in LINUX
- How do I recursively find and replace only in files named index.php on Linux webserver?
- passing text with \n as one argument in shell
- kernel module does not print packet info
- How to send ESC/POS commands to thermal printer in Linux
- (x64 Nasm) Writeline function on Linux
- How do I set the Hive user to something different than the Spark user from within a Spark program?
- Default priority of thread with SCHED_FIFO
- Calling a python function with options from shell script
- How to split a directory into parts without compressing or archiving?
- Cross compile simple standard C program on Linux for Mac
- How to offload NAPI poll function to workqueue
- python netifaces - How to get currently used network interface
- Unexpected output from function
- mingw-64 conflicting declarations when cross-compiling
- Different behavior of async with Visual Studio 2013(Windows8.1) and GCC 4.9(Ubuntu14.10)
Related Questions in PDF
- Itext get special letters from pdf
- Carrierwave file upload with different file types
- Get text from a section of a pdf page with IcePdf
- itext pdf to image convert
- PDF to Text extractor in nodejs without OS dependencies
- PDF to ByteArray Conversion
- Opening PDF file in SWT Browser - XulRunner default viewer
- Generate TCPDF output to a shared drive folder
- Combine base and ggplot graphics in R figure window in different pages
- Updating a PDF Barcode Field in iOS and Android Device
- Prevent PDFsharp from saving an image file?
- Adding attachment links between lines in itext for pdf
- Crop Pdf from each edge using itextshap
- How to create a PDF with iText+XMLWorker from servlet using custom font?
- how to create a pdf editor for grails
Related Questions in IMAGE-PROCESSING
- Need help in detecting multiple blobs
- Image based steganography that survives resizing?
- WinRT Extract Thumbnail from RAW image format
- 3D B-Spline approximation
- Qualitative and Quantitative analysis of filtered back projection / iradon in matlab
- How to detect objects in an image based on colour?
- How to make sense (handle) when computes logarithm of zero in prior information
- scilab - Drawing bounding box
- Lowpass filter non working
- Get a single line representation for multiple close by lines clustered together in opencv
- error while drawing several x-marks on a binary image in matlab
- Which method should I use to find gradient direction of pixels in an image?
- Finding Circle Boundary Pixels Coordinates and RGB Intensity Values from An RGB Input Image in Matlab
- using SURF for handdetection
- Using only one tool from CLImageEditor
Related Questions in IMAGEMAGICK
- Imagemagick crop while keeping size
- Why is the following convert command resulting in Segmentation fault?
- Use GM in node.js to crop to a circled image
- JMagick: Can't Find Dependent Libraries
- PHP ImageMagick - readimage not working for certain pdfs
- How to find the brightest pixel?
- Error converting PDF to PNG Imagemagick: unable to create temporary file
- How to process each pixel of the image use Magick.NET maximally quickly?
- Create two sizes of an image using one ImageMagick command
- Protecting against user injection, running imagemagik from through command line
- Weird Behaviour with New Lines Using grunt-exec
- rake db:setup -> rake aborted! Errno::ENOENT: No such file or directory - identify
- bad access in a simple multi-thread programm on MAC OS X Yosemite
- I can't convert with ImageMagick
- Use ImageMagick to extend canvas and not have transparent section become the background colour of the new image
Related Questions in DJVU
- VB.NET set embedded object src to byte array? dynamically set src value
- DjVu thumbnails are not being generated at all
- Extract current page from Adobe Acrobat Reader and DejaVu Reader processes?
- Highlight text in djvu document
- Why there is no pure java djvu encoders?
- DJVU reader from scratch. C#
- Add background to bitonal djvu file
- PyQt4 - python-djvulibre Segmentation Fault
- Is there a good quality djvu to pdf converter out there?
- ArrayBuffer to blob conversion
- Run djvulibre on iPhone?
- efficient image compression for pdf embedding with linux
- Ghostscript (MinGW compile) issue with pipes
- imagemagick djvu delegate missing
- How to access the custom class members from a call back method
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
jbig2enc encoder for images using jbig2 compression, was originally written for GoogleBooks by Adam Langley
https://github.com/agl/jbig2enc
I forked to include latest improvements By Rubypdf and others
https://github.com/DingoDog/jbig2enc
I also built several binaries of jbig2enc for puppy linux (it can be working also on other distributions)
http://dokupuppylinux.info/programs:encoders