Mirror of https://github.com/ckolivas/lrzip.git, synced 2025-12-06 07:12:00 +01:00
Full Rewrite (except for FAQ, minor changes there)
- Cleanup
- Rewrite of most of document
- Added GitHub Flavored Markdown
- Contributors Table
- Tl;DR
- Misc., etc.
This commit is contained in:
parent acf81ead70
commit c1f445308c
356	README

@@ -1,356 +0,0 @@
lrzip README

Long Range ZIP or Lzma RZIP

This is a compression program optimised for large files. The larger the file
and the more memory you have, the better the compression advantage this will
provide, especially once the files are larger than 100MB. The advantage can
be chosen to be either size (much smaller than bzip2) or speed (much faster
than bzip2).

Quick lowdown of the most used options:

lrztar directory
This will produce an archive directory.tar.lrz compressed with lzma.

lrzuntar directory.tar.lrz
This will completely extract an archived directory.

lrzip filename
This will produce an archive filename.lrz compressed with lzma (best all
round), giving slow compression and fast decompression.

lrzip -z filename
This will produce an archive filename.lrz compressed with ZPAQ, giving extreme
compression but taking ages to compress and decompress.

lrzip -l filename
This will produce an archive filename.lrz compressed with LZO, giving very
fast compression and fast decompression.

lrunzip filename.lrz
This will decompress filename.lrz into filename.

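One more option worth knowing, assuming your build is recent enough to have
the -t option:

lrzip -t filename.lrz
This will test that an archive decompresses correctly without writing the
output anywhere.
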
Lrzip uses an extended version of rzip which does a first pass long distance
redundancy reduction. The lrzip modifications make it scale according to
memory size.

The data is then either:
1. Compressed by lzma (default), which gives excellent compression
   at approximately twice the speed of bzip2 compression
2. Compressed by a number of other compressors chosen for different reasons,
   in order of likelihood of usefulness:
   2a. ZPAQ: Extreme compression up to 20% smaller than lzma but ultra slow
       at compression AND decompression.
   2b. LZO: Extremely fast compression and decompression which on most machines
       compresses faster than disk writing, making it as fast (or even faster)
       than simply copying a large file.
   2c. GZIP: Almost as fast as LZO but with better compression.
   2d. BZIP2: A de facto linux standard of sorts, the middle ground between
       lzma and gzip, and neither here nor there.
3. Leaving it uncompressed and rzip prepared. This form improves substantially
   any compression performed on the resulting file in both size and speed (due
   to the nature of rzip preparation merging similar compressible blocks of
   data and creating a smaller file). By "improving" I mean it will either
   speed up the very slow compressors with minor detriment to compression, or
   greatly increase the compression of simple compression algorithms.

The major disadvantages are:
1. The main lrzip application only works on single files, so it requires the
   lrztar wrapper to fake a complete archiver.
2. It requires a lot of memory to get the best performance out of it, and is
   not really usable (for compression) with less than 256MB. Decompression
   requires less ram and works on smaller ram machines. Sometimes swap may
   need to be enabled on these lower ram machines for the operating system to
   be happy.
3. STDIN/STDOUT works fine on both compression and decompression, but larger
   files compressed in this manner will end up being less efficiently
   compressed.

The unique feature of lrzip is that it tries to make the most of the available
ram in your system at all times for maximum benefit. It does this by default,
choosing the largest sized window possible without running out of memory. It
also has a unique "sliding mmap" feature which makes it possible to use even
a compression window larger than your ramsize, if the file is that large. It
does this (with the -U option) by implementing one large mmap buffer as per
normal, and a smaller moving buffer to track which part of the file is
currently being examined, emulating a much larger single mmapped buffer.
Unfortunately this mode can be many times slower.

See the file README.benchmarks in doc/ for performance examples and what kind
of data lrzip is very good with.

Requires:
  pthreads
  liblzo2-dev
  libbz2-dev
  libz-dev
  libm
  tar
  (nasm on 32bit x86)

To build/install:
  ./configure
  make
  make install

To build from the git repository do:
  ./autogen.sh
before the above steps.

FAQS.

Q. What encryption does lrzip use?
A. Lrzip uses the best current proven technologies to achieve high grade
password protected encryption. It uses a combination of sha512 to multiply
hash the password with random salt, and aes128 to do block encryption of each
block of data with more random salt. The amount of initial hashing of the
password increases by the date an lrzip archive is encrypted according to
Moore's law, making it harder each year to brute force attack the password,
to keep up with the increasing computing power each year. It is virtually
guaranteed that the same file encrypted with the same password will never
be the same twice. The weakest link in this encryption mode by far is the
password chosen by the user. There is currently no known attack or backdoor
for this encryption mechanism, and there is absolutely no way of retrieving
your password should you forget it.

Q. How do I make a static build?
A. ./configure --enable-static-bin

Q. I want the absolute maximum compression I can possibly get, what do I do?
A. Try the command line options "-Uzp 1 -L 9". This uses all available ram and
ZPAQ compression, and even uses a compression window larger than you have ram.
The -p 1 option disables multithreading, which improves compression but at the
expense of speed. Expect it to take many times longer.

Q. I want the absolute fastest decent compression I can possibly get.
A. Try the command line option -l. This will use the lzo backend compression,
and level 7 compression (1 isn't much faster).

Q. How much slower is the unlimited mode?
A. It depends on 2 things. First, just how much larger than your ram the file
is, as the bigger the difference, the slower it will be. The second is how much
redundant data there is. The more there is, the slower, but ultimately the
better the compression. Why isn't it on by default? If the compression window
is a LOT larger than ram, with a lot of redundant information it can be
drastically slower. I may revisit this possibility in the future if I can make
it any faster.

Q. Can I use your tool for even more compression than lzma offers?
A. Yes, the rzip preparation of files makes them more compressible by most
other compression techniques I have tried. Using the -n option will generate
a .lrz file smaller than the original which should be more compressible, and
since it is smaller it will compress faster than it otherwise would have.

Q. 32bit?
A. 32bit machines have a limit of 2GB sized compression windows due to
userspace limitations on mmap and malloc, so even if you have much more ram
you will not be able to use compression windows larger than 2GB. Also you
may be unable to decompress files compressed on 64bit machines which have
used windows larger than 2GB.

Q. How about 64bit?
A. 64bit machines, with their ability to address massive amounts of ram, will
excel with lrzip due to being able to use compression windows limited in size
only by the amount of physical ram.

Q. Other operating systems?
A. The code is POSIXy with GNU extensions. Patches are welcome. Version 0.43+
should build on MacOSX 10.5+.

Q. Does it work on stdin/stdout?
A. Yes it does. Compression and decompression work well to/from STDIN/STDOUT.
However, because lrzip does multiple passes on the data, it has to store a
large amount in ram before it dumps it to STDOUT (and vice versa), thus it
is unable to work with the massive compression windows regular operation
provides. Thus the compression afforded on files larger than approximately
25% RAM size will be less efficient (though still benefiting compared to
traditional compression formats).

Q. I have another compression format that is even better than zpaq, can you
use that?
A. You can use it yourself on rzip prepared files (see above). Alternatively,
if the source code is compatible with the GPL license, it can be added to the
lrzip source code. Libraries with functions similar to the compress() and
decompress() functions of zlib would make the process most painless. Please
tell me if you have such a library so I can include it :)

Q. What's this "Starting lzma back end compression thread..." message?
A. While I'm a big fan of progress percentage being visible, unfortunately
lzma compression can't currently be tracked when handing over 100+MB chunks
to the lzma library. Therefore you'll see progress percentage only until
each chunk is handed over to the lzma library.

Q. What's this "lzo testing for incompressible data" message?
A. Other compression is much slower, and lzo is the fastest. To help speed up
the process, lzo compression is performed on the data first to test that the
data is at all compressible. If a small block of data is not compressible, it
tests progressively larger blocks until it has tested all the data (if it fails
to compress at all). If no compressible data is found, then the subsequent
compression is not even attempted. This can save a lot of time during the
compression phase when there is incompressible data. Theoretically it may be
possible that data is compressible by the other backend (zpaq, lzma etc) and
not at all by lzo, but in practice such data achieves only minuscule amounts
of compression which are not worth pursuing. Most of the time it is clear one
way or the other that data is compressible or not. If you wish to disable this
test and force it to try compressing it anyway, use -T.

Q. I have truckloads of ram so I can compress files much better, but can my
generated file be decompressed on machines with less ram?
A. Yes. Ram requirements for decompression go up only by the -L compression
option with lzma and are never anywhere near as large as the compression
requirements. However, if you're on 64bit and you use a compression window
greater than 2GB, it might not be possible to decompress it on 32bit machines.

Q. Why are you including bzip2 compression?
A. To maintain a similar compression format to the original rzip (although the
other modes are more useful).

Q. What about multimedia?
A. Most multimedia is already in a heavily compressed "lossy" format which by
its very nature has very little redundancy. This means that there is not
much that can actually be compressed. If your video/audio/picture is in a
high bitrate, there will be more redundancy than in a low bitrate one, making
it more suitable to compression. None of the compression techniques in lrzip
are optimised for this sort of data. However, the nature of rzip preparation
means that you'll still get better compression than most normal compression
algorithms give you if you have very large files. ISO images of dvds for
example are best compressed directly instead of as individual .VOB files. ZPAQ
is the only compression format that can do any significant compression of
multimedia.

Q. Is this multithreaded?
A. As of version 0.540, it is HEAVILY multithreaded with the back end
compression and decompression phase, and will continue to process the rzip
pre-processing phase, so when using one of the more CPU intensive backend
compressions like lzma or zpaq, SMP machines will show massive speed
improvements. Lrzip will detect the number of CPUs to use, but it can be
overridden with the -p option if slightly better compression is desired
more than speed. -p 1 will give the best compression but also be the slowest.

Q. This uses heaps of memory, can I make it use less?
A. Well, you can by setting -w to the lowest value (1), but the huge use of
memory is what makes the compression better than ordinary compression
programs, so it defeats the point. You'll still derive benefit with -w 1, but
not as much.

Q. What CFLAGS should I use?
A. With a recent enough compiler (gcc>4), set both CFLAGS and CXXFLAGS to:
-O2 -march=native -fomit-frame-pointer

Q. What compiler does this work with?
A. It has previously been tested successfully on gcc, ekopath and the intel
compiler. Whether the commercial compilers help or not, I could not tell you.

Q. What codebase are you basing this on?
A. rzip v2.1 and lzma sdk920, but it should be possible to stay in sync with
each of these in the future.

Q. Do we really need yet another compression format?
A. It's not really a new one at all; simply a reimplementation of a few very
good performing ones that will scale with memory and file size.

Q. How do you use lrzip yourself?
A. Three basic uses. I compress large files currently on my drive with the
-l option since it is so quick to get a space saving. When archiving data for
permanent storage I compress it with the default options. When compressing
small files for distribution I use the -z option for the smallest possible
size.

Q. I found a file that compressed better with plain lzma. How can that be?
A. When the file is more than 5 times the size of the compression window
you have available, the efficiency of rzip preparation drops off as a means
of getting better compression. Eventually, when the file is large enough,
plain lzma compression will get better ratios. The lrzip compression will be
a lot faster though. The only way around this is to use as large a compression
window as possible, with the -U option.

Q. Can I use swapspace as ram for lrzip with a massive window?
A. It will indirectly do this with -U (unlimited) mode enabled. This mode will
make the compression window as big as the file itself, no matter how big it
is, but it will slow down proportionately, the larger the file is relative to
your ram.

Q. Why do you nice it to +19 by default? Can I speed up the compression by
changing the nice value?
A. This is a common misconception about what nice values do. They only tell
the cpu process scheduler how to prioritise workloads, and if your application
is the _only_ thing running, it will be no faster at nice -20 nor any slower
at +19.

Q. What is the LZO Testing option, -T?
A. LZO testing is normally performed for the slower back-end compression of
LZMA and ZPAQ. The reasoning is that if it is completely incompressible by LZO
then it will also be incompressible by them. Thus, if a block fails to be
compressed by the very fast LZO, lrzip will not attempt to compress that block
with the slower compressor, thereby saving time. If this option is enabled, it
will bypass the LZO testing and attempt to compress each block regardless.

Q. Compression and decompression progress on large archives slows down and
speeds up. There's also a jump in the percentage at the end?
A. Yes, that's the nature of the compression/decompression mechanism. The jump
is because the rzip preparation makes the amount of data the compression
backend (lzma) needs to compress much smaller.

Q. Tell me about patented compression algorithms, GPL, lawyers and copyright.
A. No

Q. I receive an error "LZMA ERROR: 2. Try a smaller compression window."
What does this mean?
A. LZMA requests large amounts of memory. When a higher compression window is
used, there may not be enough contiguous memory for LZMA. LZMA may request
up to 25% of TOTAL ram depending on compression level. If contiguous blocks
of memory are not free, LZMA will return an error. This is not a fatal
error, and a backup mode of compression will be used.

Q. Where can I get more information about the internals of LZMA?
A. See http://www.7-zip.org and http://www.p7zip.org. Also, see the file
./lzma/C/lzmalib.h, which explains the LZMA properties used and the LZMA
memory requirements and computation.

Q. This version is much slower than the old version?
A. Make sure you have set CFLAGS and CXXFLAGS. An unoptimised build will be
almost 3 times slower.

LIMITATIONS
Due to mmap limitations the maximum size a window can be set to is currently
2GB on 32bit unless the -U option is specified. Files generated on 64 bit
machines with windows >2GB in size might not be decompressible on 32bit
machines. Large files might not decompress on machines with less RAM if SWAP
is disabled.

BUGS:
Probably lots. Tell me if you spot any :) Any known ones should be documented
in the file BUGS.

Links:
rzip: http://rzip.samba.org/
lzo: http://www.oberhumer.com/opensource/lzo/
lzma: http://www.7-zip.org/
zpaq: http://mattmahoney.net/dc/

Thanks to Andrew Tridgell for rzip. Thanks to Markus Oberhumer for lzo.
Thanks to Igor Pavlov for lzma. Thanks to Jean-loup Gailly and Mark Adler
for the zlib compression library. Thanks to Christian Leber for the lzma
compat layer, Michael J Cohen for Darwin support, Lasse Collin for fixes
to LZMALib.cpp and for Makefile.in suggestions, and everyone else who coded
along the way. Huge thanks to Peter Hyman for most of the 0.19-0.24 changes,
the update to the multithreaded lzma library, and all sorts of other
features. Thanks to René Rhéaume for fixing executable stacks and to
Ed Avis for various fixes. Thanks to Matt Mahoney for zpaq code. Thanks to
Jukka Laurila for Darwin support. Thanks to George Makrydakis for lrztar.
Thanks to Ulrich Drepper for his md5 implementation. Thanks to Michael
Blumenkrantz for the new configuration tools. Thanks to the PolarSSL authors
for encryption code. Thanks to Serge Belyshev for extensive help, advice, and
patches to implement encryption, and to Michael Blumenkrantz also for
liblrzip.

Con Kolivas <kernel@kolivas.org>
Sat, 11 March 2011

Also documented by
Peter Hyman <pete@peterhyman.com>
Sun, 04 Jan 2009

476	README.md	(new file)

@@ -0,0 +1,476 @@
lrzip - Long Range ZIP or LZMA RZIP
===================================

A compression utility that excels at compressing large files (usually > 10-50 MB).
Larger files and/or more free RAM means that the utility will be able to more
effectively compress your files (ie: faster / smaller size), especially if the
filesize(s) exceed 100 MB. You can either choose to optimize for speed (fast
compression / decompression) or size, but not both.

### haneefmubarak's TL;DR for the long explanation:

Just change the word `directory` to the name of the directory you wish to compress.

#### Compression:

```bash
lrzdir=directory; tar cvf $lrzdir.tar $lrzdir; lrzip -Ubvvp `nproc` -S .bzip2-lrz -L 9 $lrzdir.tar; rm -fv $lrzdir.tar; unset lrzdir
```

`tar`s the directory, then maxes out all of the system's processor cores
along with sliding window RAM to give the best **BZIP2** compression while being as fast as possible,
enables max verbosity output, attaches the extension `.bzip2-lrz`, and finally
gets rid of the temporary tarfile. Uses a tempvar `lrzdir`, which is unset at the end.

#### Decompression for the kind of file from above:

```bash
lrzdir=directory; lrunzip -cdivvp `nproc` -o $lrzdir.tar $lrzdir.tar.bzip2-lrz; tar xvf $lrzdir.tar; rm -vf $lrzdir.tar
```

Checks integrity, then decompresses the directory using all of the
processor cores for max speed, enables max verbosity output, unarchives
the resulting tarfile, and finally gets rid of the temporary tarfile. Uses the same kind of tempvar.

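The same round trip with the bundled wrappers (see the lrzip 101 table below)
is much shorter, at the cost of the wrappers choosing **LZMA** rather than
**BZIP2**:

```bash
lrztar directory            # -> directory.tar.lrz
lrzuntar directory.tar.lrz  # -> directory/
```
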
### lrzip build/install guide:

A quick guide on building and installing.

#### What you will need

- gcc
- bash or zsh
- pthreads
- tar
- libc
- libm
- libz-dev
- libbz2-dev
- liblzo2-dev
- coreutils
- nasm on 32bit x86 (not needed on x64)
- git, if you want a repo-fresh copy
- an OS with the usual *nix headers and libraries

#### Obtaining the source

Two different ways of doing this:

Stable: a packaged tarball that is known to work:

Go to <https://github.com/ckolivas/lrzip/releases> and download the `tar.gz`
file from the top. `cd` to the directory you downloaded it to, and use `tar xvzf lrzip-X.X.tar.gz`
to extract the files (don't forget to replace `X.X` with the correct version). Finally, `cd`
into the directory you just extracted.

Latest: `git clone -v https://github.com/ckolivas/lrzip.git; cd lrzip`

#### Build

```bash
./autogen.sh
./configure
make -j `nproc` # maxes out all cores
```

#### Install

Simple 'n Easy™: `sudo make install`

### lrzip 101:

|Command|Result|
|------|------|
|`lrztar directory`|An archive `directory.tar.lrz` compressed with **LZMA**.|
|`lrzuntar directory.tar.lrz`|A directory extracted from an `lrztar` archive.|
|`lrzip filename`|An archive `filename.lrz` compressed with **LZMA**, meaning slow compression and fast decompression.|
|`lrzip -z filename`|An archive `filename.lrz` compressed with **ZPAQ**, which can give extreme compression but takes a bit longer than forever to compress and decompress.|
|`lrzip -l filename`|An archive lightly compressed with **LZO**, meaning really, really fast compression and decompression.|
|`lrunzip filename.lrz`|Decompress `filename.lrz` to `filename`.|

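To sanity-check an archive you just made, recent versions also offer `-i`
(archive info) and `-t` (integrity test); treat the exact flags as an
assumption and check `lrzip -h` on your build:

```bash
lrzip filename          # -> filename.lrz
lrzip -i filename.lrz   # show archive/compression information
lrzip -t filename.lrz   # test decompression without writing output
```
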
### lrzip internals

lrzip uses an extended version of [rzip](http://rzip.samba.org/), which does a first pass long distance
redundancy reduction. lrzip's modifications allow it to scale to accommodate various memory sizes.

Then, one of the following scenarios occurs:

- Compressed
  - (default) **LZMA** gives excellent compression @ ~2x the speed of bzip2
  - **ZPAQ** gives extreme compression while taking forever
  - **LZO** gives insanely fast compression that can actually be faster than simply copying a large file
  - **GZIP** gives compression almost as fast as LZO but with better compression
  - **BZIP2** is a de facto linux standard and hacker favorite which usually gives
    quite good compression (ZPAQ>LZMA>BZIP2>GZIP>LZO) while staying fairly fast (LZO>GZIP>BZIP2>LZMA>ZPAQ);
    in other words, a good middle ground and a good choice overall
- Uncompressed, in the words of the software's original author:

> Leaving it uncompressed and rzip prepared. This form improves substantially
> any compression performed on the resulting file in both size and speed (due to
> the nature of rzip preparation merging similar compressible blocks of data and
> creating a smaller file). By "improving" I mean it will either speed up the
> very slow compressors with minor detriment to compression, or greatly increase
> the compression of simple compression algorithms.
>
> (Con Kolivas, from the original lrzip README)

The only real disadvantages:

- The main program, lrzip, only works on single files, and therefore
  requires the use of the lrztar wrapper to fake a complete archiver.
- lrzip requires quite a bit of memory along with a modern processor
  to get the best performance in reasonable time. This usually means that
  it is somewhat unusable with less than 256 MB. Decompression, however,
  usually requires less RAM and can work on less powerful machines.
  On machines with little RAM, it may be a good idea to enable swap
  if you want to keep your operating system happy.
- Piping output to and/or from STDIN and/or STDOUT works fine with both
  compression and decompression, but larger files compressed this way will
  likely end up being compressed less efficiently. Decompression doesn't
  really have any issues with piping, though.

One of the more unique features of lrzip is that it will try to use all of
the available RAM as best it can at all times to provide maximum benefit. This
is the default operating method, where it will create and use the single
largest memory window that will still fit in available memory without freezing
up the system. It does this by `mmap`ing the small portions of the file that
it is working on. However, it also has a unique "sliding `mmap`" feature, which
allows it to use compression windows that far exceed the size of your RAM if
the file you are compressing is large. It does this by using one large `mmap`
along with a smaller moving `mmap` buffer to track the part of the file that
is currently being examined. From a higher level, this can be seen as simply
emulating a single, large `mmap` buffer. The unfortunate thing about this
feature is that it can become extremely slow. The counter-argument to
being slower is that it will usually give a better compression factor.

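The sliding `mmap` behaviour described above is what the `-U` (unlimited
window) switch enables:

```bash
lrzip -U verylargefile.img   # window may exceed physical RAM; can be much slower
```
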
The file `doc/README.benchmarks` has some performance examples to show
what kind of data lrzip is good with.

### FAQ

> Q: What kind of encryption does lrzip use?

> A: lrzip uses SHA2-512 repetitive hashing of the password along with a salt
> to provide a key which is used by AES-128 to do block encryption. Each block
> has more random salts added to the block key. The amount of initial hashing
> increases as the timestamp goes forward, in direct relation to Moore's law,
> which means that the amount of time required to encrypt/decrypt the file
> stays the same on a contemporary computer. It is virtually
> guaranteed that the same file encrypted with the same password will never
> be the same twice. The weakest link in this encryption mode by far is the
> password chosen by the user. There is currently no known attack or backdoor
> for this encryption mechanism, and there is absolutely no way of retrieving
> your password should you forget it.

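A hypothetical session, assuming your build was configured with encryption
support (`-e` is the encryption switch; it prompts for the password):

```bash
lrzip -e secret.img       # prompts for a password, writes secret.img.lrz
lrunzip secret.img.lrz    # prompts for the password again to decrypt
```
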
> Q: How do I make a static build?

> A: `./configure --enable-static-bin`

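In context, from a fresh git checkout:

```bash
./autogen.sh                     # only needed for git checkouts
./configure --enable-static-bin
make -j `nproc`
```
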
> Q: I want the absolute maximum compression I can possibly get, what do I do?

> A: Try the command line options "-Uzp 1 -L 9". This uses all available ram and
> ZPAQ compression, and even uses a compression window larger than you have ram.
> The -p 1 option disables multithreading which improves compression but at the
> expense of speed. Expect it to take many times longer.

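Spelled out:

```bash
lrzip -Uzp 1 -L 9 filename   # unlimited window + ZPAQ + single thread + max level
```
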
> Q: I want the absolute fastest decent compression I can possibly get.

> A: Try the command line option -l. This will use the lzo backend compression,
> and level 7 compression (1 isn't much faster).

> Q: How much slower is the unlimited mode?

> A: It depends on 2 things. First, just how much larger than your ram the file
> is, as the bigger the difference, the slower it will be. The second is how much
> redundant data there is. The more there is, the slower, but ultimately the
> better the compression. Why isn't it on by default? If the compression window is
> a LOT larger than ram, with a lot of redundant information it can be drastically
> slower. I may revisit this possibility in the future if I can make it any
> faster.

> Q: Can I use your tool for even more compression than lzma offers?

> A: Yes, the rzip preparation of files makes them more compressible by most
> other compression techniques I have tried. Using the -n option will generate
> a .lrz file smaller than the original which should be more compressible, and
> since it is smaller it will compress faster than it otherwise would have.

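For example (a sketch; `xz` stands in for whatever external compressor you
want to layer on top):

```bash
lrzip -n bigfile    # rzip preparation only -> bigfile.lrz
xz -9 bigfile.lrz   # hand the prepared file to another compressor
```
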
> Q: 32bit?

> A: 32bit machines have a limit of 2GB sized compression windows due to
> userspace limitations on mmap and malloc, so even if you have much more ram
> you will not be able to use compression windows larger than 2GB. Also you
> may be unable to decompress files compressed on 64bit machines which have
> used windows larger than 2GB.

> Q: How about 64bit?

> A: 64bit machines with their ability to address massive amounts of ram will
> excel with lrzip due to being able to use compression windows limited only in
> size by the amount of physical ram.

> Q: Other operating systems?

> A: The code is POSIXy with GNU extensions. Patches are welcome. Version 0.43+
> should build on MacOSX 10.5+.

> Q: Does it work on stdin/stdout?

> A: Yes it does. Compression and decompression work well to/from STDIN/STDOUT.
> However, because lrzip does multiple passes on the data, it has to store a
> large amount in ram before it dumps it to STDOUT (and vice versa), thus it
> is unable to work with the massive compression windows regular operation
> provides. Thus the compression afforded on files larger than approximately
> 25% RAM size will be less efficient (though still benefiting compared to
> traditional compression formats).

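A piped round trip might look like this sketch; it assumes the stdin/stdout
fall-through described above applies when no filename is given, so check your
version's man page:

```bash
tar cf - directory | lrzip > directory.tar.lrz
lrunzip < directory.tar.lrz | tar xf -
```
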
> Q: I have another compression format that is even better than zpaq, can you
> use that?

> A: You can use it yourself on rzip prepared files (see above). Alternatively
> if the source code is compatible with the GPL license it can be added to the
> lrzip source code. Libraries with functions similar to the compress() and
> decompress() functions of zlib would make the process most painless. Please
> tell me if you have such a library so I can include it :)

> Q: What's this "Starting lzma back end compression thread..." message?

> A: While I'm a big fan of progress percentage being visible, unfortunately
> lzma compression can't currently be tracked when handing over 100+MB chunks
> to the lzma library. Therefore you'll see progress percentage only until
> each chunk is handed over to the lzma library.

> Q: What's this "lzo testing for incompressible data" message?

> A: Other compression is much slower, and lzo is the fastest. To help speed up
> the process, lzo compression is performed on the data first to test that the
> data is at all compressible. If a small block of data is not compressible, it
> tests progressively larger blocks until it has tested all the data (if it fails
> to compress at all). If no compressible data is found, then the subsequent
> compression is not even attempted. This can save a lot of time during the
> compression phase when there is incompressible data. Theoretically it may be
> possible that data is compressible by the other backend (zpaq, lzma etc) and not
> at all by lzo, but in practice such data achieves only minuscule amounts of
> compression which are not worth pursuing. Most of the time it is clear one way
> or the other that data is compressible or not. If you wish to disable this
> test and force it to try compressing it anyway, use -T.

> Q: I have truckloads of ram so I can compress files much better, but can my
> generated file be decompressed on machines with less ram?

> A: Yes. Ram requirements for decompression go up only by the -L compression
> option with lzma and are never anywhere near as large as the compression
> requirements. However, if you're on 64bit and you use a compression window
> greater than 2GB, it might not be possible to decompress it on 32bit machines.

> Q: Why are you including bzip2 compression?

> A: To maintain a similar compression format to the original rzip (although the
> other modes are more useful).

> Q: What about multimedia?

> A: Most multimedia is already in a heavily compressed "lossy" format which by
> its very nature has very little redundancy. This means that there is not
> much that can actually be compressed. If your video/audio/picture is in a
> high bitrate, there will be more redundancy than in a low bitrate one, making it
> more suitable to compression. None of the compression techniques in lrzip are
> optimised for this sort of data. However, the nature of rzip preparation
> means that you'll still get better compression than most normal compression
> algorithms give you if you have very large files. ISO images of dvds for
> example are best compressed directly instead of as individual .VOB files. ZPAQ is
> the only compression format that can do any significant compression of
> multimedia.

> Q: Is this multithreaded?

> A: As of version 0.540, it is HEAVILY multithreaded with the back end
> compression and decompression phase, and will continue to process the rzip
> pre-processing phase, so when using one of the more CPU intensive backend
> compressions like lzma or zpaq, SMP machines will show massive speed
> improvements. Lrzip will detect the number of CPUs to use, but it can be
> overridden with the -p option if slightly better compression is desired
> more than speed. -p 1 will give the best compression but also be the slowest.

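Concretely:

```bash
lrzip -p 1 big.tar   # single thread: best ratio, slowest
lrzip -p 4 big.tar   # cap the backend at four threads
```
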
> Q: This uses heaps of memory, can I make it use less?

> A: Well you can by setting -w to the lowest value (1) but the huge use of
> memory is what makes the compression better than ordinary compression
> programs so it defeats the point. You'll still derive benefit with -w 1 but
> not as much.

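That is:

```bash
lrzip -w 1 filename   # smallest window: least memory, weakest compression
```
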
> Q: What CFLAGS should I use?

> A: With a recent enough compiler (gcc>4), set both CFLAGS and CXXFLAGS to
> `-O2 -march=native -fomit-frame-pointer`.

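For example:

```bash
CFLAGS="-O2 -march=native -fomit-frame-pointer" \
CXXFLAGS="-O2 -march=native -fomit-frame-pointer" \
./configure
make -j `nproc`
```
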
> Q: What compiler does this work with?

> A: It has previously been tested successfully on gcc, ekopath and the intel
> compiler. Whether the commercial compilers help or not, I could not tell you.

> Q: What codebase are you basing this on?

> A: rzip v2.1 and lzma sdk920, but it should be possible to stay in sync with
> each of these in the future.

> Q: Do we really need yet another compression format?

> A: It's not really a new one at all; simply a reimplementation of a few very
> good performing ones that will scale with memory and file size.

> Q: How do you use lrzip yourself?

> A: Three basic uses. I compress large files currently on my drive with the
> -l option since it is so quick to get a space saving. When archiving data for
> permanent storage I compress it with the default options. When compressing
> small files for distribution I use the -z option for the smallest possible
> size.

> Q: I found a file that compressed better with plain lzma. How can that be?

> A: When the file is more than 5 times the size of the compression window
> you have available, the efficiency of rzip preparation drops off as a means
> of getting better compression. Eventually, when the file is large enough,
> plain lzma compression will get better ratios. The lrzip compression will be
> a lot faster though. The only way around this is to use as large a compression
> window as possible, with the -U option.

> Q: Can I use swapspace as ram for lrzip with a massive window?

> A: It will indirectly do this with -U (unlimited) mode enabled. This mode will
> make the compression window as big as the file itself no matter how big it is,
> but it will slow down proportionately more the bigger the file is than your ram.

> Q: Why do you nice it to +19 by default? Can I speed up the compression by
> changing the nice value?

> A: This is a common misconception about what nice values do. They only tell the
> cpu process scheduler how to prioritise workloads, and if your application is
> the _only_ thing running it will be no faster at nice -20 nor will it be any
> slower at +19.

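If you do want lrzip itself to run at a different priority (say, alongside
other batch jobs), newer builds accept an `-N` nice-value option; this flag is
an assumption here, so check `lrzip -h` on your version:

```bash
lrzip -N 0 filename   # assumed flag: run at normal priority instead of +19
```
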
> Q: What is the LZO Testing option, -T?

> A: LZO testing is normally performed for the slower back-end compression of LZMA
> and ZPAQ. The reasoning is that if it is completely incompressible by LZO then
> it will also be incompressible by them. Thus, if a block fails to be compressed
> by the very fast LZO, lrzip will not attempt to compress that block with the
> slower compressor, thereby saving time. If this option is enabled, it will
> bypass the LZO testing and attempt to compress each block regardless.

> Q: Compression and decompression progress on large archives slows down and
> speeds up. There's also a jump in the percentage at the end?

> A: Yes, that's the nature of the compression/decompression mechanism. The jump
> is because the rzip preparation makes the amount of data the compression
> backend (lzma) needs to compress much smaller.

> Q: Tell me about patented compression algorithms, GPL, lawyers and copyright.

> A: No

> Q: I receive an error "LZMA ERROR: 2. Try a smaller compression window."
> What does this mean?

> A: LZMA requests large amounts of memory. When a higher compression window is
> used, there may not be enough contiguous memory for LZMA. LZMA may request
> up to 25% of TOTAL ram depending on compression level. If contiguous blocks
> of memory are not free, LZMA will return an error. This is not a fatal
> error, and a backup mode of compression will be used.

> Q: Where can I get more information about the internals of LZMA?

> A: See http://www.7-zip.org and http://www.p7zip.org. Also, see the file
> ./lzma/C/lzmalib.h, which explains the LZMA properties used and the LZMA
> memory requirements and computation.

> Q: This version is much slower than the old version?

> A: Make sure you have set CFLAGS and CXXFLAGS. An unoptimized build will be
> almost 3 times slower.

#### LIMITATIONS

Due to mmap limitations the maximum size a window can be set to is currently
2GB on 32bit unless the -U option is specified. Files generated on 64 bit
machines with windows >2GB in size might not be decompressible on 32bit
machines. Large files might not decompress on machines with less RAM if SWAP is
disabled.

#### BUGS:

Probably lots. Report any you spot at <https://github.com/ckolivas/lrzip/issues> :D
Any known ones should be documented in the file BUGS.

#### Backends:

rzip: <http://rzip.samba.org/>
lzo: <http://www.oberhumer.com/opensource/lzo/>
lzma: <http://www.7-zip.org/>
zpaq: <http://mattmahoney.net/dc/>

### Thanks (CONTRIBUTORS)

|Person(s)|Thanks for|
|---|---|
|`Andrew Tridgell`|`rzip`|
|`Markus Oberhumer`|`lzo`|
|`Igor Pavlov`|`lzma`|
|`Jean-Loup Gailly & Mark Adler`|`zlib`|
|***`Con Kolivas`***|***Original Code, binding all of this together, managing the project, original `README`***|
|`Christian Leber`|`lzma` compatibility layer|
|`Michael J Cohen`|Darwin/OSX support|
|`Lasse Collin`|Fixes to `LZMALib.cpp` and `Makefile.in`|
|Everyone else who coded along the way (add yourself where appropriate if that's you)|Miscellaneous coding|
|**`Peter Hyman`**|Most of the `0.19` to `0.24` changes|
|`^^^^^^^^^^^`|Updating the multithreaded `lzma` lib|
|`^^^^^^^^^^^`|All sorts of other features|
|`René Rhéaume`|Fixing executable stacks|
|`Ed Avis`|Various fixes|
|`Matt Mahoney`|`zpaq` integration code|
|`Jukka Laurila`|Additional Darwin/OSX support|
|`George Makrydakis`|`lrztar` wrapper|
|`Ulrich Drepper`|His md5 implementation|
|**`Michael Blumenkrantz`**|New config tools|
|`^^^^^^^^^^^^^^^^^^^^`|`liblrzip`|
|Authors of `PolarSSL`|Encryption code|
|`Serge Belyshev`|Extensive help, advice, and patches to implement secure encryption|
|`Jari Aalto`|Fixing typos, esp. in code|
|`Carlo Alberto Ferraris`|Code cleanup|
|`Peter Hyman`|Additional documentation|
|`Haneef Mubarak`|Cleanup, rewrite, and GH Markdown of `README` --> `README.md`|

Persons above are listed in chronological order of first contribution to **lrzip**. Person(s) with names in **bold** have multiple major contributions, person(s) with names in *italics* have made massive contributions, person(s) with names in ***both*** have made innumerable massive contributions.

#### README Authors

Con Kolivas (`ckolivas` on GitHub) <kernel@kolivas.org>
Sat, 11 March 2011: README

Also documented by
Peter Hyman <pete@peterhyman.com>
Sun, 04 Jan 2009: README

Mostly rewritten + GFMified by
Haneef Mubarak (`haneefmubarak` on GitHub)
Sun/Mon Sep 01-02 2013: README.md