lrzip

mirror of https://github.com/ckolivas/lrzip.git synced 2025-12-06 07:12:00 +01:00

Author	SHA1	Message	Date
Con Kolivas	5505097b2f	Implement multithreaded back end compression by splitting up the compression stream into multiple threads, dependant on the number of CPUs detected. This facilitates a massive speed up on SMP machines proportional to the number of CPUs during the back end compression phase, but does so at some cost to the final size. Limit the number of threads to ensure that each thread at least works on a window of STREAM_BUFSIZE. Disable the lzma threading library as it does not contribute any more to the scalability of this new approach, yet compromises compression. Increase the size of the windows passed to all the compression back end types now as they need more to split them up into multiple threads, and the number of blocks increases the compressed size slightly.	2010-11-10 20:56:17 +11:00
Con Kolivas	ead0e54182	Drop the upper limit on lzma compression window on 64 bit. It is not necessary. zpaq will fail with windows bigger than 600MB on 32 bit due to failing testing the 3* malloc test, so limit it to 600MB as well as lzma on 32 bit.	2010-11-07 01:57:23 +11:00
Con Kolivas	b8528abee9	Consciously check page size, even though no one's going to build this on a machine with a different page size. Clean up builds, removing ifdefs from main code. Make output choices consistent.	2010-11-06 18:17:33 +11:00
Con Kolivas	017ec9e85a	Move the ram allocation phase into rzip_fd to be able to get a more accurate measure of percentage done. Prevent failure when offset is not a multiple of page size. Add chunk percentage complete to output. Tweak output at various verbosities. Update documentation to reflect improved performance of unlimited mode. Update benchmark results. More tidying.	2010-11-05 23:02:58 +11:00
Con Kolivas	a66dafe66a	Updated benchmark results. More tidying up.	2010-11-05 14:52:14 +11:00
Con Kolivas	296534921a	Unlimited mode is now usable in a meaningful timeframe! Modify the sliding mmap window to have a 64k smaller buffer which matches the size of the search size, and change the larger lower buffer to make it slide with the main hash search progress. This makes for a MUCH faster unlimited mode, making it actually usable. Limit windows to 2GB again on 32 bit, but do it when determining the largest size possible in rzip.c. Implement a linux-kernel like unlikely() wrapper for inbuilt expect, and modify most fatal warnings to be unlikely, and a few places where it's also suitable. Minor cleanups.	2010-11-05 12:16:43 +11:00
Con Kolivas	29b166629a	Huge rewrite of buffer reading in rzip.c. We use a wrapper instead of accessing the buffer directly, thus allowing us to have window sizes larger than available ram. This is implemented through the use of a "sliding mmap" implementation. Sliding mmap uses two mmapped buffers, one large one as previously, and one page sized smaller one. When an attempt is made to read beyond the end of the large buffer, the small buffer is remapped to the file area that's being accessed. While this implementation is 100x slower than direct mmapping, it allows us to implement unlimited sized compression windows. Implement the -U option with unlimited sized windows. Rework the selection of compression windows. Instead of trying to guess how much ram the machine might be able to access, we try to safely buffer as much ram as we can, and then use that to determine the file buffer size. Do not choose an arbitrary upper window limit unless -w is specified. Rework the -M option to try to buffer the entire file, reducing the buffer size until we succeed. Align buffer sizes to page size. Clean up lots of unneeded variables. Fix lots of minor logic issues to do with window sizes accepted/passed to rzip and the compression backends. More error handling. Change -L to affect rzip compression level directly as well as backend compression level and use 9 by default now. More cleanups of information output. Use 3 point release numbering in case one minor version has many subversions. Numerous minor cleanups and tidying. Updated docs and manpages.	2010-11-04 21:14:55 +11:00
Con Kolivas	c106128d1a	Fix darwin build and clean up ifdef and incorrect ordered includes at the same time. All builds were using fakememopen due to incorrect ifdef usage, so make GNU builds actually use memopen again.	2010-11-03 13:14:46 +11:00
Con Kolivas	9c00229ee1	Fix fd_hist check occurring too early on decompress. fsync after flushing buffer. Remove unnecessary memset after anonymous mmap. Do test malloc before compression backend to see how big a chunk can be passed.	2010-11-02 13:50:15 +11:00
Con Kolivas	1e88273ffc	Minor fixes.	2010-11-02 00:08:35 +11:00
Con Kolivas	772fbf602e	Reinstitute 2GB window limit on 32 bit. It still doesn't work. However we can now decompress larger windows. Do more mmap in place of malloc. Update docs. Remove redundant code.	2010-11-01 22:55:59 +11:00
Con Kolivas	0e09c9ec24	Reinstate the temporary file for decompression from stdin. Seeking makes that option broken without the temporary file. Start documenting the new features in this version. Minor tidying.	2010-11-01 21:37:55 +11:00
Con Kolivas	49336a5e87	Update magic header info. Move offset check to after reading chunk width. Realloc instead of free and malloc.	2010-11-01 15:27:35 +11:00
Con Kolivas	cb27097cb4	A few more cleanups to avoid using temporary files for stdout and testing on decompression.	2010-11-01 12:50:20 +11:00
Con Kolivas	3a22eb09b3	Fix output to work correctly to screen when stdout is selected. Make stdout write directly to stdout on decompression without the need for temporary files since there is no need to seek backwards. Make file testing not actually write the file during test. More tidying up.	2010-11-01 11:18:58 +11:00
Con Kolivas	a9ad1aef0e	Start cleaning up all the flag testing with neat macros.	2010-11-01 04:53:53 +11:00
Con Kolivas	8d9b64e1ec	Change byte width to be dependant on file size. This will increase speed of compression and generate a smaller file, but not be backward compatible. Tweak the way memory is allocated to optimise chances of success and minimise slowdown for the machine. fsync to empty dirty data before allocating large ram to increase chance of mem allocation and decrease disk thrash of write vs read. Add lots more information to verbose mode. Lots of code tidying and minor tweaks.	2010-10-31 15:09:05 +11:00
Con Kolivas	c5da3a1adb	Add more robust checking. Premalloc ram to improve early detection of being unable to allocate that much ram. Make sure to always make chunk size a multiple of page size for mmap to work. Begin changes to make variable byte width offsets in rzip chunks. Decrease header entries to only 2 byte wide as per original rzip. Random other tidying.	2010-10-31 10:35:04 +11:00
Con Kolivas	d972496aa8	Make messages come via stdout instead of stderror courtesy of Alexander Saprykin	2010-04-25 16:26:00 +10:00
Con Kolivas	6dcceb0b1b	Initial import	2010-03-29 10:07:08 +11:00

20 commits