lrzip

mirror of https://github.com/ckolivas/lrzip.git synced 2025-12-06 07:12:00 +01:00

Author	SHA1	Message	Date
Con Kolivas	a7b4708bd2	Use a different failure mode for when perror is unlikely to be set. Add 2 unlikely wrappers.	2011-02-21 14:51:20 +11:00
Con Kolivas	74df2b5973	Minor updates to man pages, lrzip.conf example file. Update main help screen to include environment settings. Update to respect $TMP environment variable for TMP files. Updated control structure to include tmpdir pointer. Update lrzip.conf parser to respect -U -M options. Update lrzip.conf example to include new parameters. Reorder main Switch loop in main.c for readability. Have MAXRAM and control.window be exclusive. MAXRAM wins. Have UNLIMITED and control.window be exclusive. UNLIMITED wins. Have UNLIMITED and MAXRAM be exclusive. UNLIMITED wins. Corrects heuristic computation in rzip.c which would override MAXRAM or UNLIMITED if control.window set Show heuristically computed control.window when computed. Remove display compression level from control.window verbose output. Update print_verbose format for Testing for incompressible data in stream.c to omit extra \n. Changes by Peter Hyman <pete@peterhyman.com>	2011-02-21 12:03:08 +11:00
Con Kolivas	0b81d38259	Update version to 0.560	2011-02-21 00:14:03 +11:00
Con Kolivas	57e25da244	Update copyright yeah in updated files.	2011-02-20 23:04:44 +11:00
Con Kolivas	c9863e0e60	Change default behaviour to deleting broken or damaged files that occur by interrupting lrzip or that fail integrity testing. Implement the -k option to optionally keep broken or damaged files.	2011-02-20 23:02:15 +11:00
Con Kolivas	9b264959f5	Implement the ability to test the integrity of the file written to disk on decompression.	2011-02-20 22:29:49 +11:00
Con Kolivas	8a27dc5057	Changes to make md5 be used for integrity testing. Add the md5 value to the end of each archive. This can then be used for integrity testing instead of crc32. Keep crc in new archives to maintain compatibility with version 0.5 files. Use md5 integrity testing on decompression when available in preference, and disable calculation of crc32. Display the choice of integrity testing in verbose output and when -i is used. Display the md5 and crc values when max verbosity, file info, or display hash is enabled. Store a new flag in the magic header to show that the md5 value is stored at the end of the file. Update the magic header information document.	2011-02-20 18:01:19 +11:00
Con Kolivas	fb2a12744a	Implement md5 checking on decompression. Implement hash check flag to determine whether to show md5sum on compression/decompression or not.	2011-02-18 15:16:13 +11:00
Con Kolivas	c2417877bc	Being implementing md5 hash checking by importing the md5 code from coreutils. Implement md5 hash checking on compression by doing the md5 hash check as each sb low buffer has been allocated to avoid going over the file again where possible.	2011-02-18 09:09:40 +11:00
Con Kolivas	626e0be281	Convert semaphore primitives to pthread_mutexes making them more portable, thus allowing multithreading to work on OSX.	2011-02-17 00:24:28 +11:00
Con Kolivas	05c5326df3	Revert "OSX doesn't support unnamed semaphores so to make it work, fake the threading by just creating the threads and waiting for them to finish." This reverts commit `b81542cea4`. Revert the change bypassing semaphores in OSX in preparation for changing the semaphores to mutexes.	2011-02-16 17:40:50 +11:00
Con Kolivas	ea9b00c839	Update version number to 0.552 and update documentation.	2011-02-11 13:22:29 +11:00
Con Kolivas	b81542cea4	OSX doesn't support unnamed semaphores so to make it work, fake the threading by just creating the threads and waiting for them to finish. This is done by making the semaphore wrappers null functions on osx and closing the thread in the creation wrapper. Move the wrappers to rzip.h to make this change clean.	2011-02-11 12:22:09 +11:00
Con Kolivas	2a0553bc54	Revert "Decompress more than one stream at a time if there are threads free and the end of one stream is reached." This reverts commit `8ee9ef64f5`. This change is unreliable. Hence revert it and all dependent patches.	2011-02-09 12:39:15 +11:00
Con Kolivas	8239635038	Revert "Limit the maximum number of threads on stream 0 to 1 again as stream 1 data always appear after a chunk of stream 0 data." This reverts commit `0b0f6db606`.	2011-02-09 12:39:02 +11:00
Con Kolivas	0b0f6db606	Limit the maximum number of threads on stream 0 to 1 again as stream 1 data always appear after a chunk of stream 0 data.	2011-02-09 10:45:03 +11:00
Con Kolivas	8ee9ef64f5	Decompress more than one stream at a time if there are threads free and the end of one stream is reached. Still limit total threads running to control.threads. This affords a small speedup on decompression.	2011-02-08 11:58:01 +11:00
Con Kolivas	9c2b86fec6	We are flushing the wrong file on decompression. Make sure we flush the file out.	2011-02-08 08:27:22 +11:00
Con Kolivas	191a55d03c	Apply correct fix for freeBSD not implementing mremap thanks to Brian Carlson.	2011-01-02 14:23:29 +11:00
Con Kolivas	2cabb335cb	Update copyright notices courtesy of Jari Aalto.	2010-12-16 09:45:21 +11:00
Con Kolivas	e83612fc0a	Freebsd doesn't do mremap. Use the fake one.	2010-12-16 09:42:34 +11:00
Con Kolivas	654ae02ba6	Quick bump to version 0.551. Update docs.	2010-12-12 17:48:07 +11:00
Con Kolivas	981859fdba	Version to 0.550.	2010-12-12 10:51:29 +11:00
Con Kolivas	a6ab7c875b	Limit the number of threads decompressing stream 0 to just 1 since it's always followed by stream 1 chunks, and it may lead to failure to decompress due to running out of memory by running too many threads.	2010-12-11 00:04:30 +11:00
Con Kolivas	50437a8447	Move the threading on compression to higher up in the code, allowing the next stream to start using compression threads before the previous stream has finished. This overlapping of compressing streams means that when files are large enough to be split into multiple blocks, all CPUs will be used more effectively throughout the compression, affording a nice speedup. Move the writing of the chunk byte size and initial headers into the compthread to prevent any races occurring. Fix a few dodgy callocs that may have been overflowing! The previous commit reverts were done because the changes designed to speed it up actually slowed it down instead.	2010-12-10 23:51:59 +11:00
Con Kolivas	c3dfcfcec1	Update version number to 0.544. Change suggested maximum compression in README to disable threading with -p 1. Use bzip2 as a fallback compression when lzma fails due to internal memory errors as may happen on 32 bits.	2010-12-04 21:36:51 +11:00
Con Kolivas	22c87ed3ee	Fix missing define.	2010-12-03 19:38:21 +11:00
Con Kolivas	2da407a178	Change decompression threading to have a group of threads for each stream (2 in total), thus making mulithreaded decompression more robust.	2010-12-03 19:35:48 +11:00
Con Kolivas	d39d3959e8	uclibc doesn't return valid values via sysconf for ram, so work around it by reading /proc if possible.	2010-11-30 11:24:29 +11:00
Con Kolivas	22da2ee76d	Push version number to 0.543. Update docs.	2010-11-24 21:08:35 +11:00
Con Kolivas	6f2b94be3b	Fix the case where a compressed file has more than one stream 0 entry per block. Limit lzma windows to 300MB in the right place on 32 bit only. Make the main process less nice than the backend threads since it tends to be the rate limiting step.	2010-11-24 20:12:19 +11:00
Con Kolivas	75e675e6dd	Bump version number to 0.542. Choose sane defaults for memory usage since linux ludicriously overcommits. Use sliding mmap for any compression windows greater than 2/3 ram. Consolidate and simplify testing of allocatable ram. Minor tweaks to output. Round up the size of the high buffer in sliding mmap to one page. Squeeze a little more out of 32 bit compression windows.	2010-11-20 01:23:08 +11:00
Con Kolivas	591d791791	Bump version to 0.541. Limit LZMA window to 300MB on 32 bit as per reports of failure when larger. Minor documentation and display clean ups.	2010-11-18 23:33:43 +11:00
Con Kolivas	0a4f6807e5	Increase version number to 0.540. Fix compression type reporting on -i. Remove some unnecessary output when -i is used. Update docs and benchmarks.	2010-11-16 23:14:35 +11:00
Con Kolivas	2b08c6e280	Implement massive multithreading decompression. This is done by taking each stream of data on read in into separate buffers for up to as many threads as CPUs. As each thread's data becomes available, feed it into runzip once it is requests more of the stream. Provided there are enough chunks in the originally compressed data, this provides a massive speedup potentially proportional to the number of CPUs. The slower the backend compression, the better the speed up (i.e. zpaq is the best sped up). Fix the output of zpaq compress and decompress from trampling on itself and racing and consuming a lot of CPU time printing to the console. When limiting cwindow to 6 on 32 bits, ensure that control.window is also set. When testing for the maximum size of testmalloc, the multiple used was out by one, so increase it. Minor output tweaks.	2010-11-16 21:25:32 +11:00
Con Kolivas	1637598c3f	Bump version number up to 0.530. Update all documentation. Minor fixes by Jari Aalto for build and docs.	2010-11-13 21:37:17 +11:00
Con Kolivas	6e4fdc97f8	Implement -p option to specify number of processors to determine thread count. Remove -P option as failing to set permissions only issues a warning now, removing any requirement for -P. Change default compression level back to 7 as 9 was not giving significantly better compression but was slowing things down.	2010-11-13 17:36:21 +11:00
Con Kolivas	e9957e1115	Fix zpaq compression now updating the console too much because it's now so much faster it uses up a lot of CPU time just ouputting to the screen. Do this by updating only every 10%, and print separate updates for each thread.	2010-11-13 08:33:30 +11:00
Con Kolivas	02de002c58	Reworked the multithreading massively. Place the data from each stream into a buffer that then is handed over to one thread which is allowed to begin doing the backend compression while the main rzip stream continues operating. Fork up to as many threads as CPUs and feed data to them in a ring fashion, parallelising the workload as much as possible. This causes a big speed up on the compression side on SMP machines. Thread compression is limited to a minimum of 10MB compressed per thread to minimise the compromise to compression of smaller windows. Alter the progress output to match some of the changes in verbose modes.	2010-11-13 01:26:09 +11:00
Con Kolivas	5505097b2f	Implement multithreaded back end compression by splitting up the compression stream into multiple threads, dependant on the number of CPUs detected. This facilitates a massive speed up on SMP machines proportional to the number of CPUs during the back end compression phase, but does so at some cost to the final size. Limit the number of threads to ensure that each thread at least works on a window of STREAM_BUFSIZE. Disable the lzma threading library as it does not contribute any more to the scalability of this new approach, yet compromises compression. Increase the size of the windows passed to all the compression back end types now as they need more to split them up into multiple threads, and the number of blocks increases the compressed size slightly.	2010-11-10 20:56:17 +11:00
Con Kolivas	b469e7b56c	Distros don't like 3 point versioning scheme so change back to 2 point and change version to 0.520.	2010-11-08 10:43:05 +11:00
Con Kolivas	91228195a7	Bump version up to 0.5.2 for real.	2010-11-07 15:34:25 +11:00
Con Kolivas	ead0e54182	Drop the upper limit on lzma compression window on 64 bit. It is not necessary. zpaq will fail with windows bigger than 600MB on 32 bit due to failing testing the 3* malloc test, so limit it to 600MB as well as lzma on 32 bit.	2010-11-07 01:57:23 +11:00
Con Kolivas	b8528abee9	Consciously check page size, even though no one's going to build this on a machine with a different page size. Clean up builds, removing ifdefs from main code. Make output choices consistent.	2010-11-06 18:17:33 +11:00
Con Kolivas	296534921a	Unlimited mode is now usable in a meaningful timeframe! Modify the sliding mmap window to have a 64k smaller buffer which matches the size of the search size, and change the larger lower buffer to make it slide with the main hash search progress. This makes for a MUCH faster unlimited mode, making it actually usable. Limit windows to 2GB again on 32 bit, but do it when determining the largest size possible in rzip.c. Implement a linux-kernel like unlikely() wrapper for inbuilt expect, and modify most fatal warnings to be unlikely, and a few places where it's also suitable. Minor cleanups.	2010-11-05 12:16:43 +11:00
Con Kolivas	29b166629a	Huge rewrite of buffer reading in rzip.c. We use a wrapper instead of accessing the buffer directly, thus allowing us to have window sizes larger than available ram. This is implemented through the use of a "sliding mmap" implementation. Sliding mmap uses two mmapped buffers, one large one as previously, and one page sized smaller one. When an attempt is made to read beyond the end of the large buffer, the small buffer is remapped to the file area that's being accessed. While this implementation is 100x slower than direct mmapping, it allows us to implement unlimited sized compression windows. Implement the -U option with unlimited sized windows. Rework the selection of compression windows. Instead of trying to guess how much ram the machine might be able to access, we try to safely buffer as much ram as we can, and then use that to determine the file buffer size. Do not choose an arbitrary upper window limit unless -w is specified. Rework the -M option to try to buffer the entire file, reducing the buffer size until we succeed. Align buffer sizes to page size. Clean up lots of unneeded variables. Fix lots of minor logic issues to do with window sizes accepted/passed to rzip and the compression backends. More error handling. Change -L to affect rzip compression level directly as well as backend compression level and use 9 by default now. More cleanups of information output. Use 3 point release numbering in case one minor version has many subversions. Numerous minor cleanups and tidying. Updated docs and manpages.	2010-11-04 21:14:55 +11:00
Con Kolivas	c106128d1a	Fix darwin build and clean up ifdef and incorrect ordered includes at the same time. All builds were using fakememopen due to incorrect ifdef usage, so make GNU builds actually use memopen again.	2010-11-03 13:14:46 +11:00
Con Kolivas	ba1bf30d78	Minor tidying.	2010-11-01 19:59:27 +11:00
Con Kolivas	232b268367	Fix file size recording in the magic header when stdin is used.	2010-11-01 19:18:20 +11:00
Con Kolivas	3a22eb09b3	Fix output to work correctly to screen when stdout is selected. Make stdout write directly to stdout on decompression without the need for temporary files since there is no need to seek backwards. Make file testing not actually write the file during test. More tidying up.	2010-11-01 11:18:58 +11:00

1 2

57 commits