d6/xz-analysis-mirror

Commit Graph

Author	SHA1	Message	Date
Jia Tan	340505c033	xz: Hide the number of input files with recursive mode. In recursive mode we don't know how many files to process at the beginning. So just like when using --files or --files0, the number of total files will not be shown.	2024-01-29 21:40:53 +08:00
Jia Tan	9ffdb5f006	xz: Parse directories in recursive mode. This directory parsing method prioritizes lower memory usage and file descriptor utilization at the cost of more complicated code and a higher number of small allocations. This method makes no recursive calls and instead keeps a queue of directories to parse.Only one directory file descriptor is ever needed at one time. The directory_iterator abstracts the implementation of the directory parsing to allow for an easy interface for both POSIX and MSVC. Currently the MSVC builds suffers from MAX_PATH being limited to 260 by default. This restricts the usefulness of recursive mode on Windows. A user can edit a registry config in Windows 10, Version 1607 and later to remove this low path limit. Alternatively, we can prefix the absolute path with "\\?\" to also remove the restriction. Note, this restriction also applies to the compatibility functions so MSVC builds cannot read or write to files with paths longer than 260 characters.	2024-01-29 21:40:53 +08:00
Jia Tan	f8c199bcdc	xz: Restrict when recursive mode can be used. If we are not compiling with dirent.h or MSVC, then we cannot use recursive mode. Unfortunatly, there is not a great portable way to parse directory contents. There are _find_next() functions available for DOS like platforms, but Windows version of these functions is different. Since we do not have a good way to test these functions, support will not be added at this time.	2024-01-29 21:40:53 +08:00
Jia Tan	fe1af552d3	xz: Allow directories in io_open_src() in recursive mode. If the directory is a symlink, it is skipped to prevent a loop in the directory structure that would cause infinite recursion.	2024-01-29 21:40:53 +08:00
Jia Tan	e08d65acaf	Build: Check for dirent.h. For both CMake and Autotools, define HAVE_DIRENT_H if the header file is found.	2024-01-29 21:40:53 +08:00
Jia Tan	8d07e9bb7c	xz: Enable -r, --recursive option.	2024-01-29 21:40:53 +08:00
Jia Tan	b10b2e4a8f	xz: Change the way coder_run() and list_run() are called in main(). Previously, a function pointer was used to determine if coder_run() or list_run() should be called in the main entry processing loop. This was replaced by an extra function call to process_entry(). coder_run() and list_run() were changed to accept a file_pair * argument instead of a filename. The common repeated code was moved to process_entry() instead.	2024-01-29 21:40:53 +08:00
Jia Tan	a3bac71fe3	xz: Reorder #include order in private.h.	2024-01-29 21:40:53 +08:00
Jia Tan	882aad963e	xz: Move some list_file() checks to args_parse(). The checks enforce that list mode will only run on .xz files. The opt_format is only set during argument parsing and will not change after. So we only need to check this once instead of every call to list_file(). Additionally, this will cause the error to be detected slightly earlier.	2024-01-29 21:40:53 +08:00
Jia Tan	d6d1e40f19	xz: Add a function to print Windows specific error messages. Native Windows C API functions do not use errno, but instead have to call GetLastError(). There is not an easy way to convert this error code into a helpful message, so this creates a wrapper around the slightly complicated FormatMessage() function. The new message_windows_error() function calls message_error() under the hood, so it will set the exit status to 1.	2024-01-29 21:40:53 +08:00
Jia Tan	b43c3e48bf	Bump version number for 5.5.1alpha.	2024-01-26 19:05:51 +08:00
Jia Tan	c7a7ae1500	Add NEWS for 5.5.1alpha	2024-01-26 19:00:52 +08:00
Jia Tan	0ef8192e8d	Add NEWS for 5.4.6.	2024-01-26 18:54:24 +08:00
Lasse Collin	93de7e751d	Move doc/logo/xz-logo.png to "doc" and Doxygen footer to "doxygen". The footer isn't a complete HTML file so having it in the doxygen directory is a tiny bit clearer.	2024-01-24 20:00:57 +02:00
Jia Tan	00fa01698d	README: Add COPYING.CC-BY-SA-4.0 entry to section 1.1. The Overall documentation section (1.1) table spacing had to be adjusted since the filename was very long.	2024-01-25 01:39:35 +08:00
Jia Tan	e280470040	Build: Add the logo and license to the release.	2024-01-25 01:39:35 +08:00
Jia Tan	b1ee6cf259	COPYING: Add the license for the XZ logo.	2024-01-25 01:39:29 +08:00
Jia Tan	31293ae707	Doxygen: Added the XZ logo and copyright information. The PROJECT_LOGO field is now used to include the XZ logo. The footer of each page now lists the copyright information instead of the default footer. The license is also copied to statisfy the copyright and so the link in the documentation can be local.	2024-01-25 01:06:01 +08:00
Lasse Collin	6daa4d0ea4	xz: Use threaded mode by defaut (as if --threads=0 was used). This hopefully does more good than bad: + It's faster by default. + Only the threaded compressor creates files that can be decompressed in threaded mode. - Compression ratio is worse, usually not too much though. When it matters, -T1 must be used. - Memory usage increases. - Scripts that assume single-threaded mode but don't use -T1 will possibly use too much resources, for example, if they run multiple xz processes in parallel to compress multiple files. - Output from single-threaded and multi-threaded compressors differ but such changes could happen for other reasons too (they just haven't happened since 5.0.0).	2024-01-23 18:29:28 +02:00
Jia Tan	a2dd2dc8e5	CI: Use RISC-V filter when building with BCJ support.	2024-01-23 23:55:44 +08:00
Jia Tan	3060e1070b	Tests: Use smaller dictionary size in RISC-V test files.	2024-01-23 23:55:44 +08:00
Jia Tan	44ff2fa5c9	Tests: Skip RISC-V test files if decoder was not built.	2024-01-23 23:55:39 +08:00
Lasse Collin	6133a3f300	xz: Man page: Add more examples of LZMA2 options with BCJ filters.	2024-01-23 23:05:47 +08:00
Lasse Collin	50255feeaa	liblzma: RISC-V filter: Use byte-by-byte access. Not all RISC-V processors support fast unaligned access so it's better to read only one byte in the main loop. This can be faster even on x86-64 when compared to reading 32 bits at a time as half the time the address is only 16-bit aligned. The downside is larger code size on archs that do support fast unaligned access.	2024-01-23 23:05:47 +08:00
Jia Tan	db5eb5f563	xz: Update xz -lvv for RISC-V filter. Version 5.6.0 will be shown, even though upcoming alphas and betas will be able to support this filter. 5.6.0 looks nicer in the output and people shouldn't be encouraged to use an unstable version in production in any way.	2024-01-23 23:05:47 +08:00
Jia Tan	e2870db5be	Tests: Add two RISC-V Filter test files. These test files achieve 100% code coverage in src/liblzma/simple/riscv.c. They contain all of the instructions that should be filtered and a few cases that should not.	2024-01-23 23:05:47 +08:00
Jia Tan	b26a898693	xz: Update message in --long-help for RISC-V Filter.	2024-01-23 23:05:47 +08:00
Jia Tan	283f778908	xz: Update the man page for the RISC-V Filter. A special note was added to suggest using four-byte alignment when the compressed instruction extension is not present in a RISC-V binary.	2024-01-23 23:05:47 +08:00
Jia Tan	ac3691ccca	Tests: Add RISC-V Filter test in test_compress.sh.	2024-01-23 23:05:47 +08:00
Jia Tan	2959dbc735	liblzma: Update string_conversion.c to support RISC-V Filter.	2024-01-23 23:05:47 +08:00
Jia Tan	34372a5adb	CMake: Support RISC-V BCJ Filter for encoding and decoding.	2024-01-23 23:05:47 +08:00
Jia Tan	440a2eccb0	liblzma: Add RISC-V BCJ filter. The new Filter ID is 0x0B. Thanks to Chien Wong <m@xv97.com> for the initial version of the Filter, the xz CLI updates, and the Autotools build system modifications. Thanks to Igor Pavlov for his many contributions to the design of the filter.	2024-01-23 23:05:41 +08:00
Jia Tan	5540f4329b	Docs: Update .xz file format specification to 1.2.0. The new RISC-V filter was added to the specification, in addition to updating the specification URL.	2024-01-19 23:08:14 +08:00
Jia Tan	22d86192f8	xz: Update website URLs in the man pages.	2024-01-19 23:08:14 +08:00
Jia Tan	6b63c4c613	liblzma: Update website URL.	2024-01-19 23:08:14 +08:00
Jia Tan	fce4758018	Docs: Update website URLs.	2024-01-19 23:08:14 +08:00
Jia Tan	c26812c5b2	Build: Update website URL.	2024-01-19 23:08:14 +08:00
Lasse Collin	fbb3ce541e	liblzma: CRC: Add a comment to crc_x86_clmul.h about BUILDING_ macros.	2024-01-11 15:25:00 +02:00
Lasse Collin	4f518c1b6b	liblzma: CRC: Remove crc_always_inline, use lzma_always_inline instead. Now crc_simd_body() in crc_x86_clmul.h is only called once in a translation unit, we no longer need to be so cautious about ensuring the always-inline behavior.	2024-01-11 15:24:35 +02:00
Lasse Collin	35c03ec6bf	liblzma: CRC: Update CLMUL comments to more generic wording.	2024-01-11 14:39:46 +02:00
Lasse Collin	66f080e801	liblzma: Rename arch-specific CRC functions and macros. CRC_CLMUL was split to CRC_ARCH_OPTIMIZED and CRC_X86_CLMUL. CRC_ARCH_OPTIMIZED is defined when an arch-optimized version is used. Currently the x86 CLMUL implementations are the only arch-optimized versions, and these also use the CRC_x86_CLMUL macro to tell when crc_x86_clmul.h needs to be included. is_clmul_supported() was renamed to is_arch_extension_supported(). crc32_clmul() and crc64_clmul() were renamed to crc32_arch_optimized() and crc64_arch_optimized(). This way the names make sense with arch-specific non-CLMUL implementations as well.	2024-01-11 14:29:42 +02:00
Lasse Collin	3dbed75b0b	liblzma: Fix a comment in crc_common.h.	2024-01-11 14:29:42 +02:00
Lasse Collin	419f55f9df	liblzma: Avoid extern lzma_crc32_clmul() and lzma_crc64_clmul(). A CLMUL-only build will have the crcxx_clmul() inlined into lzma_crcxx(). Previously a jump to the extern lzma_crcxx_clmul() was needed. Notes about shared liblzma on ELF platforms: - On platforms that support ifunc and -fvisibility=hidden, this was silly because CLMUL-only build would have that single extra jump instruction of extra overhead. - On platforms that support neither -fvisibility=hidden nor linker version script (liblzma*.map), jumping to lzma_crcxx_clmul() would go via PLT so a few more instructions of overhead (still not a big issue but silly nevertheless). There was a downside with static liblzma too: if an application only needs lzma_crc64(), static linking would make the linker include the CLMUL code for both CRC32 and CRC64 from crc_x86_clmul.o even though the CRC32 code wouldn't be needed, thus increasing code size of the executable (assuming that -ffunction-sections isn't used). Also, now compilers are likely to inline crc_simd_body() even if they don't support the always_inline attribute (or MSVC's __forceinline). Quite possibly all compilers that build the code do support such an attribute. But now it likely isn't a problem even if the attribute wasn't supported. Now all x86-specific stuff is in crc_x86_clmul.h. If other archs The other archs can then have their own headers with their own is_clmul_supported() and crcxx_clmul(). Another bonus is that the build system doesn't need to care if crc_clmul.c is needed. is_clmul_supported() stays as inline function as it's not needed when doing a CLMUL-only build (avoids a warning about unused function).	2024-01-11 14:29:42 +02:00
Lasse Collin	e3833e297d	liblzma: crc_clmul.c: Add crc_attr_target macro. This reduces the number of the complex #if directives.	2024-01-11 14:29:42 +02:00
Lasse Collin	d164ac0e62	liblzma: Simplify existing cases with lzma_attr_no_sanitize_address.	2024-01-11 14:29:42 +02:00
Lasse Collin	9523c1300d	liblzma: #define crc_attr_no_sanitize_address in crc_common.h.	2024-01-11 14:29:38 +02:00
Lasse Collin	93d144f093	liblzma: CRC: Add empty lines. And remove one too.	2024-01-10 17:19:03 +02:00
Lasse Collin	0c7e854ffd	liblzma: crc_clmul.c: Tidy up the location of MSVC pragma. It makes no difference in practice.	2024-01-10 17:19:03 +02:00
Lasse Collin	15cf3f04f2	Update THANKS.	2023-12-28 17:17:39 +02:00
Lasse Collin	cd64dd70d5	liblzma: Use 8-byte method in memcmplen.h on ARM64. It requires fast unaligned access to 64-bit integers and a fast instruction to count leading zeros in a 64-bit integer (__builtin_ctzll()). This perhaps should be enabled on some other archs too. Thanks to Chenxi Mao for the original patch: https://github.com/tukaani-project/xz/pull/75 (the first commit) According to the numbers there, this may improve encoding speed by about 3-5 %. This enables the 8-byte method on MSVC ARM64 too which should work but wasn't tested.	2023-12-28 17:17:39 +02:00

1 2 3 4 5 ...

2166 Commits All Branches Search

2166 Commits

All Branches