Commit graph

18 commits

Yorhel
8ad61e87c1 Stick with zstd-4 + 64k block, add --compress-level, fix 32-bit build
And do dynamic buffer allocation for bin_export, removing 128k of
.rodata that I accidentally introduced earlier and reducing memory use
for parallel scans.

Static binaries now also include the minimal version of zstd; current
sizes for x86_64 are:

  582k ncdu-2.5
  601k ncdu-new-nocompress
  765k ncdu-new-zstd

That's not great, but also not awful. Even zlib or LZ4 would've resulted
in a 700k binary.
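
As a rough sketch of the buffer change's shape (illustrative Zig, not
the actual bin_export code; all names here are made up):

  const std = @import("std");

  // Allocating the per-scan block buffer at runtime instead of embedding
  // a comptime-initialized constant array keeps the 64k block out of
  // .rodata, and only scans that actually run pay for the memory.
  fn withBlockBuffer(allocator: std.mem.Allocator) !void {
      const buf = try allocator.alloc(u8, 64 * 1024);
      defer allocator.free(buf);
      // ... compress a block into `buf` and write it out ...
  }

  test "allocate and free a block buffer" {
      try withBlockBuffer(std.testing.allocator);
  }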
2024-08-03 13:16:44 +02:00
Yorhel
cd00ae50d1 refactor: Merge sink.Special and bin_export.ItemType into model.EType
Simplifies code a little bit and saves one whole byte off of file
entries.
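
The shape of the refactor, as an illustrative sketch (hypothetical tag
values, not model.EType's actual definition):

  const std = @import("std");

  // One shared tag enum instead of two overlapping ones means a file
  // entry stores a single byte rather than one byte per enum.
  const EType = enum(u8) {
      dir,
      reg,
      link,
      special, // hypothetical: values that used to live in sink.Special
      excluded, // hypothetical: values from bin_export.ItemType
  };

  test "one byte per entry" {
      try std.testing.expect(@sizeOf(EType) == 1);
  }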
2024-08-01 14:24:56 +02:00
Yorhel
f25bc5cbf4 Experimental new export format
The goals of this format are:
- Streaming parallel export with minimal mandatory buffering.
- Exported data includes cumulative directory stats, so reader doesn't
  have to go through the entire tree to calculate these.
- Fast-ish directory listings without reading the entire file.
- Built-in compression.

Current implementation is missing compression, hardlink counting and
actually reading the file. Also need to tune and measure stuff.
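
For illustration of the cumulative-stats goal above, one record shape
that would satisfy it (made-up fields and widths, not the actual
format):

  const std = @import("std");

  // If each directory record carries its aggregated totals, a reader
  // can present sizes without walking the whole subtree first.
  const DirRecord = extern struct {
      etype: u8,
      name_len: u8,
      cum_size: u64, // cumulative apparent size
      cum_blocks: u64, // cumulative disk usage
      cum_items: u32, // entries below this directory
  };

  test "fixed-size, trivially streamable record" {
      try std.testing.expect(@sizeOf(DirRecord) == 32);
  }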
2024-07-30 14:27:41 +02:00
Yorhel
87d336baeb Add progress indicator to hardlink counting + fix import/mem UI updating 2024-07-28 10:54:58 +02:00
Yorhel
3c055810d0 Split mem import and json export out of sink.zig
Mainly to make room for another export format, though that'll take a lot
more experimenting before it'll get anywhere.
2024-07-27 11:58:08 +02:00
Yorhel
08d373881c Fix JSON export of "otherfs" excluded type
The exporter would write "othfs" while the import code was expecting
"otherfs". This bug also exists in the 1.x branch and is probably as old
as the JSON import/export feature. D'oh.

Normalized the export to use "otherfs" now (which is what all versions can
read correctly) and fixed the importer to also accept "othfs" (which
is what all previous versions exported).
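
The gist of the fix, as a tiny sketch (hypothetical helper, not the
actual import code):

  const std = @import("std");

  // Export the spelling every version reads; accept both on import.
  fn isOtherFsExclude(s: []const u8) bool {
      return std.mem.eql(u8, s, "otherfs") or std.mem.eql(u8, s, "othfs");
  }

  test "accept both spellings" {
      try std.testing.expect(isOtherFsExclude("otherfs"));
      try std.testing.expect(isOtherFsExclude("othfs"));
      try std.testing.expect(!isOtherFsExclude("kernfs"));
  }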
2024-07-24 10:30:30 +02:00
Yorhel
dc42c91619 Fix JSON export of special entries 2024-07-24 07:34:12 +02:00
Yorhel
a5e57ee5ad Fix use of u64 atomic integers on 32-bit platforms 2024-07-18 10:53:27 +02:00
Yorhel
99f92934c6 Improve JSON export performance
When you improve performance in one part of the code, another part
becomes the new bottleneck. The slow JSON writer was very noticeable
with the parallel export option.

This provides a 20% improvement on total run-time when scanning a hot
directory with 8 threads.
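
The commit doesn't show the change itself; as one example of the kind
of micro-optimization that helps a hot JSON writer, a hand-rolled
integer formatter that skips the generic std.fmt path (illustrative,
not the actual code):

  const std = @import("std");

  // Write decimal digits into `buf` back to front and return the used
  // slice. A 20-byte buffer covers the largest u64.
  fn fmtUint(buf: []u8, val: u64) []const u8 {
      var v = val;
      var i = buf.len;
      while (true) {
          i -= 1;
          buf[i] = '0' + @as(u8, @intCast(v % 10));
          v /= 10;
          if (v == 0) break;
      }
      return buf[i..];
  }

  test "format unsigned integers" {
      var buf: [20]u8 = undefined;
      try std.testing.expectEqualStrings("12345", fmtUint(&buf, 12345));
      try std.testing.expectEqualStrings("0", fmtUint(&buf, 0));
  }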
2024-07-18 07:11:32 +02:00
Yorhel
9b517f27b1 Add support for multithreaded scanning to JSON export
by scanning into memory first.
2024-07-17 16:40:02 +02:00
Yorhel
705bd8907d Move nlink count from inode map into Link node
This adds another +4 bytes* to Link nodes, but allows for the in-memory
tree to be properly exported to JSON, which we'll need for multithreaded
export. It's also slightly nicer conceptually, as we can now detect
inconsistencies without throwing away the actual data, so have a better
chance of recovering on partial refresh. Still unlikely, anyway, but
whatever.

(* but saves 4+ bytes per unique inode in the inode map, so the memory
increase is only noticeable when links are repeated in the scanned tree.
Admittedly, that may be the common case)
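
Illustrative sketch of the tradeoff (hypothetical layout, not the
actual Link node):

  const std = @import("std");

  // Keeping nlink on the node itself costs ~4 bytes per Link, but a
  // plain tree walk can now emit hardlink info without consulting the
  // inode map.
  const Link = struct {
      ino: u64,
      nlink: u32, // moved here from the inode-map entry
      next: ?*@This(), // other nodes sharing this inode (illustrative)
  };

  test "nlink travels with the node" {
      const l = Link{ .ino = 42, .nlink = 2, .next = null };
      try std.testing.expect(l.nlink == 2);
  }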
2024-07-17 14:15:53 +02:00
Yorhel
6bb31a4653 More consistent handling of directory read errors
These are now always added as a separate dir entry followed by a
setReadError() call.
JSON export can catch these cases when the error happens before any
entries are read, which is the common error scenario.
2024-07-17 09:09:04 +02:00
Yorhel
7558fd7f8e Re-add single-threaded JSON export
That was the easy part, next up is fixing multi-threaded JSON export.
2024-07-17 07:05:18 +02:00
Yorhel
d2e8dd8a90 Reimplement JSON import + minor fixes
Previous import code did not correctly handle a non-empty directory with
the "read_error" flag set. I have no clue if that can ever happen in
practice, but at least ncdu 1.x can theoretically emit such JSON so we
handle it now.

Also fixes the mtime display of "special" files, i.e. don't display the
mtime of the parent directory - that's confusing.

Split a generic-ish JSON parser out of the import code for easier
reasoning and implemented a few more performance improvements as well.
New code is ~30% faster in both ReleaseSafe and ReleaseFast.
2024-07-16 14:20:30 +02:00
Yorhel
ddbed8b07f Some fixes in mtime propagation and hardlink refresh counting 2024-07-15 11:00:14 +02:00
Yorhel
db51987446 Re-add hard link counting + parent suberror & stats propagation
Ended up turning the Links into a doubly-linked list, because the
current approach of refreshing a subdirectory makes it more likely to
run into problems with the O(n) removal behavior of singly-linked lists.

Also found a bug that was present in the old scanning code as well;
fixed here and in c41467f240.
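
The reason for the switch, sketched (minimal intrusive node, not ncdu's
actual Links type; maintenance of an external head pointer is omitted):

  const std = @import("std");

  // Unlinking a known node from a doubly-linked list is O(1); a singly-
  // linked list needs an O(n) walk to find the predecessor, which hurts
  // when a subdirectory refresh removes many nodes.
  const Node = struct {
      prev: ?*Node = null,
      next: ?*Node = null,

      fn remove(self: *Node) void {
          if (self.prev) |p| p.next = self.next;
          if (self.next) |n| n.prev = self.prev;
          self.* = .{};
      }
  };

  test "O(1) unlink of a middle node" {
      var a = Node{};
      var b = Node{};
      var c = Node{};
      a.next = &b;
      b.prev = &a;
      b.next = &c;
      c.prev = &b;
      b.remove();
      try std.testing.expect(a.next == &c and c.prev == &a);
  }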
2024-07-14 20:17:34 +02:00
Yorhel
cc12c90dbc Re-add scan progress UI + directory refreshing 2024-07-14 20:17:19 +02:00
Yorhel
f2541d42ba Rewrite scan/import code, experiment with multithreaded scanning (again)
Benchmarks are looking very promising this time. This commit breaks a
lot, though:
- Hard link counting
- Refreshing
- JSON import
- JSON export
- Progress UI
- OOM handling is not thread-safe

All of which needs to be reimplemented and fixed again. Also haven't
really tested this code very well yet so there's likely to be bugs.

There's also a behavioral change: --exclude-kernfs is not checked on the
given root directory anymore, meaning that the filesystem the user asked
to scan is being scanned even if that's a 'kernfs'. I suspect that's
more sensible behavior.

The old scan.zig was quite messy and hard for me to reason about and
extend; this new sink API is looking to be less confusing. I hope it
stays that way as more features are added.
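
The --exclude-kernfs change boils down to this (illustrative, not the
actual check):

  const std = @import("std");

  // The root of the scan is always scanned; only subdirectories on a
  // kernel-ish filesystem get skipped.
  fn skipAsKernfs(is_scan_root: bool, is_kernfs: bool) bool {
      return is_kernfs and !is_scan_root;
  }

  test "root is exempt from the kernfs check" {
      try std.testing.expect(!skipAsKernfs(true, true));
      try std.testing.expect(skipAsKernfs(false, true));
  }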
2024-07-14 20:17:18 +02:00