ncdu-zig

mirror of https://code.blicky.net/yorhel/ncdu.git synced 2026-01-13 09:18:40 -09:00

Author	SHA1	Message	Date
Yorhel	66b875eb00	Add --delete-command option Fixes #215. delete.zig's item replacement/refresh code is pretty awful and may be buggy in some edge cases. Existing refresh infrastructure wasn't designed to update an individual file.	2025-07-15 18:27:10 +02:00
Yorhel	58e6458130	Use Stat.mtime() instead of .mtim Aha, so that's why the mtime() method exists, the field has a different name on some other systems. Fixes #258	2025-05-01 14:51:11 +02:00
Yorhel	beac59fb12	Use std.c.fstatat() instead of @cImport()ed version Because translate-c can't handle struct stat as defined by musl. (Should have done this in the first place, but wasn't aware fstatat() had been properly wrapped in std.c)	2025-04-28 13:22:03 +02:00
Yorhel	35a9faadb2	Work around panic in Zig fstatat wrapper https://github.com/ziglang/zig/issues/23463	2025-04-06 10:36:36 +02:00
Eric Joldasov	93a81a3898	`ArrayList.pop` now returns optional like removed `popOrNull` See https://github.com/ziglang/zig/pull/22720 . Signed-off-by: Eric Joldasov <bratishkaerik@landless-city.net>	2025-03-03 22:46:51 +05:00
Eric Joldasov	607b07a30e	change deprecated `timespec.tv_sec` to `timespec.sec` Part of the reorganization of `std.c` namespace. See https://github.com/ziglang/zig/pull/20679 . Signed-off-by: Eric Joldasov <bratishkaerik@landless-city.net>	2025-03-03 22:46:49 +05:00
Yorhel	bd442673d2	Consolidate @cImports into a single c.zig Which is, AFAIK, a recommended practice. Reduces the number of times translate-c is being run and (most likely) simplifies a possible future transition if/when @cImport is thrown out of the language. Also uses zstd.h instead of my own definitions, mainly because I plan to use the streaming API as well and those need more definitions.	2024-10-26 14:35:05 +02:00
Yorhel	39517c01a8	Remove kernfs dev id cache Kernfs checking was previously done for every directory scanned, but the new parallel scanning code only performs the check when the dev id is different from parent, which isn't nearly as common. (In fact, in typical scenarios this only ever happens once per dev id, rendering the cache completely useless. But even people will 10k bind mounts are unlikely to notice a performance impact)	2024-08-25 09:29:41 +02:00
Yorhel	c30699f93b	Track which extended mode fields we have + bugfixes This prevents displaying invalid zero values or writing such values out in JSON/bin exports. Very old issue, actually, but with the new binfmt experiments it's finally started annoying me.	2024-08-09 18:32:47 +02:00
Yorhel	cd00ae50d1	refactor: Merge sink.Special and bin_export.ItemType into model.EType Simplifies code a little bit and saves one whole byte off of file entries.	2024-08-01 14:24:56 +02:00
Yorhel	f25bc5cbf4	Experimental new export format The goals of this format being: - Streaming parallel export with minimal mandatory buffering. - Exported data includes cumulative directory stats, so reader doesn't have to go through the entire tree to calculate these. - Fast-ish directory listings without reading the entire file. - Built-in compression. Current implementation is missing compression, hardlink counting and actually reading the file. Also need to tune and measure stuff.	2024-07-30 14:27:41 +02:00
Yorhel	6bb31a4653	More consistent handling of directory read errors These are now always added as a separate dir followed by setReadError(). JSON export can catch these cases when the error happens before any entries are read, which is the common error scenario.	2024-07-17 09:09:04 +02:00
Yorhel	cc12c90dbc	Re-add scan progress UI + directory refreshing	2024-07-14 20:17:19 +02:00
Yorhel	f2541d42ba	Rewrite scan/import code, experiment with multithreaded scanning (again) Benchmarks are looking very promising this time. This commit breaks a lot, though: - Hard link counting - Refreshing - JSON import - JSON export - Progress UI - OOM handling is not thread-safe All of which needs to be reimplemented and fixed again. Also haven't really tested this code very well yet so there's likely to be bugs. There's also a behavioral change: --exclude-kernfs is not checked on the given root directory anymore, meaning that the filesystem the user asked to scan is being scanned even if that's a 'kernfs'. I suspect that's more sensible behavior. The old scan.zig was quite messy and hard for me to reason about and extend, this new sink API is looking to be less confusing. I hope it stays that way as more features are added.	2024-07-14 20:17:18 +02:00
Yorhel	c41467f240	Fix entries getting removed when their type changes on refresh Somewhat surprised nobody reported this one yet, it is rather weird and obviously buggy behavior. A second refresh would fix it again, but still.	2024-07-14 20:01:19 +02:00
Yorhel	574a4348a3	Fix --one-file-system to exclude other-fs-symlink targets with --follow-symlinks	2024-07-12 12:36:17 +02:00
Yorhel	0215f3569d	Fix fd leak with --exclude-caches checking	2024-07-12 12:33:45 +02:00
Yorhel	6db150cc98	Fix crash on invalid utf8 when scanning in -1 UI mode	2024-05-26 11:16:22 +02:00
Yorhel	d60bcb2113	Copyright: remove year & use alias Tired of bumping files every year and slowly moving stuff to my alias.	2024-04-20 15:49:51 +02:00
Eric Joldasov	946d2a0316	src: update to standard library changes in Zig 0.12.0-dev.3385+3a836b480 * rearrangment of entries in `std.os` and `std.c`, `std.posix` finally extracted in https://github.com/ziglang/zig/pull/19354 . Signed-off-by: Eric Joldasov <bratishkaerik@landless-city.net>	2024-03-20 23:06:20 +05:00
Eric Joldasov	f03eee5443	src: update to stdlib changes in Zig 0.12.0-dev.1710+2bffd8101 * std.fs.Dir/IterableDir separation was reverted in https://www.github.com/ziglang/zig/pull/18076 , fix breaks ability to compile with Zig 0.11.0. It was planned since at least October, 16th: https://github.com/ziglang/zig/pull/12060#issuecomment-1763671541 . Signed-off-by: Eric Joldasov <bratishkaerik@landless-city.net>	2024-03-20 23:02:38 +05:00
Yorhel	a2eb84e7d3	Update parent dir suberr on refresh Fixes #233	2023-12-05 12:06:14 +01:00
Eric Joldasov	c83159f076	fix new "var never mutated" error on Zig 0.12.0-dev.1663+6b1a823b2 Fixes these errors (introduced in https://github.com/ziglang/zig/pull/18017 and `6b1a823b2b` ): ``` src/main.zig:290:13: error: local variable is never mutated var line_ = line_fbs.getWritten(); ^~~~~ src/main.zig:290:13: note: consider using 'const' src/main.zig:450:17: error: local variable is never mutated var path = std.fs.path.joinZ(allocator, &.{p, "ncdu", "config"}) catch unreachable; ^~~~ src/main.zig:450:17: note: consider using 'const' ... ``` Will be included in future Zig 0.12, this fix is backward compatible: ncdu still builds and runs fine on Zig 0.11.0. Signed-off-by: Eric Joldasov <bratishkaerik@getgoogleoff.me>	2023-11-20 14:45:02 +06:00
Eric Joldasov	ab6dc5be75	Update to Zig 0.11.0 Signed-off-by: Eric Joldasov <bratishkaerik@getgoogleoff.me>	2023-08-04 14:41:49 +06:00
Yorhel	e6cfacfa06	scan.zig: Add explicit cast for struct statfs.f_type Hopefully fixes #221.	2023-04-02 11:58:41 +02:00
Yorhel	ec233ff33a	Version 2.2.2 + copyright year bump	2023-01-19 08:00:27 +01:00
Yorhel	c002d9fa92	Work around a Zig ReleaseSafe mode performance regression With a little help from IRC: <ifreund> Ayo: its probaly stupidly copying that array to the stack to do the safety check, pretty sure there's an open issue on this still <ifreund> you may be able to work around the compiler's stupidity by using a pointer to the array or slice or something <Ayo> ifreund: Yup, (&self.rdbuf)[self.rdoff] does the trick, thanks. <ifreund> no problem! should get fixed eventually	2023-01-11 10:39:49 +01:00
Yorhel	890e5a4af7	Slightly less hacky Entry struct allocation and initialization	2022-11-02 14:39:05 +01:00
Yorhel	91281ef11f	Use extern instead of packed structs for the data model Still using a few embedded packed structs for those fields that benefit from bit packing. This isn't much cleaner than using packed structs for everything, but it does have better semantics. In particular, all fields (except those inside nested packed structs) are now guaranteed to be byte-aligned and I don't have to worry about the memory representation of integers when pointer-casting between the different Entry types.	2022-11-02 11:32:35 +01:00
Yorhel	f7e774ee6e	Fixes for stdlib changes	2022-10-26 13:34:06 +02:00
Torbjörn Lönnemark	d6728bca95	Fix incorrect format string causing invalid export files Zig requires alignment to be specified when specifying a fill character, as otherwise digits specified after ':' are interpreted as part of the field width. The missing alignment specifier caused character codes < 0x10 to be serialized incorrectly, producing an export file ncdu could not import. For example, a character with code 1 would be serialized as '\u00 1' instead of '\u0001'. A directory of test files can be generated using: mkdir test_files; i=1; while [ $i -le 255 ]; do c="$(printf "$(printf "\\\\x%02xZ" "$i")")"; c="${c%Z}"; touch "test_files/$c"; i=$((i+1)); done	2022-10-15 21:00:17 +02:00
Yorhel	d523a77fdc	Improve exclude pattern matching performance (and behavior, a bit) Behavioral changes: - A single wildcard ('') does not cross directory boundary anymore. Previously 'ab' would also match 'a/b', but no other tool that I am aware of matches paths that way. This change breaks compatibility with old exclude patterns but improves consistency with other tools. - Patterns with a trailing '/' now prevent recursing into the directory. Previously any directory excluded with such a pattern would show up as a regular directory with all its contents excluded, but now the directory entry itself shows up as excluded. - If the path given to ncdu matches one of the exclude patterns, the old implementation would exclude every file/dir being read, this new implementation would instead ignore the rule. Not quite sure how to best handle this case, perhaps just exit with an error message? Performance wise, I haven't yet found a scenario where this implementation is slower than the old one and it's significantly faster in some cases - in particular when using a large amount of patterns, especially with literal paths and file names. That's not to say this implementation is anywhere near optimal: - A list of relevant patterns is constructed for each directory being scanned. It may be possible to merge pattern lists that share the same prefix, which could both reduce memory use and the number of patterns that need to be matched upon entering a directory. - A hash table with dynamic arrays as values is just garbage from a memory allocation point of view. - This still uses libc fnmatch(), but there's an opportunity to precompile patterns for faster matching.	2022-08-10 09:46:39 +02:00
Yorhel	5f383966a9	Fix bad assertion in scan.zig:addSpecial() While it's true that the root item can't be a special, the first item to be added is not necessarily the root item. In particular, it isn't when refreshing. Probably fixes #194	2022-03-24 07:32:55 +01:00
Yorhel	e42db579a0	scan: Add UI message when counting hard links That usually doesn't take longer than a few milliseconds, but it can take a few seconds for some extremely large dirs, on very slow computers or with optimizations disabled. Better display a message than make it seem as if ncdu has stopped doing anything.	2022-02-05 09:19:15 +01:00
Yorhel	01f1e9188a	Version 2.0.1 + copyright year bump	2022-01-01 16:01:47 +01:00
Yorhel	6a68cd9b89	Fixes and updates for Zig 0.9.0	2021-12-21 10:34:44 +01:00
Yorhel	5b462cfb7a	Fix export feature ...by making sure that Context.parents is properly initialized to null when not scanning to RAM. Fixes #179.	2021-11-02 15:29:12 +01:00
Yorhel	929cc75675	Fix import of "special" dirs and excluded items	2021-10-06 14:32:02 +02:00
Yorhel	4b1da95835	Add configuration file support	2021-10-06 11:05:56 +02:00
Yorhel	3acab71fce	Fix reporting of fatal scan error in -0 or -1 UIs	2021-07-28 11:13:03 +02:00
Yorhel	0d314ca0ca	Implement a more efficient hard link counting approach As aluded to in the previous commit. This approach keeps track of hard links information much the same way as ncdu 1.16, with the main difference being that the actual /counting/ of hard link sizes is deferred until the scan is complete, thus allowing the use of a more efficient algorithm and amortizing the counting costs. As an additional benefit, the links listing in the information window now doesn't need a full scan through the in-memory tree anymore. A few memory usage benchmarks: 1.16 2.0-beta1 this commit root: 429 162 164 backup: 3969 1686 1601 many links: 155 194 106 many links2: 155 602 106 (I'm surprised my backup dir had enough hard links for this to be an improvement) ( this is the same as the "many links" benchmarks, but with a few parent directories added to increase the tree depth. 2.0-beta1 doesn't like that at all) Performance-wise, refresh and delete operations can still be improved a bit.	2021-07-28 10:35:56 +02:00
Yorhel	36bc405a69	Add parent node pointers to Dir struct + remove Parents abstraction While this simplifies the code a bit, it's a regression in the sense that it increases memory use. This commit is yak shaving for another hard link counting approach I'd like to try out, which should be a LOT less memory hungry compared to the current approach. Even though it does, indeed, add an extra cost of these parent node pointers.	2021-07-26 14:03:10 +02:00
Yorhel	a915fc0836	Fix counting of sizes for new directories	2021-07-19 16:58:34 +02:00
Yorhel	b96587c25f	scan: Don't allocate directory iterator on the stack I had planned to checkout out async functions here so I could avoid recursing onto the stack alltogether, but it's still unclear to me how to safely call into libc from async functions so let's wait for all that to get fleshed out a bit more.	2021-07-18 16:43:02 +02:00
Yorhel	c8636b8982	Add REUSE-compliant copyright headers	2021-07-18 11:50:50 +02:00
Yorhel	ee92f403ef	Add Makefile with some standard/handy tools + a failed initial attempt at producing static binaries.	2021-07-18 09:40:59 +02:00
Yorhel	3a21dea2cd	Implement file deletion + a bunch of bug fixes	2021-07-16 16:18:13 +02:00
Yorhel	6c2ab5001c	Implement directory refresh This complicated the scan code more than I had anticipated and has a few inherent bugs with respect to calculating shared hardlink sizes. Still, the merge approach avoids creating a full copy of the subtree, so that's another memory usage related win compared to the C version. On the other hand, it does leak memory if nodes can't be reused. Not quite as well tested as I should have, so I'm sure there's bugs.	2021-07-13 13:45:08 +02:00
Yorhel	ff3e3bccc6	Add link path listing to information window Two differences compared to the C version: - You can now select individual paths in the listing, pressing enter will open the selected path in the browser window. - Creating this listing is much slower and requires, in the worst case, a full traversal through the in-memory tree. I've tested this without the same-dev and shared-parent optimizations (i.e. worst case) on an import with 30M files and performance was still quite acceptable - the listing completed in a second - so I didn't bother adding a loading indicator. On slower systems and even larger trees this may be a little annoying, though. (also, calling nonl() apparently breaks detection of the return key, neither \n nor KEY_ENTER are emitted for some reason)	2021-07-06 18:33:31 +02:00
Yorhel	cc1966d6a9	Make some space for shared size in UI + speed up JSON import a bit It still feels kind of sluggish, but not entirely sure how to improve it.	2021-06-01 16:14:01 +02:00

1 2

64 commits