mirrors/zig - "Borealis" Git by INX: Hosted by INX "Xenon".

mirror of https://codeberg.org/ziglang/zig.git synced 2025-12-06 13:54:21 +00:00

Author	SHA1	Message	Date
Kendall Condon	93775de45f	rework fuzz testing to be smith based -- On the standard library side: The `input: []const u8` parameter of functions passed to `testing.fuzz` has changed to `smith: testing.Smith`. `Smith` is used to generate values from libfuzzer or input bytes generated by libfuzzer. `Smith` contains the following base methods: `value` as a generic method for generating any type * `eos` for generating end-of-stream markers. Provides the additional guarantee `true` will eventually by provided. * `bytes` for filling a byte array. * `slice` for filling part of a buffer and providing the length. `Smith.Weight` is used for giving value ranges a higher probability of being selected. By default, every value has a weight of zero (i.e. they will not be selected). Weights can only apply to values that fit within a u64. The above functions have corresponding ones that accept weights. Additionally, the following functions are provided: * `baselineWeights` which provides a set of weights containing every possible value of a type. * `eosSimpleWeighted` for unique weights for `true` and `false` * `valueRangeAtMost` and `valueRangeLessThan` for weighing only a range of values. -- On the libfuzzer and abi side: --- Uids These are u32s which are used to classify requested values. This solves the problem of a mutation causing a new value to be requested and shifting all future values; for example: 1. An initial input contains the values 1, 2, 3 which are interpreted as a, b, and c respectively by the test. 2. The 1 is mutated to a 4 which causes the test to request an extra value interpreted as d. The input is now 4, 2, 3, 5 (new value) which the test corresponds to a, d, b, c; however, b and c no longer correspond to their original values. Uids contain a hash component and type component. The hash component is currently determined in `Smith` by taking a hash of the calling `@returnAddress()` or via an argument in the corresponding `WithHash` functions. The type component is used extensively in libfuzzer with its hashmaps. --- Mutations At the start of a cycle (a run), a random number of values to mutate is selected with less being exponentially more likely. The indexes of the values are selected from a selected uid with a logarithmic bias to uids with more values. Mutations may change a single values, several consecutive values in a uid, or several consecutive values in the uid-independent order they were requested. They may generate random values, mutate from previous ones, or copy from other values in the same uid from the same input or spliced from another. For integers, mutations from previous ones currently only generates random values. For bytes, mutations from previous mix new random data and previous bytes with a set number of mutations. --- Passive Minimization A different approach has been taken for minimizing inputs: instead of trying a fixed set of mutations when a fresh input is found, the input is instead simply added to the corpus and removed when it is no longer valuable. The quality of an input is measured based off how many unique pcs it hit and how many values it needed from the fuzzer. It is tracked which inputs hold the best qualities for each pc for hitting the minimum and maximum unique pcs while needing the least values. Once all an input's qualities have been superseded for the pcs it hit, it is removed from the corpus. -- Comparison to byte-based smith A byte-based smith would be much more inefficient and complex than this solution. It would be unable to solve the shifting problem that Uids do. It is unable to provide values from the fuzzer past end-of-stream. Even with feedback, it would be unable to act on dynamic weights which have proven essential with the updated tests (e.g. to constrain values to a range). -- Test updates All the standard library tests have been updated to use the new smith interface. For `Deque`, an ad hoc allocator was written to improve performance and remove reliance on heap allocation. `TokenSmith` has been added to aid in testing Ast and help inform decisions on the smith interface.	2025-11-23 14:58:22 -05:00
Matthew Lugg	010dcd6a9b	fuzzer: account for runtime address slide This is relevant to PIEs, which are notably enabled by default on macOS. The build system needs to only see virtual addresses, that is, those which do not have the slide applied; but the fuzzer itself naturally sees relocated addresses (i.e. with the slide applied). We just need to subtract the slide when we communicate addresses to the build system.	2025-11-20 10:42:20 +00:00
Matthew Lugg	0caca625eb	std.debug: split up Mach-O debug info handling Like ELF, we now have `std.debug.MachOFile` for the host-independent parts, and `std.debug.SelfInfo.MachO` for logic requiring the file to correspond to the running program.	2025-11-20 10:42:20 +00:00
Alex Rønne Petersen	9ab7eec23e	represent Mac Catalyst as aarch64-maccatalyst-none rather than aarch64-ios-macabi Apple's own headers and tbd files prefer to think of Mac Catalyst as a distinct OS target. Earlier, when DriverKit support was added to LLVM, it was represented a distinct OS. So why Apple decided to only represent Mac Catalyst as an ABI in the target triple is beyond me. But this isn't the first time they've ignored established target triple norms (see: armv7k and aarch64_32) and it probably won't be the last. While doing this, I also audited all Darwin OS prongs throughout the codebase and made sure they cover all the tags.	2025-11-14 11:33:35 +01:00
Matthew Lugg	92bc619c49	std.debug: allow fp unwind from context It's easy to do FP unwinding from a CPU context: you just report the captured ip/pc value first, and then unwind from the captured fp value. All this really needed was a couple of new functions on the `std.debug.cpu_context` implementations so that we don't need to rely on `std.debug.Dwarf` to access the captured registers. Resolves: #25576	2025-11-12 21:02:38 +00:00
qilme	8347791ce3	std.os.windows: eliminate forwarder function in kernel32 (#25766 ) #1840 kernel32.AddVectoredExceptionHandler -> ntdll.RtlAddVectoredExceptionHandler kernel32.RemoveVectoredExceptionHandler -> ntdll.RtlRemoveVectoredExceptionHandler kernel32.ExitProcess -> ntdll.RtlExitUserProcess kernel32.InitializeCriticalSection -> ntdll.RtlInitializeCriticalSection kernel32.EnterCriticalSection -> ntdll.RtlEnterCriticalSection kernel32.LeaveCriticalSection -> ntdll.RtlLeaveCriticalSection kernel32.DeleteCriticalSection -> ntdll.RtlDeleteCriticalSection kernel32.TryAcquireSRWLockExclusive -> ntdll.RtlTryAcquireSRWLockExclusive kernel32.AcquireSRWLockExclusive -> ntdll.RtlAcquireSRWLockExclusive kernel32.ReleaseSRWLockExclusive -> ntdll.RtlReleaseSRWLockExclusive kernel32.WakeConditionVariable -> ntdll.RtlWakeConditionVariable kernel32.WakeAllConditionVariable -> ntdll.RtlWakeAllConditionVariable kernel32.HeapReAlloc -> ntdll.RtlReAllocateHeap kernel32.HeapAlloc -> ntdll.RtlAllocateHeap	2025-10-31 13:54:50 +00:00
Matthew Lugg	74931fe25c	std.debug.lockStderrWriter: also return ttyconf `std.Io.tty.Config.detect` may be an expensive check (e.g. involving syscalls), and doing it every time we need to print isn't really necessary; under normal usage, we can compute the value once and cache it for the whole program's execution. Since anyone outputting to stderr may reasonably want this information (in fact they are very likely to), it makes sense to cache it and return it from `lockStderrWriter`. Call sites who do not need it will experience no significant overhead, and can just ignore the TTY config with a `const w, _` destructure.	2025-10-30 09:31:28 +00:00
Andrew Kelley	a072d821be	Merge pull request #25592 from ziglang/init-std.Io std: Introduce `Io` Interface	2025-10-29 13:51:37 -07:00
Alex Rønne Petersen	a7119d4269	remove all IBM AIX and z/OS support As with Solaris (`dba1bf9353`), we have no way to actually audit contributions for these OSs. IBM also makes it even harder than Oracle to actually obtain these OSs. closes #23695 closes #23694 closes #3655 closes #23693	2025-10-29 14:25:51 +01:00
Andrew Kelley	8b269f7e18	std: make signal numbers into an enum fixes start logic for checking whether IO/POLL exist	2025-10-29 06:20:51 -07:00
Andrew Kelley	46f7e3ea9f	std.Io.Threaded: add ioBasic which disables networking	2025-10-29 06:20:51 -07:00
Andrew Kelley	aadd8d4a3e	std: back out the StackTrace byval changes Let's keep passing this thing by pointer	2025-10-29 06:20:50 -07:00
Andrew Kelley	10b1eef2d3	std: fix compilation errors on Windows	2025-10-29 06:20:50 -07:00
Andrew Kelley	89412fda77	std.Io: implement fileStat	2025-10-29 06:20:48 -07:00
Alex Rønne Petersen	dba1bf9353	remove all Oracle Solaris support There is no straightforward way for the Zig team to access the Solaris system headers; to do this, one has to create an Oracle account, accept their EULA to download the installer ISO, and finally install it on a machine or VM. We do not have to jump through hoops like this for any other OS that we support, and no one on the team has expressed willingness to do it. As a result, we cannot audit any Solaris contributions to std.c or other similarly sensitive parts of the standard library. The best we would be able to do is assume that Solaris and illumos are 100% compatible with no way to verify that assumption. But at that point, the solaris and illumos OS tags would be functionally identical anyway. For Solaris especially, any contributions that involve APIs introduced after the OS was made closed-source would also be inherently more risky than equivalent contributions for other proprietary OSs due to the case of Google LLC v. Oracle America, Inc., wherein Oracle clearly demonstrated its willingness to pursue legal action against entities that merely copy API declarations. Finally, Oracle laid off most of the Solaris team in 2017; the OS has been in maintenance mode since, presumably to be retired completely sometime in the 2030s. For these reasons, this commit removes all Oracle Solaris support. Anyone who still wishes to use Zig on Solaris can try their luck by simply using illumos instead of solaris in target triples - chances are it'll work. But there will be no effort from the Zig team to support this use case; we recommend that people move to illumos instead.	2025-10-27 07:35:38 -07:00
Alex Rønne Petersen	d8cb8b7bae	std.debug: fix FP unwinding for hppa/hppa64	2025-10-23 19:34:02 +02:00
Alex Rønne Petersen	c13355abda	std.debug: fix FP unwind progress check for stackGrowth() == .up targets	2025-10-23 19:34:02 +02:00
Alex Rønne Petersen	a689c38197	std.debug: FP unwinding is impossible on alpha, microblaze, sh	2025-10-23 19:34:02 +02:00
Alex Rønne Petersen	38caa4902f	Merge pull request #25623 from alexrp/or1k Add `or1k-linux` support (via CBE)	2025-10-19 11:50:06 +02:00
GasInfinity	1bca158c6e	fix(std): don't add the default `_start` and `panic` in homebrew targets * even if std supported those targets, they're not posixy to be in that codepath.	2025-10-18 23:54:27 +02:00
Alex Rønne Petersen	49cd0e6f7c	std.debug: fix frame pointer unwinding on or1k	2025-10-18 22:27:35 +02:00
Alex Rønne Petersen	2eca0e42e5	std.debug: FP-based unwinding is impossible on avr, csky, msp430, and xcore The ABIs do not define a frame pointer register, nor do they define a guaranteed and fixed area on the stack where one might find saved registers such as a frame pointer or return address.	2025-10-18 00:36:52 +02:00
Alex Rønne Petersen	e0f10da270	std.debug: FP-based unwinding is ideal on SPARC The way SPARC works due to its ABI built around register windows means that we can always do fast FP-based unwinding.	2025-10-15 13:59:17 +02:00
Alex Rønne Petersen	dd7819220a	std.debug: fix return addresses being off on SPARC The return address points to the call instruction on SPARC, so the actual return address is 8 bytes after. This means that we shouldn't do the return address adjustment that we normally do.	2025-10-15 13:59:17 +02:00
Alex Rønne Petersen	912fed3380	std.debug: use the SP as the initial FP on SPARC The FP would point to the register save area for the previous frame, while the SP points to the register save area for the current frame. So use the latter.	2025-10-15 13:59:17 +02:00
Alex Rønne Petersen	6de2d61a0c	std.debug: work around latest SPARC register window not being spilled on signal I have no idea if this is a QEMU bug or real kernel behavior. Either way, the register save area specifically exists for asynchronous spilling of incoming and local registers, so there should be no harm in doing this.	2025-10-15 13:59:17 +02:00
Alex Rønne Petersen	78bc5d46e0	std.debug: the SPARC stack bias is only used on the 64-bit ABI	2025-10-15 13:59:17 +02:00
Alex Rønne Petersen	ebc0b90eb7	std.debug: rename some constants for clarity	2025-10-15 13:59:17 +02:00
Alex Rønne Petersen	62a8cfd5fe	std.debug: fix an invalid read in StackIterator.next() We're overwriting the memory that unwind_context sits in, so we need to do the getFp() call earlier.	2025-10-15 13:59:17 +02:00
Alex Rønne Petersen	b8dd40fde8	std.debug.cpu_context.Sparc: flush register windows in current() It's better to do this here than in StackIterator.init() so that std.debug.cpu_context.Native.current() isn't a footgun on SPARC.	2025-10-15 13:59:17 +02:00
Alex Rønne Petersen	3d3aff0da9	std.debug: flush SPARC register windows from a new window flushw and ta 3 flush all windows except the current one. So we need to do this in a new register window to get all of the ones we care about.	2025-10-15 13:59:17 +02:00
Jacob Young	2e31077fe0	Coff: implement threadlocal variables	2025-10-10 22:47:47 -07:00
Alex Rønne Petersen	f33d3a5166	std.debug: greatly expand target support for segfault handling/unwinding I made a couple of decisions for this based on the fact that we don't expose the signal_ucontext_t type outside of the file: * Adding all the floating point and vector state to every ucontext_t and mcontext_t variant was way, way too much work, especially when we don't even use the stuff. So I deleted all that and kept only the bare minimum needed to reach into general-purpose registers. * There is no particularly compelling reason to stick to the naming and struct nesting used in the system headers. So we can actually unify the access patterns for almost all of these variants by taking some liberties here; as a result, fromPosixSignalContext() is now much nicer to read and extend.	2025-10-10 04:43:15 +02:00
Alex Rønne Petersen	3f5e782357	std.debug: fix FP unwinding for LoongArch	2025-10-09 20:43:32 +02:00
Alex Rønne Petersen	98f0bf9b67	std.debug: fix SelfInfo default for freestanding ELF targets	2025-10-09 20:43:32 +02:00
mlugg	80f6b8c4b3	std.debug: fix incorrect FP unwinding on RISC-V and SPARC I broke this when porting this logic for the `std.debug` rework in https://github.com/ziglang/zig/pull/25227. The offset that I copied was actually being treated as relative to the address of the saved base pointer. I think it makes more sense to do what I did and just treat all offsets as relative to this frame's base.	2025-10-09 19:31:44 +01:00
Alex Rønne Petersen	fdd109420d	std.debug: add noinline to functions that capture the current stack trace Fixes stack traces missing a frame depending on inlining decisions. ref https://github.com/ziglang/zig/issues/25418	2025-10-07 16:47:57 +02:00
Alex Rønne Petersen	9760068826	std.debug: prefer FP unwinding on targets where it is ideal If the ABI requires a backchain pointer, FP unwinding is always possible, safe, and fast, so there's really no reason not to use it.	2025-10-07 16:44:25 +02:00
Alex Rønne Petersen	e6e4792a58	std.debug: completely disable FP-based unwinding on mips	2025-10-05 07:18:50 +02:00
Alex Rønne Petersen	b54bdace75	Merge pull request #25457 from linusg/more-serenity std.debug: Add unwind support for serenity	2025-10-04 07:09:59 +02:00
Alex Rønne Petersen	9dbfa5b294	std.debug: consider FP-based unwinding on hexagon and powerpc safe The ABIs make this safe and reliable due to their backchain requirements.	2025-10-04 03:22:40 +02:00
Alex Rønne Petersen	d8268fac98	std.debug: fix FP-based unwinding on powerpc64 This just needs to do the same thing as powerpc64le. Note that the saved LR is at the same position in both ELF v1 and v2.	2025-10-04 03:03:54 +02:00
Linus Groh	b0f280f4a4	std.debug: Add unwind support for serenity	2025-10-03 22:59:40 +01:00
Alex Rønne Petersen	0f56d7afe2	std.debug: use correct return address offset for s390x Makes FP-based unwinding work.	2025-10-03 03:29:20 +02:00
Alex Rønne Petersen	771410cbf2	std.debug.SelfInfo: rename Darwin to MachO	2025-10-01 23:47:47 +02:00
Alex Rønne Petersen	e1fb662f60	std.debug: don't use SelfInfo.Windows for UEFI It is, in fact, Windows-only.	2025-10-01 23:47:47 +02:00
Alex Rønne Petersen	59633e54a2	std.debug: select SelfInfo using ObjectFormat.default()	2025-10-01 23:47:47 +02:00
mlugg	1120546f72	std.debug.SelfInfo: remove shared logic There were only a few dozen lines of common logic, and they frankly introduced more complexity than they eliminated. Instead, let's accept that the implementations of `SelfInfo` are all pretty different and want to track different state. This probably fixes some synchronization and memory bugs by simplifying a bunch of stuff. It also improves the DWARF unwind cache, making it around twice as fast in a debug build with the self-hosted x86_64 backend, because we no longer have to redundantly go through the hashmap lookup logic to find the module. Unwinding on Windows will also see a slight performance boost from this change, because `RtlVirtualUnwind` does not need to know the module whatsoever, so the old `SelfInfo` implementation was doing redundant work. Lastly, this makes it even easier to implement `SelfInfo` on freestanding targets; there is no longer a need to emulate a real module system, since the user controls the whole implementation! There are various other small refactors here in the `SelfInfo` implementations as well as in the DWARF unwinding logic. This change turned out to make a lot of stuff simpler!	2025-09-30 14:18:26 +01:00
mlugg	156cd8f678	std.debug: significantly speed up capturing stack traces By my estimation, these changes speed up DWARF unwinding when using the self-hosted x86_64 backend by around 7x. There are two very significant enhancements: we no longer iterate frames which don't fit in the stack trace buffer, and we cache register rules (in a fixed buffer) to avoid re-parsing and evaluating CFI instructions in most cases. Alongside this are a bunch of smaller enhancements, such as pre-caching the result of evaluating the CIE's initial instructions, avoiding re-parsing of CIEs, and big simplifications to the `Dwarf.Unwind.VirtualMachine` logic.	2025-09-30 13:44:56 +01:00
mlugg	b0f222777c	std.debug: cap total stack trace frames ...just in case there is broken debug info and/or bad values on the stack, either of which could cause stack unwinding to potentially loop forever.	2025-09-30 13:44:56 +01:00

1 2 3 4 5 ...

552 commits